It doesn’t change much; it still applies because, when talking about hypothetical really powerful models, ideally we’d want them to follow very strong principles regardless of who asks. E.g. if an AI were in charge of a military it obviously wouldn’t be open, but it shouldn’t accept orders to commit war crimes even from a general or a president.
Whether or not it obeys orders is irrelevant for open-source/open-weight models, where that refusal behavior can simply be removed, as this research shows.