How do we teach human children to not behave deceptively?
If we can do it with humans, why doesn’t that scale to AI? And if we can’t do it with humans, what factors keep it in check among humans, and how can those factors scale for AI?
We don’t. Humans lie constantly when we can get away with it. It is generally expected in society that humans will lie to preserve people’s feelings, lie to avoid awkwardness, and commit small deceptions for personal gain (though this third one is less often said out loud). Some humans do much worse than this.
What keeps it in check is that very few humans have the ability to destroy large parts of the world, and no human has the ability to destroy everyone else in the world and still be left with a world where they can survive and optimally pursue their goals afterwards. As long as no plan can achieve this for a human, humans being able to lie doesn’t make the danger any worse.