Whenever I use the colloquial phrase “the AI believes a false X” I mean that we are using utility indifference to accomplish that goal, without actually giving the AI false beliefs.
Whenever I use the colloquial phrase “the AI believes a false X” I mean that we are using utility indifference to accomplish that goal, without actually giving the AI false beliefs.