Eh I think it seems somewhat easy for an AI to understand what our wish is at a common sense level, gpt4 can clearly understand to a degree. However, it’s yet to be proven if we can make them care about it (i.e. no deceptive alignment).
Part of the problem is that humans themselves are often bad at knowing what they want. :/
Eh I think it seems somewhat easy for an AI to understand what our wish is at a common sense level, gpt4 can clearly understand to a degree. However, it’s yet to be proven if we can make them care about it (i.e. no deceptive alignment).
Part of the problem is that humans themselves are often bad at knowing what they want. :/