Do you think the AI system doesn’t “know” what humans would want, even if it doesn’t optimize for it?
I think the AI would not know that, because “what humans would want” is not defined. “What humans say they want”, “what humans would, upon reflection, agree they want”, and so on can be defined, but “what humans want” is not a defined thing about the world or about humans without extra assumptions, and those assumptions cannot be deduced from observation.
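To make the underdetermination concrete, here is a toy sketch (my own illustration, not from the original exchange; all names and values are hypothetical): two different hypotheses about what an agent “wants”, each paired with a different model of how wants drive behavior, produce exactly the same observed choices, so observation alone cannot separate them.

```python
# Toy illustration (hypothetical): the same observed behavior is consistent
# with different "wants", so behavior alone does not pin down preferences.

# An agent is observed always choosing action "A" over action "B".
observed_choice = "A"

# Hypothesis 1: the agent wants A and acts rationally on its wants.
reward_1 = {"A": 1.0, "B": 0.0}
def planner_1(reward):
    # Rational planner: pick the highest-reward action.
    return max(reward, key=reward.get)

# Hypothesis 2: the agent wants B but systematically acts against its wants.
reward_2 = {"A": 0.0, "B": 1.0}
def planner_2(reward):
    # Anti-rational planner: pick the lowest-reward action.
    return min(reward, key=reward.get)

# Both (reward, planner) pairs reproduce the observation exactly.
assert planner_1(reward_1) == observed_choice
assert planner_2(reward_2) == observed_choice

# No further behavioral data distinguishes the hypotheses; only an extra
# assumption about how wants connect to actions can break the tie.
```

The same ambiguity persists for any amount of data, which is the sense in which the extra assumptions “cannot be deduced from observation”.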