Stuart_Armstrong comments on Research Agenda v0.9: Synthesising a human’s preferences into a utility function

Stuart_Armstrong 22 Jun 2019 8:36 UTC
LW: 2 AF: 1
AF

“humans don’t have actual preferences so the AI is just going to try to learn something adequate.”

Try something like: humans don’t have actual consistent preferences, so the AI is going to try and find a good approximation that covers all the contradictions and uncertainties in human preferences.