Explicitly figuring out what our values are and formalizing them, is only one possible sequence of steps to get AI with our values.
It seems like most people don’t think that this approach will work. So there are a number of proposals to use AI itself to assist in this process. E.g. “motivational scaffolding” sounds like it solves the second step (formalizing the values.)
Explicitly figuring out what our values are and formalizing them, is only one possible sequence of steps to get AI with our values.
It seems like most people don’t think that this approach will work. So there are a number of proposals to use AI itself to assist in this process. E.g. “motivational scaffolding” sounds like it solves the second step (formalizing the values.)