I was confused for a moment. You start out by saying there’s no alternative to CEV, then end up by saying there’s a consensus that CEV isn’t a good first alignment target.
Doesn’t that mean that whether or how to pursue CEV isn’t relevant to whether we live or die? It seems like we should focus on the alignment targets we’ll pursue first, and leave CEV and the deeper nature of values and preferences for the Long Reflection, if we can arrange to get one.
I certainly hope you’re right that there’s a de-facto consensus that CEV/value alignment probably isn’t relevant for our first do-or-die shots at alignment. It sure looks that way to me, so I’d like to see more LW brainpower going toward detailed analyses of the alignment schemes on which we’re most likely to bet the future.
I think it’s still relevant because it creates a rallying point around what to do after you’ve made substantial progress aligning AGI, which helps coordination in the run-up to it, but I agree that most effort should go into other approaches.