The catastrophic convergence conjecture was originally formulated in terms of “outer alignment catastrophes tending to come from power-seeking behavior.” I think that this was a mistake: I meant to talk about impact alignment catastrophes tending to be caused by power-seeking. I’ve updated the post accordingly.
The catastrophic convergence conjecture was originally formulated in terms of “outer alignment catastrophes tending to come from power-seeking behavior.” I think that this was a mistake: I meant to talk about impact alignment catastrophes tending to be caused by power-seeking. I’ve updated the post accordingly.