I wrote a review here. There, I identify the main generators of Christiano’s disagreement with Yudkowsky[1] and add some critical commentary. I also frame it in terms of a broader debate in the AI alignment community.
I divide those generators into “takeoff speeds”, “attitude towards prosaic alignment” and “the metadebate” (the last one is about what kind of debate norms we should have about this, or what kinds of arguments we should listen to).