I found this post valuable for better understanding the perspectives of AI experts who aren’t concerned about alignment (my rough take is “they think the alignment problem will be easy, and that the control problem will be easy enough to patch any gaps”). I’ve also found it useful for updating my intuitions about worlds where the people working on TAI are not cautious enough about safety. It’s nudged me towards thinking that most of the risk comes from worlds with subtle alignment problems, and that people would notice obvious ones.
And I appreciate Ben writing this up—a Facebook thread is a terrible format for a public debate, and I would never have come across this otherwise!