I would be more sympathetic if you made a move like, “I’ll accept continuity through the human range of intelligence, and that we’ll only have to align systems as collectively powerful as humans, but I still think that hands-on experience is only...” In particular, I think there is a real disagreement about the relative value of experimenting on future dangerous systems versus working on theory or trying to carefully construct analogous situations today by thinking in detail about future alignment difficulties.