I guess I would lean towards saying that once powerful AI systems exist, we’ll need powerful aligned systems relatively fast in order to develop against them, otherwise we’ll be screwed. In other words, AI arms race dynamics push us towards a world where systems are deployed with an insufficient amount of testing and this provides one path for us to fall victim to an AI system that you might have expected iterative design to catch.
I guess I would lean towards saying that once powerful AI systems exist, we’ll need powerful aligned systems relatively fast in order to develop against them, otherwise we’ll be screwed. In other words, AI arms race dynamics push us towards a world where systems are deployed with an insufficient amount of testing and this provides one path for us to fall victim to an AI system that you might have expected iterative design to catch.