ryan_greenblatt comments on We might be missing some key feature of AI takeoff; it’ll probably seem like “we could’ve seen this coming”

ryan_greenblatt 10 May 2024 15:20 UTC
5 points
My guess is that this is probably right given some non-trivial, but not insane, countermeasures, though those countermeasures may not actually be employed in practice.

(E.g. countermeasures comparable in cost and difficulty to Google’s mechanisms for ensuring security and reliability. These required substantial work and some iteration but no fundamental advances.)

I currently think of one of my specialties as making sure that these countermeasures, and tests of them, are in place.

(This is broadly what we’re trying to get at in the AI control post.)