Is There a Power Play Overhang?


This post is about risks in the development of increasingly capable AI, in particular the risk of losing control to AI and the risk of extinction. I’ll suggest that a key question is: “When do we need to start taking this kind of risk seriously?”

We’ll look at the issue of ‘agency overhang’, which suggests that adding agent-like abilities to AI could result in a sudden and surprising increase in these kinds of risks.

I’ll draw on intuitions about humans taking administrative and political control (with reference to the ‘Dictator Book Club’) and rephrase agency overhang as ‘power play overhang’.

I’ll finish by suggesting that a lot of people may be making a subtle but important mistake in imagining just one fairly specific path to dangerous AI.
