While that's largely true, I don't think it captures the essence of the problem.
To me, the hardest part of the problem is that there is probably some capability threshold below which every agent is essentially safe, and above which increasingly many agents are increasingly unsafe. In the overall scheme of things, that threshold is probably not far above the capability of an average human.
Making it worse, increasingly many AI agents may self-improve or create successor agents that are vastly more capable (and therefore also vastly more likely to be unsafe: a catastrophic combination).