The super simple claim is:
If an unaligned AI by itself can do near-world-ending damage, an identically powerful AI that is instead alignable to a specific person can do the same damage.
I agree that it could likely do damage, but it does cut off the branches of the risk tree where doing damage requires many AIs that are similarly internally misaligned, or at least more likely to cooperate amongst themselves than with humans.
So I’m not convinced it’s necessarily the same distribution of damage probabilities, but it still leaves a lot of room for doom. E.g. if you really can engineer superspreadable and damaging pathogens, you may not need that many AIs cooperating.
If you mean that as the simplified version of my claim, I don’t agree that it is equivalent.
Your starting point, a powerful AI that can do damage by itself, is wrong. My starting point is groups of people whom we would not currently consider sources of risk, but who become very dangerous as novel weaponry, along with changes in the relations of economic production, unlocks both the means and the motive to kill very large numbers of people.
And (as I’ve tried to clarify in my other responses) the comparison of this scenario to misaligned-AI cases is not the point; the point is the threat from both sides of the alignment question.