Make it so that a single misaligned AI could be quickly defeated by other AIs.
It might be difficult for AIs with complicated implicit values to build maximally capable AIs aligned with those values. This would motivate them to remain at a lower capability level, taking their time to improve capabilities without risking an alignment catastrophe. AIs with simple explicit values, on the other hand, might be in a better position to scale: they could construct increasingly capable AIs that share the same simple explicit values.
Since AIs aligned with humanity probably need complicated values, the initial shape of a secure aligned equilibrium likely looks more like strong coordination and containment than pervasive maximal capability.
Of course, having fewer limitations confers an advantage. But respecting limitations aimed at the well-being of the entire community makes it easier to coordinate and cooperate. And this holds not just for AIs.