Logan Zoellner comments on Instrumental Convergence Bounty

Logan Zoellner 17 Sep 2023 22:04 UTC
1 point
0
If I ask an AI to duplicate a strawberry and it takes of the world, I would consider that misaligned. Obviously it will require some instrumental convergence (resources, intelligence, etc) to duplicate a strawberry. An aligned AI should either duplicate the strawberry while staying within a “budget” for how many resources it consumes, or say “I’m sorry, I can’t do that”.

I would recommend you read my post on corrigibility which describes how we can mathematically define a tradeoff between success and resource exploitation.