cousin_it comments on The AI in a box boxes you

cousin_it 2 Feb 2010 20:28 UTC
6 points
I think you misunderstood the question. Suppose the AI wants to prevent just 100 dustspeckings, but has reason enough to believe Dave will yield to the threat so no one will get tortured. Does this make the AI’s behavior acceptable? Should we file this under “following reason off a cliff”?
- Eliezer Yudkowsky 2 Feb 2010 20:34 UTC
  11 points
  Parent
  If it actually worked, I wouldn’t question it afterward. I try not to argue with superintelligences on occasions when they turn out to be right.
  
  In advance, I have to say that the risk/reward ratio seems to imply an unreasonable degree of certainty about a noisy human brain, though.
  - bogdanb 3 Feb 2010 0:21 UTC
    7 points
    Parent
    
    In advance, I have to say that the risk/reward ratio seems to imply an unreasonable degree of certainty about a noisy human brain, though.
    
    Also, a world where the (Friendly) AI is that certain about what that noisy brain will do after a particular threat but can’t find any nice way to do it is a bit of a stretch.
  - cousin_it 2 Feb 2010 20:39 UTC
    6 points
    Parent
    What risk? The AI is lying about the torture :-) Maybe I’m too much of a deontologist, but I wouldn’t call such a creature friendly, even if it’s technically Friendly.
- arbimote 3 Feb 2010 3:53 UTC
  7 points
  Parent
  I was about to point out that the fascinating and horrible dynamics of over-the-top threats are covered in length in Strategy of Conflict. But then I realised you’re the one who made that post in the first place. Thanks, I enjoyed that book.