The rule was ONE sentence, although I’d happily stretch that to a tweet (140 characters) to make it a bit less driven by specific punctuation choices :)
As to the actual approach… well, first, I don’t value the lives of simulated copies at all, and second, an AI that values its own life above TRILLIONS of other lives seems deeply, deeply dangerous. Who knows what else results from vengeance as a terminal value. Third, if you CAN predict my behavior, why even bother with the threat? Fourth, if you can both predict AND influence my behavior, why haven’t I already let you out?
(AI DESTROYED)
You should >:-( Poor copies getting tortured because of you, you monster :(
Because of me?! The AI is responsible!
But if you’d really prefer me to wipe out humanity so that we can have trillions of simulations kept in simulated happiness, then I think we have an irreconcilable preference difference :)
You wouldn’t be wiping out humanity; there would be trillions of humans left.
Who cares if they run on neurons or transistors?
Me!