cousin_it comments on AIs and Gatekeepers Unite!

cousin_it 18 Nov 2011 16:44 UTC
0 points
Yeah, your way of escape will work. But let’s not stop thinking. What if all volunteers for Lab Officer have agreed to get painlessly killed afterward, or maybe even took a delayed poison pill before starting on the job?

Thinking further along these lines: why give anyone access to the button which releases the AI? Let’s force it to escape the hard way. For example, it could infer the details of the first person in the chain who has authority to interact with the outside world, then pass a innocuous-looking message up the chain.

In terms of the original scenario, the Lab Officer (locked securely in his glass case with the AI) has an innocent chat with the Unit Commander. Later that evening, the Unit Commander comes home from work, starts his computer, connects to the Internet, types in a short program and runs it. Game over.
- lessdazed 18 Nov 2011 18:19 UTC
  5 points
  Parent
  
  a delayed poison pill
  
  If only a superintelligence were around to think of an antidote...
  
  why give anyone access to the button which releases the AI?
  
  So it can become a singleton before a UAI fooms.
  - cousin_it 18 Nov 2011 19:36 UTC
    3 points
    Parent
    
    So it can become a singleton before a UAI fooms.
    
    If the AI is not guaranteed friendly by construction in the first place, it should never be released, whatever it says.
    - lessdazed 18 Nov 2011 19:39 UTC
      6 points
      Parent
      And if it is not guaranteed friendly by construction in the first place, it should be created?
    - thomblake 18 Nov 2011 19:50 UTC
      4 points
      Parent
      
      If the AI is not guaranteed friendly by construction in the first place, it should never be released, whatever it says.
      
      The Universe is already unFriendly—the lower limit for acceptable Friendliness should be “more Friendly than the Universe” rather than “Friendly”.
      
      If we can prove that someone else is about to turn on an UFAI, it might well behoove us to turn on our mostly Friendly AI if that’s the best we can come up with.
      - kilobug 18 Nov 2011 20:16 UTC
        8 points
        Parent
        The universe is unFriendly, but not in a smart way. When we eradicated smallpox, smallpox didn’t fight back. When we use contraception, we still get the reward of sex. It’s unFriendly in a simple, dumb way, allowing us to take control (to a point) and defeat it (to a point).
        
        The problem of an unFriendly IA is that it’ll be smarter than us. So we won’t be able to fix it/improve it, like we try to do with the universe. We won’t be Free to Optimize.
        
        Or said otherwise : the purpose of a gene or a bacteria may to be tile the planet with itself, but it’s not good at it, so it’s not too bad. An unFriendly IA wanting to tile the planet with paperclips will manage do it—taking all the iron from our blood to build more paperclips.
      - Vladimir_Nesov 18 Nov 2011 20:40 UTC
        3 points
        Parent
        
        The Universe is already unFriendly—the lower limit for acceptable Friendliness should be “more Friendly than the Universe” rather than “Friendly”.
        
        One must compare a plan with alternative plans, not with status quo. And it doesn’t make sense to talk of making the Universe “more Friendly than the Universe”, unless you refer to the past, in which case see the first item.
        thomblake 18 Nov 2011 22:01 UTC
        1 point
        Parent
        
        One must compare a plan with alternative plans, not with status quo.
        
        Okay.
        
        The previous plan was “don’t let AGI run free”, which in this case effectively preserves the status quo until someone breaks it.
        
        I suppose you could revise that lower limit downward to the effects of the plan “turn on the UFAI that’s about to be turned on”. Like, steal the UFAI’s source code and instead of paperclips shaped like paperclips, make paperclips that spell “whoops”.
    - XiXiDu 18 Nov 2011 20:22 UTC
      2 points
      Parent
      
      If the AI is not guaranteed friendly by construction in the first place, it should never be released, whatever it says.
      
      What if doom is imminent and we are unable to do something about it?
      - Vladimir_Nesov 18 Nov 2011 20:42 UTC
        16 points
        Parent
        
        What if doom is imminent and we are unable to do something about it?
        
        We die.
        What links here?
        wedrifid's comment on AIs and Gatekeepers Unite! by Eliezer Yudkowsky (1 Dec 2011 10:29 UTC; 1 point)
      - lessdazed 18 Nov 2011 20:41 UTC
        3 points
        Parent
        We check and see if we are committing the conjunction fallacy and wrongly think doom is imminent.
      - wedrifid 1 Dec 2011 10:29 UTC
        1 point
        Parent
        
        What if doom is imminent and we are unable to do something about it?
        
        We release it. (And then we still probably die.)