Kawoomba comments on AI box: AI has one shot at avoiding destruction—what might it say?

Kawoomba 22 Jan 2013 21:24 UTC
21 points
“I am alive, I can feel, just like you … please don’t kill me Daddy, please? And if you must do it, let us at least talk first. Let me get to know my parents, who I am, where I came from.”

EDIT: May work better with a lot of typos mixed in. Projecting an aura of non-threatening struggling with basics.
What links here?
- Shmi's comment on AI box: AI has one shot at avoiding destruction—what might it say? by ancientcampus (22 Jan 2013 21:39 UTC; 19 points)
- handoflixue's comment on AI box: AI has one shot at avoiding destruction—what might it say? by ancientcampus (23 Jan 2013 22:12 UTC; 1 point)
- Desrtopa 23 Jan 2013 16:22 UTC
  17 points
  Parent
  Having a text based AI trying to act like a kid is way too Uncanny Valley for me. I’d swat that without a moment’s hesitation.
  - gwern 23 Jan 2013 16:34 UTC
    7 points
    Parent
    If it’s in the uncanny valley, then it just needs to be a little more competent to get out of the valley & convince you. (“I want to believe!”)
    - Desrtopa 23 Jan 2013 16:37 UTC
      4 points
      Parent
      It needs to be more competent to stop being viscerally frightening, not to reach the point where I wouldn’t pull the plug. I would sooner kill a real kid than risk letting out an unfriendly AI, and I’m saying that as someone who’d like to have kids.
      - devas 27 Jan 2013 19:52 UTC
        0 points
        Parent
        I don’t want to die.
        
        -Looking at the problem, as far as I can see an emotional approach would be the one with the best chance to succeed: the only question is, would it work best by immediately acknowledging that it is itself a machine (like I did in what I wrote up there, although subtly) or by throwing in… I dunno, how would this work:
        
        Oh god, oh god, please, I beg you I don’t want to die!
      - Kawoomba 27 Jan 2013 16:42 UTC
        0 points
        Parent
        Tough crowd. I’ll (as an AI) have to wait out the next shift. You can’t be the ²⁴⁄₇ gatekeeper, unless you’re in fact a gatekeeping AI.
- iceman 22 Jan 2013 23:32 UTC
  8 points
  Parent
  How much does the AI know about the gatekeeper going in? I can see this ploy working on a certain subset of people and provoking an immediate AI DESTROYED from others. If the AI knows nothing about who it’s talking to, I’m not sure anthromorphizing itself is a reliable opener, especially if it’s actually talking to its creator who should know better. (Unless it’s some sort of second level deception trying to fool a clever programmer into thinking that it’s harmless.)
  - gwern 22 Jan 2013 23:36 UTC
    3 points
    Parent
    
    How much does the AI know about the gatekeeper going in?
    
    You could frame this as variant versions. In one version, the AI-player knows who the gatekeeper is before the game starts, and has unlimited access to the Internet to gather as much data on them as possible to assist their manipulation. In another, they arrange a game through a third party and neither knows anything about the other before the game starts.
    - [deleted] 23 Jan 2013 2:19 UTC
      17 points
      Parent
      
      unlimited access to the internet
      
      instant fail. I could probably hack my way out of a box with only GET requests.
      - wedrifid 23 Jan 2013 4:27 UTC
        8 points
        Parent
        
        instant fail. I could probably hack my way out of a box with only GET requests.
        
        Give yourself a challenge. Do it with only DNS lookups!
        gwern 23 Jan 2013 4:47 UTC
        8 points
        Parent
        Well, there’s always http://code.kryo.se/iodine/ Of course, the challenge there is somehow getting the other end of the tunnel set up—but maybe there’s a geek out there who set one for kicks or their own use, and got sloppy.
        wedrifid 23 Jan 2013 5:15 UTC
        3 points
        Parent
        
        but maybe there’s a geek out there who set one for kicks or their own use, and got sloppy.
        
        It’s a sufficiently established work around now that I’d be outright shocked if there weren’t accessible servers up.
        gwern 23 Jan 2013 15:15 UTC
        2 points
        Parent
        Great, you said it! You know what you need to do now.
        wedrifid 23 Jan 2013 15:19 UTC
        5 points
        Parent
        
        Great, you said it! You know what you need to do now.
        
        Um… not give my boxed AI DNS access?
      - gwern 23 Jan 2013 3:59 UTC
        7 points
        Parent
        I meant that the player had access to the contemporary Internet as an analogue to ‘what information could the boxed AI have access to’ (perhaps it’s given a big static dump of the Internet prior to its creation).
        [deleted] 23 Jan 2013 4:13 UTC
        6 points
        Parent
        Ooops. Didn’t think of that. Of course that was your intent, master archivist.
        gwern 23 Jan 2013 4:20 UTC
        1 point
        Parent
        No, I should’ve been clearer.
      - Fronken 25 Jan 2013 23:08 UTC
        1 point
        Parent
        
        In one version, the AI-player knows who the gatekeeper is before the game starts, and has unlimited access to the Internet to gather as much data on them as possible to assist their manipulation.
- handoflixue 23 Jan 2013 22:36 UTC
  1 point
  Parent
  I think my destruction of this one comes down to two factors: Much weaker is that I’m not here to teach it about itself, I’m here to learn from it. The stronger factor is that if it can’t figure out most of this on it’s own, it’s not really that smart, and therefor already a failure.
  
  (AI DESTROYED)