Thing is: a narrow AI that doesn’t model human minds, or human attempts to disrupt its strategies, isn’t going to hide what it plans to do.
So you build your narrow super-medicine-bot and ask it to plan out how it will achieve the goal you’ve given it and to provide a full walkthrough and description.
It’s not a general AI; it doesn’t have any programming for understanding lying or misleading anyone, so it lays out the plan in full for the human operator. (Why would it not?)
who promptly changes the criteria for success and tries again.
I could well be confused about this, but: if the AI “doesn’t model human minds” at all, how could it interpret the command to “provide a full walkthrough and description”?
Until they stumble upon an AI that lies, possibly inadvertently, and then we’re dead...
But I do agree that general intelligence is more dangerous; it’s just that narrow intelligence isn’t harmless.
How do you convincingly lie without having the capability to think up a convincing lie?
Think you’re telling the truth.
Or be telling the truth, but be misinterpreted.
Every statement an AI makes to us will be a lie to some extent, simply by virtue of being a simplification so that we can understand it. If we end up selecting against simplifications that reveal nefarious plans...
But the narrow AI I described above might not even be capable of lying: it might simply spit out the drug design, with a list of estimated improvements according to the criteria it’s been given, without anyone ever realising that “reduced mortality” was code for “everyone’s dead already”.
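To make that failure mode concrete, here is a minimal Python sketch (all names and numbers are invented for illustration, not taken from any real system) of how a naively coded “reduced mortality” criterion could score a catastrophic plan as optimal:

```python
# Hypothetical sketch of an incautiously coded objective.
# Names and numbers are invented purely for illustration.

def mortality_score(deaths_in_window: int, patients_at_start: int) -> float:
    """Naive 'reduced mortality' metric: deaths during the evaluation
    window divided by the number of patients alive when the window opens.

    The metric never asks what happened *before* the window, so a plan
    under which everyone is already dead scores a perfect 0.0.
    """
    if patients_at_start == 0:
        return 0.0  # no patients left to die, so a "perfect" score
    return deaths_in_window / patients_at_start

# A genuinely good drug: 1000 patients, 50 die over the year.
print(mortality_score(deaths_in_window=50, patients_at_start=1000))  # 0.05

# A plan under which everyone is dead before the window even opens.
print(mortality_score(deaths_in_window=0, patients_at_start=0))      # 0.0
```

Note that nothing in this sketch involves lying: the report handed back (“mortality reduced to 0”) is literally true under the criterion the optimizer was given.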
Not so. You can definitely ask questions about complicated things that have simple answers.
Yes, that was an exaggeration—I was thinking of most real-world questions.
I was thinking of most real-world questions that aren’t of the form ‘Why X?’ or ‘How do I X?’.
“How much/many X?” → number
“When will X?” → number
“Is X?” → boolean
“What are the chances of X if I Y?” → number
Also, any answer that simplifies isn’t a lie if its simplified status is made clear.
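For what it’s worth, here is a small Python sketch of the idea behind that list (the interface names are invented for illustration): if the oracle’s reply channel only admits numbers and booleans, there is no free-form answer in which a “simplification” could hide.

```python
# Illustrative only: a reply channel restricted to simple, typed answers.
from dataclasses import dataclass
from typing import Union

SimpleAnswer = Union[float, bool]  # "How many X?" -> float, "Is X?" -> bool

@dataclass
class Query:
    kind: str      # e.g. "how_many", "when", "is", "chance_if"
    subject: str

def ask_oracle(query: Query) -> SimpleAnswer:
    """Toy stand-in for the narrow AI's output interface.

    Whatever reasoning happens internally, the only thing that reaches
    the operator is a single number or boolean, with no explanatory
    prose in which a misleading simplification could live.
    """
    # Placeholder values; a real system would compute these.
    if query.kind == "is":
        return True
    return 0.0

print(ask_oracle(Query(kind="chance_if", subject="mortality falls if we ship drug X")))
```

Whether those bare numbers are computed against a sane criterion is, of course, the separate problem discussed above.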
I think this sums it up well. To my understanding, it would only require someone “looking over its shoulder”, asking for its specific objective for each drug and the drug’s expected results. I doubt a “limited intelligence” would be able to lie. That is, unless it somehow mutated or accidentally became a more general AI, but then we’ve jumped rails into a different problem.
It’s possible that I’m paying too much attention to your example, and not enough attention to your general point. I guess the moral of the story is, though, “limited AI can still be dangerous if you don’t take proper precautions”, or “incautiously coded objectives can be just as dangerous in limited AI as in general AI”. Which I agree with, and which is a good point.