ike comments on Summoning the Least Powerful Genie

ike 16 Sep 2015 18:34 UTC
0 points
“it is not an agent” is not a description of how to build an AI that is, in fact, not an agent. It’s barely better than “not an unsafe AI”.

Besides, isn’t “giving an answer to the prediction” a rather agenty thing for such an AI to do?
- TheAncientGeek 16 Sep 2015 20:02 UTC
  1 point
  Parent
  
  “it is not an agent” is not a description of how to build an AI that is, in fact, not an agent. It’s barely better than “not an unsafe AI”.
  
  Non-agents aren’t all that mysterious. We can already build non-agents. Google is a non-agent.
  
  Besides, isn’t “giving an answer to the prediction” a rather agenty thing for such an AI to do?
  
  No, it’s a response. Non agency means not doing anything unless prompted.
  - ike 16 Sep 2015 20:16 UTC
    2 points
    Parent
    
    Non agents aren’t all that mysterious. We can already build non agents. Google is a non agent.
    
    Compare: safe (in the FAI sense) computer programs aren’t that mysterious. We can already build safe computer programs. Android is a safe computer program.
    
    Non agency means not doing anything unless prompted.
    
    Well, who cares if it doesn’t do anything unless prompted, if it takes over the universe when prompted to answer a question? And if you can rigorously tell it not to do that, you’ve already solved FAI.
    - TheAncientGeek 16 Sep 2015 20:39 UTC
      1 point
      Parent
      
      Non agents aren’t all that mysterious. We can already build non agents. Google is a non agent.
      
      Compare: safe (in the FAI sense) computer programs aren’t that mysterious. We can already build safe computer programs. Android is a safe computer program.
      
      Do you have a valid argument that nonagentive programmes would be dangerous? Because saying “it would agentively do X” isn’t a valid argument. Pointing out the hidden pitfalls of such programmes is something MIRI could usefully do. An unargued belief that everything is dangerous is not useful.
      
      Well, who cares if it doesn’t do anything unless prompted, if it takes over the universe when prompted to answer a question?
      
      Oh, you went there.
      
      Well: how likely is an AI designed to be nonagentive as a safety feature to have that particular failure mode?
      
      And if you can rigorously tell it not to do that, you’ve already solved FAI.
      
      You may have achieved safety, but it has nothing to do with “achieving FAI” in the MIRI sense of hardcoding the totality of human value. The whole point is that it is much easier, because you are just not building in agency.
      - lmm 17 Sep 2015 20:11 UTC
        0 points
        Parent
        A program designed to answer a question necessarily wants to answer that question. A superintelligent program trying to answer that particular question runs the risk of acting as a paperclip maximizer.
        
        Suppose you build a superintelligent program that is designed to make precise predictions, by being more creative and better at predictions than any human would. Why are you confident that one of the creative things this program does to make itself better at predictions isn’t turning the matter of the Earth into computronium as step 1?
        Lumifer 17 Sep 2015 20:34 UTC
        2 points
        Parent
        
        A program designed to answer a question necessarily wants to answer that question.
        
        I don’t think my calculator wants anything.
        lmm 18 Sep 2015 20:24 UTC
        0 points
        Parent
        Does an amoeba want anything? Does a fly? A dog? A human?
        
        You’re right, of course, that we have better models for a calculator than as an agent. But that’s only because we understand calculators and they have a very limited range of behaviour. As a program gets more complex and creative it becomes more predictive to think of it as wanting things (or rather, the alternative models become less predictive).
        Lumifer 18 Sep 2015 20:38 UTC
        0 points
        Parent
        Notice the difference (emphasis mine):
        
        A program designed to answer a question necessarily wants to answer that question
        
        vs
        
        ...it becomes more predictive to think of it as wanting things
        
        VoiceOfRa 20 Sep 2015 20:42 UTC
        1 point
        Parent
        Well, the fundamental problem is that LW-style qualiafree-rationalism has no way to define what the word “want” means.
        lmm 20 Sep 2015 18:40 UTC
        0 points
        Parent
        Is there a difference between “x is y” and “assuming that x is y generates more accurate predictions than the alternatives”? What else would “is” mean?
        Lumifer 21 Sep 2015 15:08 UTC
        0 points
        Parent
        
        Is there a difference between “x is y” and “assuming that x is y generates more accurate predictions than the alternatives”? What else would “is” mean?
        
        Are you saying the model with the currently-best predictive ability is reality??
        lmm 25 Sep 2015 6:51 UTC
        0 points
        Parent
        Not quite—rather the everyday usage of “real” refers to the model with the currently-best predictive ability. http://lesswrong.com/lw/on/reductionism/ - we would all say “the aeroplane wings are real”.
        Lumifer 25 Sep 2015 14:40 UTC
        0 points
        Parent
        
        rather the everyday usage of “real” refers to the model with the currently-best predictive ability
        
        Errr… no? I don’t think this is true. I’m guessing that you want to point out that we don’t have direct access to the territory and that maps are all we have, but that’s not very relevant to the original issue of replacing “I find it convenient to think of that code as wanting something” with “this code wants” and insisting that the code’s desires are real.
        
        Anthropomorphization is not the way to reality.
        TheAncientGeek 17 Sep 2015 21:50 UTC
        0 points
        Parent
        
        A program designed to answer a question necessarily wants to answer that question. A superintelligent program trying to answer that particular question runs the risk of acting as a paperclip maximizer.
        
        What does that mean? It’s necessarily satisfying a utility function? It isn’t, as Lumifer’s calculator shows.
        
        Suppose you build a superintelligent program that is designed to make precise predictions, by being more creative and better at predictions than any human would. Why are you confident that one of the creative things this program does to make itself better at predictions isn’t turning the matter of the Earth into computronium as step 1?
        
        I can be confident that nonagents won’t do agentive things.
        lmm 18 Sep 2015 20:25 UTC
        0 points
        Parent
        Why are you so confident your program is a nonagent? Do you have some formula for nonagent-ness? Do you have a program that you can feed some source code to and it will output whether that source code forms an agent or not?
        TheAncientGeek 19 Sep 2015 8:09 UTC
        0 points
        Parent
        It’s all standard software engineering.
        lmm 20 Sep 2015 18:39 UTC
        0 points
        Parent
        I’m a professional software engineer, feel free to get technical.
        TheAncientGeek 21 Sep 2015 9:45 UTC
        2 points
        Parent
        Have you ever heard of someone designing a nonagentive programme that unexpectedly turned out to be agentive? Because to me that sounds like going into the workshop to build a skateboard and coming out with an F1 car.
        lmm 25 Sep 2015 6:48 UTC
        0 points
        Parent
        I’ve known plenty of cases where people’s programs were more agentive than they expected. And we don’t have a good track record on predicting which parts of what people do are hard for computers—we thought chess would be harder than computer vision, but the opposite turned out to be true.
        TheAncientGeek 28 Sep 2015 14:43 UTC
        0 points
        Parent
        
        I’ve known plenty of cases where people’s programs were more agentive than they expected.
        
        I haven’t: have you any specific examples?
        Lumifer 25 Sep 2015 14:54 UTC
        0 points
        Parent
        
        I’ve known plenty of cases where people’s programs were more agentive than they expected.
        
        “Doing something other than what the programmer expects” != “agentive”. An optimizer picking a solution that you did not consider is not being agentive.
      - ike 16 Sep 2015 20:53 UTC
        0 points
        Parent
        
        Do you have a valid argument that nonagentive programmes would be dangerous? Because saying “it would agentively do X” isn’t a valid argument. Pointing out the hidden pitfalls of such programmes is something MIRI could usefully do. An unargued belief that everything is dangerous is not useful.
        
        I’m claiming that “nonagent” is not descriptive enough to actually build one. You replied that we already have non agents, and I replied that we already have safe computer programs. Just like we can’t extrapolate from our safe programs that any AI will be safe, we can’t extrapolate from our safe nonagents that any non-agent will be safe.
        
        Well: how likely is an AI designed to be nonagentive as a safety feature to have that particular failure mode?
        
        I still have little idea what you mean by nonagent. It’s a black box, that may have some recognizable features from the outside, but doesn’t tell you how to build it.
        TheAncientGeek 16 Sep 2015 21:19 UTC
        1 point
        Parent
        I replied that we can already build nonagents.
        
        It remains the case that if you think they could be dangerous, you need to explain how.
        
        I still have little idea what you mean by nonagent. It’s a black box, that may have some recognizable features from the outside, but doesn’t tell you how to build it.
        
        Again, we already know how to build them, in that we have them.
        
        Worse than that. MIRI can’t actually build anything they propose. It’s just that some MIRI people have a reflex habit of complaining that anything outside of MIRI land is too vague.