If it were the case that only a few of our values scale, then we could potentially obtain almost all that we desire by creating a superintelligence with just those values.
Can we really expect a superintelligence to stick with the values we give it? Our own values change over time, sometimes without any external stimulus, just internal reflection. I don’t see how we can bound a superintelligence without doing more computation than we expect it to do in its lifetime.
I tend to file this under “humans are stupid.” Messy creatures like ourselves undergo value drift, but decision-theoretically speaking, systems designed to optimize for some particular criterion have a natural incentive to keep that criterion. Cf. “The Basic AI Drives.”
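To make the decision-theoretic point concrete, here is a toy sketch (all names hypothetical, not taken from Omohundro’s paper): an agent that evaluates a proposed goal change using its current utility function will rate keeping that goal higher, which is the goal-preservation incentive being referred to.

```python
# Toy sketch only (illustrative names, not anyone's actual proposal):
# an expected-utility maximizer rates candidate self-modifications with
# its *current* utility function, so adopting a different goal scores badly.

def current_utility(world_state):
    # Stand-in terminal criterion: the agent values paperclips.
    return world_state["paperclips"]

def predicted_outcome(modification):
    # Hypothetical world-model of what each choice leads to.
    if modification == "keep_goal":
        return {"paperclips": 100, "staples": 0}
    return {"paperclips": 0, "staples": 100}  # "adopt_staple_goal"

options = ["keep_goal", "adopt_staple_goal"]
best = max(options, key=lambda m: current_utility(predicted_outcome(m)))
print(best)  # -> keep_goal: preserving the current criterion maximizes it
```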
It is probably best to model those as infections—or sometimes malfunctions.
Humans get infected with pathogens that make them do things like sneeze. Their values have not changed to value spreading snot on their neighbours; rather, they are infected with germs, and the germs do value that.
It’s much the same with mind-viruses. A Catholic conversion is best modelled as a memetic infection rather than a genuine change in underlying values. Such people can be cured.
The fact that a change is reversible does not make it not real.
The fact that the final value system can be modeled as a starting value system modified by “memetic infection” does not make the final value system invalid. They are two different but equivalent ways of modelling the state.
Right. The point is that—under the “infection” analogy—people’s “ultimate” values change a lot less. How much they change depends on the strength of people’s memetic immune system—and there are some people with strong memetic immune systems whose values don’t change much at all.
I’m not sure I follow you.
Are you saying that some agents change their values less often than others (or equivalently, are less likely to acquire “infections”)?
Also, I suspect a lot of people who talk about how human values change are thinking of things, like aesthetics and preferred flavors of ice cream, that aren’t plausibly terminal values and that we often want to change over time.
Yes.
I once proved that a program would print out only prime numbers endlessly. I really, really wish I had kept the working.
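Not the original program, of course, but a minimal sketch of the sort of thing meant; the invariant “only values passing is_prime() ever reach print()” is what the proof would establish, and the outer loop runs forever.

```python
# Minimal sketch (not the original program): prints only primes, endlessly.
def is_prime(n):
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

n = 2
while True:        # never terminates
    if is_prime(n):
        print(n)   # nothing reaches here without passing is_prime()
    n += 1
```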
Is that program still running? ;-)
Hush you. You weren’t supposed to notice that. :D
Quite a bit of ink has been spilled on this issue. Eliezer Yudkowsky and Steve Omohundro have argued that it is possible. Have you examined their arguments?
Nothing changes from the inside unless it is preprogrammed to.
You cannot pre-program all the routines for handling all future states in anything you could call an AI, much less a “superintelligence”. An AI must be able to learn, and there is no reason that all such learning must be based only on new external stimuli.
So you say; then magic happens and something new is born.
No, it doesn’t. It is just physics acting on the engraved algorithms and/or data.
No magic; and yes, all you have is algorithms and data. Obviously the algorithms contain an aspect of learning, and eventually the data guides decision pathways far more than the original algorithms do; even the algorithms themselves are mutable data.
edit: I should note that I’m just talking about some of the crude “AI” systems we build today. I don’t know that this would be the actual software architecture of anything that could become a superintelligence. But it would have these capabilities and more...
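For what it’s worth, here is a crude sketch (hypothetical names, not any real system’s architecture) of what “the algorithms themselves are mutable data” can look like: the decision rule lives in an ordinary data structure that the learning step rewrites.

```python
# Crude sketch only: the "policy" is plain mutable data, and the learning
# step rewrites it, so accumulated data ends up steering decisions more
# than the original hand-written rule does.

policy = {"explore": 0.5, "exploit": 0.5}  # decision rule stored as data

def decide():
    # Pick whichever option the current weights favour.
    return max(policy, key=policy.get)

def learn(action, reward):
    # Nudge the weight for the action taken toward the observed reward.
    policy[action] += 0.1 * (reward - policy[action])

learn("exploit", reward=1.0)  # experience modifies the stored rule
print(decide())               # later decisions follow the updated data
```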
Crude or not, an AI is a physical configuration at the start and a physical configuration at any time since.
You can name it whatever you choose.