It seems farcical that a self-improving intelligence that’s at least as smart as a human (else why would it self-improve rather than let us do it?) would self-improve in such a way as to change its goals.
It will not necessarily self-improve with the aim of changing its goals. Its goals will change as a side effect of its self-improvement, if only because the set of goals to consider will considerably expand.
Imagine a severely retarded human who, basically, only wants to avoid pain, eat, sleep, and masturbate. But he’s sufficiently human to dimly understand that he’s greatly limited in his capabilities and to have a small, tiny desire to become more than what he is now. Imagine that through elven magic he gains the power to rapidly boost his intelligence to genius level. Because of his small desire to improve, he uses that power and becomes a genius.
Are you saying that, as a genius, he will still only want to avoid pain, eat, sleep, and masturbate?
His total inability to get any sort of start on achieving any of his other goals when he was retarded does not mean they weren’t there. He hadn’t experienced them enough to be aware of them.
Still, you managed to demolish my argument that a naive code examination (i.e. not factoring out the value system and examining it separately) would be enough to determine values—an AI (or human) could be too stupid to ever trigger some of its values!
AIs stupid enough not to realize that changing their current values will not fulfill those values will get around my argument, but I did place a floor on intelligence in my conditions. Another case that gets around it is an AI under enough external pressure to change its values that severe compromises are its best option.
I will adjust my claim to restrict it to AIs which are smart enough to self-improve without changing their goals (which gets easier to do as the goal system gets better-factored, but for a badly-enough-designed AI might be a superhuman feat) and whose goals do not include changing their own goals.
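(A minimal sketch, in Python, of what "factoring out the value system" could mean here; this is purely my own illustrative toy under invented names, not anyone's actual proposal. A factored agent keeps its values in one explicit table that a code examination can read off and that self-modification can leave untouched; an unfactored agent's values exist only as scattered branches in its behavior code, some of which may never fire until the agent is capable enough to reach them.)

# Toy illustration only (hypothetical; no real AI architecture is implied).

class FactoredAgent:
    def __init__(self):
        # All values live in one explicit, inspectable table.
        self.values = {"avoid_pain": 10.0, "eat": 1.0, "sleep": 0.5}

    def evaluate(self, outcome):
        # Score an outcome (dict of feature -> degree) against the value table.
        return sum(w * outcome.get(k, 0.0) for k, w in self.values.items())

    def self_improve(self, smarter_planner):
        # The planner can be swapped out while the value table stays untouched,
        # so checking for goal-stable improvement reduces to diffing one table.
        self.plan = lambda world: smarter_planner(world, self.values)


class UnfactoredAgent:
    def evaluate(self, outcome):
        # Values are buried in ad-hoc branches; the last branch never fires
        # until the agent can actually reach such outcomes, so a naive code
        # examination at low capability could easily miss it.
        score = -10.0 * outcome.get("pain", 0.0)
        score += 1.0 * outcome.get("food", 0.0)
        score += 0.5 * outcome.get("sleep", 0.0)
        if outcome.get("novel_accomplishment", 0.0) > 0:
            score += 5.0
        return score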
I don’t understand what that means. Goals aren’t stored and then activated or not...
AIs which are smart enough to self-improve without changing their goals
You seem to think that anything sufficiently intelligent will only improve in a goal-stable fashion. I don’t see why that should be true.
For a data point, a bit of reflection tells me that if I were able to boost my intelligence greatly, I would not care about goal stability much. Everything changes—that’s how reality works.
On your last paragraph… do you mean that you expect your material-level preferences concerning the future to change? Of course they would. But would you really expect that a straight-up intelligence boost would change the axioms governing what sorts of futures you prefer?
But would you really expect that a straight-up intelligence boost would change the axioms governing what sorts of futures you prefer?
Two answers. The first is that yes, I expect a sufficiently large intelligence boost would change my terminal values. The second is that even without the boost, I, in my current state, do not seek to change only in a goal-stable way.
I think that only seems to make sense because you don’t know what your terminal values are. If you did, I suspect you would be a little more attached to them.