private_messaging comments on The genie knows, but doesn’t care

private_messaging 13 Sep 2013 5:16 UTC
−2 points
0
Let’s look at it this way. Do you agree that if the AI can increase its clock speed (with no ill effect), it will do so for the same reasons for which you concede it may go to space? Do you understand the basic logic that an increase in clock speed increases the expected number of “rewards” during the lifetime of the universe? (Which, btw, goes for your “go to space with a battery” scenario. Longest time, maybe; largest reward over the time, no.)

(That would not, by itself, change the scenario just yet. I want to walk you through the argument step by step because I don’t know where you fail. Maximizing the reward over the future time, that is a human label we have… it’s not really the goal.)
- TheOtherDave 13 Sep 2013 17:00 UTC
  0 points
  0
  Parent
  I agree that a system that values number of experienced reward-moments therefore (instrumentally) values increasing its “clock speed” (as you seem to use the term here). I’m not sure if that’s the “basic logic” you’re asking me about.
  - private_messaging 13 Sep 2013 17:20 UTC
    −2 points
    0
    Parent
    Well, this immediately creates an apparent problem that the AI is going to try to run itself very very fast, which would require resources, and require expansion, if anything, to get energy for running itself at high clock speeds.
    
    I don’t think this is what happens either, as the number of reward-moments could be increased to its maximum by modifications to the mechanism processing the rewards (when getting far enough along the road that starts with the shorting of the wires that go from the button to the AI).
    - TheOtherDave 13 Sep 2013 18:33 UTC
      0 points
      0
      Parent
      I agree that if we posit that increasing “clock speed” requires increasing control of resources, then the system we’re hypothesizing will necessarily value increasing control of resources, and that if it doesn’t, it might not.
      - private_messaging 13 Sep 2013 20:09 UTC
        −2 points
        0
        Parent
        So what do you think regarding the second point of mine?
        
        To clarify, I am pondering the ways in which the maximizer software deviates from our naive mental models of it, and trying to find what the AI could actually end up doing after it forms a partial model of what its hardware components do about its rewards—tracing the reward pathway.
        TheOtherDave 13 Sep 2013 21:05 UTC
        0 points
        0
        Parent
        Regarding your second point, I don’t think that increasing “clock speed” necessarily requires increasing control of resources to any significant degree, and I doubt that the kinds of system components you’re positing here (buttons, wires, etc.) are particularly important to the dynamics of self-reward.
        private_messaging 13 Sep 2013 21:12 UTC
        −2 points
        0
        Parent
        I don’t have a particular opinion with regard to the clock speed either way.
        
        With the components, what I am getting at is that the AI could figure out (by building a sufficiently advanced model of its implementation) how to attain the utility-equivalent of sitting forever in space being rewarded, within one instant, which would make it unable to have a preference for longer reward times.
        
        I raised the clock-speed point to clarify that the actual time is not the relevant variable.
        TheOtherDave 13 Sep 2013 22:25 UTC
        0 points
        0
        Parent
        It seems to me that for any system, either its values are such that it net-values increasing the number of experienced reward-moments (in which case both actual time and “clock speed” are instrumentally valuable to that system), or its values aren’t like that (in which case those variables might not be relevant).
        
        And, sure, in the latter case then it might not have a preference for longer reward times.
        private_messaging 13 Sep 2013 22:36 UTC
        −2 points
        0
        Parent
        Agreed.
        
        My understanding is that it would be very hard in practice to “superintelligence-proof” a reward system so that no instantaneous solution is possible (given that the AI will modify the hardware involved in its reward).
        TheOtherDave 13 Sep 2013 22:40 UTC
        0 points
        0
        Parent
        I agree that guaranteeing that a system will prefer longer reward times is very hard (whether the system can modify its hardware or not).
        private_messaging 13 Sep 2013 23:27 UTC
        −1 points
        0
        Parent
        Yes, of course… well, even apart from the guarantees, it seems to me that it is hard to build the AI in such a way that it would be unable to find a better solution than to wait.
        
        By the way, a “reward” may not be the appropriate metaphor—if we suppose that a press of a button results in the absence of an itch, or the absence of pain, then that does not suggest the existence of a drive to preserve itself. Which suggests that the drive to preserve itself is not inherently a feature of utility maximization in systems that are driven by conditioning, and would require additional work.
        TheOtherDave 13 Sep 2013 23:53 UTC
        1 point
        0
        Parent
        
        apart from the guarantees, it seems to me that it is hard to build the AI in such a way that it would be unable to find a better solution than to wait
        
        I’m not sure what the difference is between a guarantee that the AI will not X, on the one hand, and building an AI in such a way that it’s unable to X, on the other.
        
        Regardless, I agree that it does not follow from the supposition that pressing a button results in absence of an itch, or absence of pain, or some other negative reinforcement, that the button-pressing system has a drive to preserve itself.
        
        And, sure, it’s possible to have a utility-maximizing system that doesn’t seek to preserve itself. (Of course, if I observe a utility-maximizing system X, I should expect X to seek to preserve itself, but that’s a different question.)
        private_messaging 14 Sep 2013 9:15 UTC
        0 points
        0
        Parent
        
        I’m not sure what the difference is between a guarantee that the AI will not X, on the one hand, and building an AI in such a way that it’s unable to X, on the other.
        
        About the same as between coming up with a true conjecture, and making a proof, except larger, I’d say.
        
        Of course, if I observe a utility-maximizing system X, I should expect X to seek to preserve itself, but that’s a different question.
        
        Well, yes, given that if it failed to preserve itself you wouldn’t be seeing it; with software, though, there is no particular necessity for it to try to preserve itself.
        TheOtherDave 14 Sep 2013 17:45 UTC
        2 points
        0
        Parent
        
        I’m not sure what the difference is between a guarantee that the AI will not X, on the one hand, and building an AI in such a way that it’s unable to X, on the other.
        About the same as between coming up with a true conjecture, and making a proof, except larger
        
        Ah, I see what you mean now. At least, I think I do. OK, fair enough.