Assuming all of the following are possible: what would happen if every person had a superintelligent AI whose utility function was that person’s idealized, extrapolated utility function?
How would that compare to a scenario with a single AI embodying a successful calculation of CEV?
A singleton AI given an individual CEV for each human can do at least as well by simulating a negotiation among uniformly powerful individual AIs, one per CEV. This is also more stable: the singleton’s simulation enforces uniform levels of power, whereas actual AIs could potentially diverge in power.
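As a rough illustration of the “simulated negotiation with enforced equal power” idea (the outcome space, utility numbers, and the log-product bargaining rule below are all stand-ins of mine, not anything specified in the thread):

```python
# Toy illustration only: a "singleton" aggregating per-person utility
# functions as if they were equally powerful negotiators, versus the same
# rule once power weights have diverged. All values are made up.

import math

# Hypothetical stand-in utility functions over a tiny outcome space.
utilities = {
    "alice": {"park": 0.9, "mall": 0.2, "library": 0.6},
    "bob":   {"park": 0.3, "mall": 0.8, "library": 0.7},
    "carol": {"park": 0.5, "mall": 0.1, "library": 0.9},
}
outcomes = ["park", "mall", "library"]

def negotiated_outcome(power):
    """Pick the outcome maximizing a power-weighted Nash-style product.

    With equal weights this is one crude stand-in for 'simulated negotiation
    among uniformly powerful AIs'; with unequal weights, stronger agents
    dominate the choice.
    """
    def score(outcome):
        return sum(power[p] * math.log(utilities[p][outcome]) for p in power)
    return max(outcomes, key=score)

equal_power   = {"alice": 1.0, "bob": 1.0, "carol": 1.0}
unequal_power = {"alice": 5.0, "bob": 1.0, "carol": 0.2}

print(negotiated_outcome(equal_power))    # 'library' for these toy numbers
print(negotiated_outcome(unequal_power))  # 'park', alice's top preference
```

The only point of the toy example is that the same aggregation rule gives a different answer once power weights are allowed to diverge, which is the instability the singleton’s simulation is meant to avoid.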
I don’t think “individual CEV” is proper. It’s like calling an ATM an “ATM organism”, which would be even worse than calling it an “ATM machine”, as is common. The “C” means individual extrapolated volitions are combined coherently.
I agree it would in theory be better to have a singleton. But that requires knowing how to cohere extrapolated volitions. My idea is that it might be possible to push off that task to superintelligences without destroying the world in the process.
While it would be useful to be able to split the ‘combine different agents’ wishes’ part from the ‘act as if the agents were smarter and wiser’ part, as currently described the ‘C’ is still necessary even for an individual, because most organisms, most importantly humans, do not have coherent value systems as they stand. So as it stands we need to say things like CEV(humanity) and CEV(individual) for the label to make sense. The core of the problem is that there are three important elements of the process, and we are trying to represent them with just two letters of the acronym:
Make smarter, wiser and generally more betterer (intended emphasis on the informality needed for this level of terseness)
Make internally coherent
Combine with others
Those three don’t neatly separate into ‘C’ and ‘E’.
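Purely to make that terminology point concrete, one could picture the three elements as separate stages that the two-letter label has to cover. The function names below are hypothetical placeholders, not anything defined in the CEV document:

```python
# Hypothetical three-stage pipeline; the names only label the stages
# discussed above and do not correspond to any real implementation.

def extrapolate(person):
    """'E': model the person as smarter, wiser, better informed."""
    ...

def make_coherent(volition):
    """Resolve one individual's internal contradictions."""
    ...

def combine(volitions):
    """'C' as usually read: merge many people's volitions into one."""
    ...

def cev(people):
    # The single letter 'C' has to cover both make_coherent and combine,
    # which is the ambiguity being pointed at above.
    return combine([make_coherent(extrapolate(p)) for p in people])
```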
Spread describes cases where your extrapolated volition becomes unpredictable, intractable, or random. You might predictably want a banana tomorrow, or predictably not want one; or your wanting a banana tomorrow might have only a 30% probability, depending on variables that are quantum-random, deterministic but unknown, or computationally intractable. When multiple outcomes are possible and probable, this creates spread in your extrapolated volition.
Muddle measures self-contradiction, inconsistency, and cases of “damned if you do and damned if you don’t”. Suppose that if you got a banana tomorrow you would not want a banana, and if you didn’t get a banana you would indignantly complain that you wanted a banana. This is muddle.
Distance measures how difficult it would be to explain your volition to your current self, and the degree to which the volition was extrapolated by firm steps.
Short distance: An extrapolated volition that you would readily agree with if explained.
Medium distance: An extrapolated volition that would require extended education and argument before it became massively obvious in retrospect.
Long distance: An extrapolated volition your present-day self finds incomprehensible; not outrageous or annoying, but blankly incomprehensible.
Ground zero: Your actual decision.
...
Coherence: Strong agreement between many extrapolated individual volitions which are un-muddled and un-spread in the domain of agreement, and not countered by strong disagreement.
Coherence:
Increases, as more humans actively agree.
Decreases, as more humans actively disagree. (The strength of opposition decreases if the opposition is muddled.)
Increases, as individuals support their wishes more, with stronger emotions or more settled philosophy.
It should be easier to counter coherence than to create coherence.
The quoted passages above are from http://singinst.org/upload/CEV.html; I added some emphasis to explain why I understand it the way I do.
So coherence is something done after un-muddling.
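To make that ordering concrete, here is a toy sketch in which spread, muddle, and coherence are given made-up numerical stand-ins (the CEV document defines them only informally, so every formula below is an illustrative assumption): spread as uncertainty over an individual’s extrapolated choice, muddle as “damned if you do, damned if you don’t” self-contradiction, and coherence as agreement across individuals in which muddled opposition is discounted and disagreement is weighted more heavily than agreement.

```python
# Toy, made-up quantifications of "spread", "muddle", and "coherence";
# every formula here is an illustrative assumption, not the CEV document's.

import math

def spread(outcome_probs):
    """Entropy of the distribution over extrapolated outcomes.
    0.0 = perfectly predictable volition; higher = more spread."""
    return -sum(p * math.log2(p) for p in outcome_probs.values() if p > 0)

def muddle(wants_if_given, complains_if_not_given):
    """1.0 if 'damned if you do, damned if you don't': the person would not
    want the banana if given it, yet would complain without it."""
    return 1.0 if (not wants_if_given and complains_if_not_given) else 0.0

def coherence(votes, muddles, counter_weight=2.0):
    """Agreement minus disagreement across individuals.

    Each vote is +1 (actively agree), -1 (actively disagree), 0 (indifferent).
    Muddled individuals are discounted, and disagreement is weighted more
    heavily (counter_weight > 1), so countering coherence is easier than
    creating it.
    """
    score = 0.0
    for person, vote in votes.items():
        weight = 1.0 - muddles.get(person, 0.0)   # un-muddling comes first
        score += weight * (vote if vote > 0 else counter_weight * vote)
    return score

# Examples: a partly unpredictable banana preference, one muddled individual.
print(spread({"wants_banana": 0.3, "no_banana": 0.7}))             # ~0.88 bits
print(muddle(wants_if_given=False, complains_if_not_given=True))   # 1.0
print(coherence({"alice": +1, "bob": +1, "carol": -1},
                muddles={"carol": 1.0}))                           # 2.0
```

Note that muddle is scored per individual before coherence is scored across individuals, mirroring the “coherence after un-muddling” reading above; carol’s muddled opposition contributes nothing, while un-muddled opposition would count double.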