Eliezer Yudkowsky comments on The Meaning of Right

Eliezer Yudkowsky 10 Sep 2009 8:12 UTC
7 points
It seems to me that if you build a Friendly AI, you ought to build it to act where coherence exists and not act where it doesn’t.
- Wei Dai 10 Sep 2009 18:48 UTC
  26 points
  Parent
  What makes you think that any coherence exists in the first place? Marcello’s argument seems convincing to me. In the space of possible computations, what fraction gives the same final answer regardless of the order of inputs presented? Why do you think that the “huge blob of computation” that is your morality falls into this small category? There seems to be plenty of empirical evidence that human morality is in fact sensitive to the order in which moral arguments are presented.
  
  Or think about it this way. Suppose an (unFriendly) SI wants to craft an argument that would convince you to adopt a certain morality and then stop paying attention to any conflicting moral arguments. Could it do so? Could it do so again with a different object-level morality on someone else? (This assumes there’s an advantage to being first, as far as giving moral arguments to humans is concerned. Adjust the scenario accordingly if there’s an advantage in being last instead.)
  
  You say the FAI won’t act where coherence doesn’t exist, but if you don’t expect coherence now, shouldn’t you be doing something other than building such an FAI, or at least have a contingency plan for when it halts without giving any output?
  - Eliezer Yudkowsky 11 Sep 2009 1:37 UTC
    5 points
    Parent
    
    What makes you think that any coherence exists in the first place?
    
    Most people wouldn’t want to be turned into paperclips?
    - Wei Dai 11 Sep 2009 4:17 UTC
      35 points
      Parent
      
      Most people wouldn’t want to be turned into paperclips?
      
      Of course not, since they haven’t yet heard the argument that would make them want to. All the moral arguments we’ve heard so far have been invented by humans, and we just aren’t that inventive. Even so, we have the Voluntary Human Extinction Movement.
      - Eliezer Yudkowsky 11 Sep 2009 22:05 UTC
        16 points
        Parent
        Wei, suppose I want to help someone. How ought I to do so?
        
        Is the idea here that humans end up anywhere depending on what arguments they hear in what order, without the overall map of all possible argument orders displaying any sort of concentration in one or more clusters where lots of endpoints would light up, or any sort of coherency that could be extracted out of it?
        Wei Dai 11 Sep 2009 22:29 UTC
        35 points
        Parent
        
        Wei, suppose I want to help someone. How ought I to do so?
        
        I don’t know. (I mean I don’t know how to do it in general. There are some specific situations where I do know how to help, but lots more where I don’t.)
        
        Is the idea here that humans end up anywhere depending on what arguments they hear in what order, without the overall map of all possible argument orders displaying any sort of concentration in one or more clusters where lots of endpoints would light up, or any sort of coherency that could be extracted out of it?
        
        Yes. Or another possibility is that the overall map of all possible argument orders does display some sort of concentration, but that concentration is morally irrelevant. Human minds were never “designed” to hear all possible moral arguments, so where the concentration occurs is accidental, and perhaps horrifying from our current perspective. (Suppose the concentration turns out to be voluntary extinction or something worse. Would you bite the bullet and let the FAI run with it?)
    - CarlShulman 11 Sep 2009 4:29 UTC
      16 points
      Parent
      A variety of people profess to consider this desirable if it leads to powerful intelligent life filling the universe with higher probability or greater speed. I would bet that there are stable equilibria that can be reached with arguments.
      - RHollerith 11 Sep 2009 6:00 UTC
        10 points
        Parent
        Carl says that a variety of people profess to consider it desirable that present-day humans get disassembled “if it leads to powerful intelligent life filling the universe with higher probability or greater speed.”
        
        Well, yeah, I’m not surprised. Any system of valuing things in which every life, present and future, has the same utility as every other life will lead to that conclusion, because turning the existing living beings and their habitat into computronium, von Neumann probes, etc., to hasten the start of the colonization of the light cone by a few seconds will have positive expected marginal utility under that system.
        jacob_cannell 2 Feb 2011 2:04 UTC
        1 point
        Parent
        That could still be a great thing for us provided that current human minds were uploaded into the resulting computronium explosion.
        anon895 2 Feb 2011 3:21 UTC
        2 points
        Parent
        ...which won’t happen if the computronium is the most important thing and uploading existing minds would slow it down. The AI might upload some humans to get their cooperation during the early stages of takeoff, but it wouldn’t necessarily keep those uploads running once it no longer depended on humans, if the same resources could be used more efficiently for itself.
        dxu 17 Apr 2015 21:13 UTC
        2 points
        Parent
        To get my cooperation, at least, it would have to credibly precommit that it wouldn’t just turn my simulation off after it no longer needs me. (Of course, the meaning of the word “credibly” shifts somewhat when we’re talking about a superintelligence trying to “prove” something to a human.)
- thomblake 18 May 2012 13:10 UTC
  1 point
  Parent
  
  It seems to me that if you build a Friendly AI, you ought to build it to act where coherence exists and not act where it doesn’t.
  
  Is “not act” a meaningful option for a Singleton?