Nick_Beckstead comments on Tiling Agents for Self-Modifying AI (OPFAI #2)

Nick_Beckstead 6 Jun 2013 22:22 UTC
4 points
All that said, I do think that where MIRI has technical FAI questions to work on now, I think it is a very reasonable to write up:
- what the question is
- why answering the question is important for making machine intelligence benefit humanity
- why we shouldn’t expect the question to be answered by default by whoever makes AGI
In this particular case, I am asking for more info about the second two questions.
What links here?
- Nick_Beckstead's comment on Tiling Agents for Self-Modifying AI (OPFAI #2) by Eliezer Yudkowsky (7 Jun 2013 12:22 UTC; 5 points)
- lukeprog 6 Jun 2013 23:01 UTC
  4 points
  Parent
  For convenience: “SuperBenefit” = increasing the probability that advanced machine intelligence has a positive impact.
  
  I agree that MIRI has a lot left to explain with respect to questions #2 and #3, but it’s easier to explain those issues when we’ve explained #1 already, and we’ve only just begun to do that with AI forecasting, IEM, and Tiling Agents.
  
  Presumably the relevance of AI forecasting and IEM to SuperBenefit is clear already?
  
  In contrast, it does seem like the relevance of the Tiling Agents work to SuperBenefit is unclear to many people, and that more explanation is needed there. Now that Tiling Agents has been published, Eliezer has begun to explain its relevance to SuperBenefit in various places on this page, but it will take a lot of trial and error for us to discover what is and isn’t clear to people.
  
  As for question #3, we’ve also only just begun to address that issue in detail.
  
  So, MIRI still has a lot of explaining to do, and we’re working on it. But allow me a brief reminder that this gap isn’t unique to MIRI at all. Arguing for the cost effectiveness of any particular intervention given the overwhelming importance of the far future is extremely complicated, whether it be donating to AMF, doing AI risk strategy, spreading rationality, or something else.
  
  E.g. if somebody accepts the overwhelming importance of the far future and is donating to AMF, they have roughly as much explaining to do as MIRI does, if not more.
  What links here?
  - lukeprog's comment on Tiling Agents for Self-Modifying AI (OPFAI #2) by Eliezer Yudkowsky (6 Jun 2013 23:07 UTC; 0 points)
  - Nick_Beckstead 7 Jun 2013 1:42 UTC
    1 point
    Parent
    
    Presumably the relevance of AI forecasting and IEM to SuperBenefit is clear already?
    
    Yes.
    
    So, MIRI still has a lot of explaining to do, and we’re working on it. But allow me a brief reminder that this gap isn’t unique to MIRI at all. Arguing for the cost effectiveness of any particular intervention given the overwhelming importance of the far future is extremely complicated, whether it be donating to AMF, doing AI risk strategy, spreading rationality, or something else. E.g. if somebody accepts the overwhelming importance of the far future and is donating to AMF, they have roughly as much explaining to do as MIRI does, if not more.
    
    I basically agree with these comments, with a couple of qualifications.
    
    I think it’s unique to MIRI in the sense that it makes sense for MIRI to be expected to explain how its research is going to accomplish its mission of making machine intelligence benefit humanity, whereas it doesn’t make sense for global health charities to be expected to explain why improving global health makes the far future go better. This means MIRI has an asymmetrically hard job, but I do think it’s a reasonable division of labor.
    
    I think it makes sense for other people who care about the far future to evaluate how the other strategies you mentioned are expected to affect the far future, and try to find the best ones. There is an overwhelming amount of work to do.
    - lukeprog 7 Jun 2013 3:47 UTC
      6 points
      Parent
      
      I think it’s unique to MIRI in the sense that it makes sense for MIRI to be expected to explain how its research is going to accomplish its mission of making machine intelligence benefit humanity, whereas it doesn’t make sense for global health charities to be expected to explain why improving global health makes the far future go better.
      
      Right. Very few charities are even claiming to be good for the far future. So there’s an asymmetry between MIRI and other charities w.r.t. responsibility to explain plausible effects on the far future. But among parties (including MIRI) who care principally about the far future and are trying to do something about it, there seems to be no such asymmetry — except for other reasons, e.g. asymmetry in resource use.
      - Nick_Beckstead 7 Jun 2013 11:54 UTC
        1 point
        Parent
        Yes.
  - jsteinhardt 7 Jun 2013 14:16 UTC
    0 points
    Parent
    
    So, MIRI still has a lot of explaining to do, and we’re working on it. But allow me a brief reminder that this gap isn’t unique to MIRI at all. Arguing for the cost effectiveness of any particular intervention given the overwhelming importance of the far future is extremely complicated, whether it be donating to AMF, doing AI risk strategy, spreading rationality, or something else.
    
    I agree with this. Typically people justify their research on other grounds than this, for instance by identifying an obstacle to progress and showing how their approach might overcome it in a way that previously tried approaches were not able to. My impression is that one reason for doing this is that it is typically much easier to communicate along these lines, because it brings the discourse towards much more familiar technical questions while still correlating well with progress more generally.
    
    Note that under this paradigm, the main thing MIRI needs to do to justify their work is to explain why the Lob obstacle is insufficiently addressed by other approaches (for instance, statistical learning theory). I would actually be very interested in understanding the relationship of statistics to the Lob obstacle, so look forward to any writeup that might exist in the future.