It’s hard for me to imagine 100 good papers on the subject of AI safety (as opposed to, say, FAI design). Once you have 10 good papers with variations of “AGI is dangerous, please be careful!”, what can you say in the 11th one that you haven’t already said? Also, 100 papers all carrying the same basic message, all funded by the same organization… that seems a bit surreal.
ETA: Sorry, I’m being overly skeptical and nitpicking. On reflection I think something like this probably is a good idea and should be pursued (unless money is a constraint and someone can come up with a better use for it).
ETA2: If someone has done serious thinking about the feasibility of convincing a substantial fraction of AGI researchers of the need for safety by “publishing X good-quality papers”, could they please explain their thoughts in more detail? (My mind keeps changing about whether this is feasible or not.)
There’s a lot to say at one remove: things like stability analyses of particular strategies for implementing goal systems, general safety measures such as fake network interfaces, friendliness analyses of hypothetical programs, and so on. A paper can impart the idea that safety is important without being directly about safety. (In fact, there’s some reason to suspect that articles one layer removed may be better than articles that are directly about safety.)
This seems right. One additional thing to note, however, is that while it looks quite likely that good papers lead to improvements at the margin, high-publicity bad work can harm a developing field’s prospects and reputation, and thus outsiders’ desire to affiliate with it. Robin Hanson emphasizes this point a lot.
Carl, are you saying that the non-SIAI-affiliated qualified academics among us should attempt to get high-publicity bad papers published advocating anything-goes AGI design, without regard for safety?
No, for many reasons, including the following:
- Such things are very likely to backfire, more so than they seem; we live in a world of substantial transparency, and dirty laundry gets found.
- Being the kind of people who would do such things would have bad effects and would sabotage friendly cooperation with the very AI folk whose cooperation is so important.
- There is already a lot of material along these lines.
- Folk actually in a position to do such things would better spend their limited time, reputation, and commitment on other projects.
My impression is that the bridges are mostly burned there. For years, the SIAI has been campaigning against other projects, in the hope of denying them mindshare and funding.
We have Yudkowsky saying, “And if Novamente should ever cross the finish line, we all die,” and saying he will try to make various other AI projects “look merely stupid”.
I expect the SIAI looks to most others in the field like a secretive competing organisation that likes to use negative marketing techniques. Implying that your rivals will destroy the world is an old marketing trick that takes us back to the Daisy Ad. This is not necessarily the kind of organisation one would want to affiliate with.