Couple of thoughts re: NIST as an institution:
FWIW NIST, both at Gaithersburg and Boulder, is very well-regarded in AMO physics and slices of condensed matter. Bill Phillips (https://www.nist.gov/people/william-d-phillips) is one big name there, but AIUI they do a whole bunch of cool stuff. Which doesn’t say much one way or the other about how the AI Safety Institute will go! But NIST has a record of producing, or at least sheltering, very good work in the general vicinity of their remit.
Don’t knock metrology! It’s really, really hard, subtle, and creative, and you all of a sudden have to worry about things nobody has ever thought about before! I’m a condensed matter theory guy, more or less, but I go to metrology talks every once in a while and have my mind blown.
I agree metrology is cool! But I think units are mostly helpful for engineering insofar as they reflect fundamental laws of nature—see e.g. the metric units—and we don’t have those yet for AI. Until we do, I expect attempts to define them will be vague, high-level descriptions more than deep scientific understanding.
(And I think the former approach has a terrible track record, at least when used to define units of risk or controllability—e.g. BSL levels, which have failed so consistently and catastrophically they’ve induced an EA cause area, and which for some reason AI labs are starting to emulate).
In what sense have the BSL levels failed consistently or catastrophically?
Even if you think COVID-19 was a lab leak, the BSL levels themselves would have suggested that the BSL-2 containment Wuhan used for its coronavirus gain-of-function research was not enough.
There have been frequent and severe biosafety accidents for decades, many of which occurred at labs that were attempting to follow BSL protocols.
That doesn’t seem like “consistently and catastrophically,” it seems like “far too often, but with thankfully fairly limited local consequences.”
BSL levels, which have failed so consistently and catastrophically they’ve induced an EA cause area,
This is confused and wrong, in my view. The EA cause area around biorisk is mostly happy to rely on those levels, and unlike for AI, the (very useful) levels predate EA interest and give us something to build on. The questions are instead largely about whether to allow certain classes of research at all, the risks from those who intentionally do things that are forbidden, and how new technology changes the risk.
The EA cause area around biorisk is mostly happy to rely on those levels
I disagree—I think nearly all EAs focused on biorisk think gain of function research should be banned, since the risk-management framework doesn’t work well enough to drive the expected risk below the expected benefit. If our framework for preventing lab accidents worked as well as, e.g., our framework for preventing plane accidents, I think few EAs would worry much about GoF.
(Obviously there are non-accidental sources of biorisk too, for which we can hardly blame the safety measures; but I do think the measures work sufficiently poorly that even accident risk alone would justify a major EA cause area).
[Added April 28th: In case someone reads my comment without this context: David has made a number of worthwhile contributions to discussions of biological existential risks (e.g. 1, 2, 3), has worked professionally in this area, and his contributions on this topic are quite often well worth engaging with. Here I just intended to add that, in my opinion, early in the Covid pandemic he messed up pretty badly in one or two critical discussions around mask effectiveness and censoring criticism of the CDC. Perhaps that’s not saying much, because the base rate for relevant experts dealing with Covid is that they were also very off the mark. Furthermore, David’s June 2020 post-mortem of his mistakes was a good public service, even though I don’t agree with his self-assessment in all cases. Overall I think his arguments are often well worth engaging with.]
I’m not in touch with the ground truth in this case, but for those reading along without knowing the context, I’ll mention that it wouldn’t be the first time that David has misrepresented what people in the Effective Altruism Biorisk professional network believe[1].
(I will mention that David later apologized for handling that situation poorly and wasting people’s time[2], which I think reflects positively on him.)
See Habryka’s response to Davidmanheim’s comment here from March 7th 2020, such as this quote:
Overall, my sense is that you made a prediction that people in biorisk would consider this post an infohazard that had to be prevented from spreading (you also reported this post to the admins, saying that we should “talk to someone who works in biorisk at FHI, Openphil, etc. to confirm that this is a really bad idea”).
We have now done so, and in this case others did not share your assessment (and I expect most other experts would give broadly the same response).
See David’s own June 25th reply to the same comment.
My guess is more that we were talking past each other than that his intended claim was false/unrepresentative. I do think it’s true that EAs mostly talk about people doing gain of function research as the problem, rather than about the insufficiency of the safeguards; I just think the latter is why the former is a problem.
The OP claimed a failure of BSL levels was the single thing that induced biorisk as a cause area, and I said that was a confused claim. Feel free to find someone who disagrees with me here, but the proximate causes of EAs worrying about biorisk have nothing to do with BSL lab designations. It’s not BSL levels that failed in allowing things like the Soviet bioweapons program, or that led to the underfunded and largely unenforceable BWC, or to the way that newer technologies are reducing the barriers to terrorists and others being able to pursue bioweapons.
I think we must still be missing each other somehow. To reiterate, I’m aware that there is non-accidental biorisk, for which one can hardly blame the safety measures. But there is also accident risk, since labs often fail to contain pathogens even when they’re trying to.
Having written extensively about it, I promise you I’m aware. But please, tell me more about how this supports the original claim I have been disagreeing with: that this class of incidents was or is the primary concern of the EA biosecurity community, the one that led to it being a cause area.
I agree there are other problems the EA biosecurity community focuses on, but surely lab escapes are one of those problems, and part of the reason we need biosecurity measures? In any case, this disagreement seems beside the main point I took Adam to be making, namely that the track record for defining appropriate units of risk for poorly understood, high-attack-surface domains is quite bad (as with BSL). This still seems true to me.
BSL isn’t the thing that defines “appropriate units of risk”—that’s pathogen risk-group levels—and I agree that those are a problem because they focus on pathogen lists rather than actual risks. I actually think BSLs are good at what they do, and the problem is regulation and oversight, which are patchy, as well as transparency, of which there is far too little. But those are issues with oversight, not with the types of biosecurity measures that are available.
This thread isn’t seeming very productive to me, so I’m going to bow out after this. But yes, it is a primary concern—at least in the case of Open Philanthropy, it’s easy to check what their primary concerns are because they write them up. And accidental release from dual use research is one of them.
If you’re appealing to OpenPhil, it might be useful to ask one of the people who was working with them on this as well.
And you’ve now equivocated between “they’ve induced an EA cause area” and a list of the range of risks covered by biosecurity—not their primary concerns—citing this as “one of them.” I certainly agree that biosecurity levels are one of the things biosecurity is about, and that “the possibility of accidental deployment of biological agents” is a key issue, but that’s incredibly far removed from the original claim that the failure of BSL levels induced the cause area!
I did not say that they didn’t want to ban things; I explicitly said “whether to allow certain classes of research at all.” And when I said “happy to rely on those levels,” I meant that the idea that we should have “BSL-5” is the kind of silly thing that novice EAs propose that doesn’t make sense because there literally isn’t something significantly more restrictive other than just banning it.
I also think that “nearly all EAs focused on biorisk think gain of function research should be banned” is obviously underspecified, and wrong because of the details. Yes, we all think that there is a class of work that should be banned, but tons of work that would be called gain of function isn’t in that class.
the idea that we should have “BSL-5” is the kind of silly thing that novice EAs propose that doesn’t make sense because there literally isn’t something significantly more restrictive
I mean, I’m sure something more restrictive is possible. But my issue with BSL levels isn’t that they include too few BSL-type restrictions, it’s that “lists of restrictions” are a poor way of managing risk when the attack surface is enormous. I’m sure someday we’ll figure out how to gain this information in a safer way—e.g., by running simulations of GoF experiments instead of literally building the dangerous thing—but at present, the best available safeguards aren’t sufficient.
I also think that “nearly all EAs focused on biorisk think gain of function research should be banned” is obviously underspecified, and wrong because of the details.
I’m confused why you find this underspecified. I just meant “gain of function” in the standard, common-use sense—e.g., that used in the 2014 ban on federal funding for such research.
I mean, I’m sure something more restrictive is possible.
But what? Should we insist that the entire time someone’s inside a BSL-4 lab, we have a second person, an expert in biosafety, visually monitoring them to ensure they don’t make mistakes? Or should their air supply skip filters and completely safe PAPRs, and instead feed them outside air through a tube that restricts their ability to move around? (Edit to add: both of these are already required in BSL-4 labs. When I said I don’t know of anything more restrictive they could do, I was being essentially literal—they do everything, including quite a number of unreasonable things, to prevent human infection, short of just not doing the research.)
Or do you have some new idea that isn’t just a ban with more words?
“lists of restrictions” are a poor way of managing risk when the attack surface is enormous
Sure, list-based approaches are insufficient, but they have relatively little to do with the biosafety levels of labs; they have to do with risk groups, which are distinct but often conflated. (So Ebola or smallpox isn’t a “BSL-4” pathogen, because there is no such thing.)
I just meant “gain of function” in the standard, common-use sense—e.g., that used in the 2014 ban on federal funding for such research.
That ban didn’t go far enough, since it applied to only three pathogen types, and it wouldn’t have banned what Wuhan was doing with novel viruses, since that work wasn’t with SARS or MERS but with other species of virus. So sure, we could enforce a broader version of that ban, but getting a good definition that’s both extensive enough to prevent dangerous work and doesn’t ban obviously useful research is very hard.