I can’t recall hearing this take from Anthropic people before.
V surprising! I think of it as a standard refrain (when explaining why it’s ethically justified to have another competitive capabilities company at all). But not sure I can link to a crisp example of it publicly.
(I work on capabilities at Anthropic.) Speaking for myself, I think of international race dynamics as a substantial reason that trying for global pause advocacy in 2024 isn’t likely to be very useful (and this article updates me a bit towards hope on that front). But I think US/China considerations get less than 10% of the Shapley value in my decision that working at Anthropic would probably decrease existential risk on net, at least at the scale of “China totally disregards AI risk” vs “China is kinda moderately into AI risk but somewhat less than the US”. If the world instead looked like China taking it really, really seriously, eg independently advocating for global pause treaties with teeth on the basis of x-risk in 2024, then I’d have to reassess a bunch of things about my model of the world, and I don’t know where I’d end up.
My explanation of why I think it can be good for the world to work on improving model capabilities at Anthropic looks like an assessment of a long list of pros and cons and murky things of nonobvious sign (eg safety research on more powerful models, risk of leaks to other labs, race/competition dynamics among US labs) without a single crisp narrative. But “have the US win the AI race” doesn’t show up prominently in that list for me.
Ah, here’s a helpful quote from a TIME article:

On the day of our interview, Amodei apologizes for being late, explaining that he had to take a call from a “senior government official.” Over the past 18 months he and Jack Clark, another co-founder and Anthropic’s policy chief, have nurtured closer ties with the Executive Branch, lawmakers, and the national-security establishment in Washington, urging the U.S. to stay ahead in AI, especially to counter China. (Several Anthropic staff have security clearances allowing them to access confidential information, according to the company’s head of security and global affairs, who declined to share their names. Clark, who is originally British, recently obtained U.S. citizenship.) During a recent forum at the U.S. Capitol, Clark argued it would be “a chronically stupid thing” for the U.S. to underestimate China on AI, and called for the government to invest in computing infrastructure. “The U.S. needs to stay ahead of its adversaries in this technology,” Amodei says. “But also we need to provide reasonable safeguards.”
Seems unclear whether those are their true beliefs or just the rhetoric they believed would work in DC.
The latter could be perfectly benign. For example, you might think labs need better cybersecurity to stop North Korea from getting the weights; since the same measures would also stop China from getting them, you emphasize the China angle when talking to national-security people as a form of common ground.
My (maybe wildly off) understanding from several such conversations is that people tend to say:
- We think that everyone is already racing super hard, so the marginal effect of pushing harder isn’t that high.
- Having great models is important for letting Anthropic push on good policy and do great safety work.
- We have an RSP and take it seriously, so we think we’re unlikely to directly do harm by making dangerous AI ourselves.
China tends not to explicitly come up, though I’m not confident it’s not a factor.
(to be clear, the above is my rough understanding from a range of conversations, but I expect there’s a diversity of opinions and I may have misunderstood)
The standard refrain is that Anthropic is better than [the counterfactual, especially OpenAI but also China], I think.
Worry about China gives you as much reason to work on capabilities at OpenAI etc. as at Anthropic.
Oh yeah, I agree with the last sentence; I just guess that OpenAI has way more employees who are like “I don’t really give these abstract existential risk concerns much thought, this is a cool/fun/exciting job,” and Anthropic has way more people who are like “I care about doing the most good and so I’ve decided that helping this safety-focused US company win this race is the way to do that.” But I might well be mistaken about what the current ~2.5k OpenAI employees think; I don’t talk to them much!