After returning to OpenAI just five days after he was ousted, Mr. Altman reasserted his control and continued its push toward increasingly powerful technologies that worried some of his critics. Dr. Sutskever remained an OpenAI employee, but he never returned to work.
Was this already known? I didn't know he "didn't get back to work", although admittedly I wasn't tracking the issue very closely.
Yeah, it’s been a bit of a meme (“where is Ilya?”). See e.g. Gwern’s comment thread here.
Yeah, that meme did reach me. But I was just assuming Ilya got back (was told to get back) to doing the usual Ilya superalignment things and decided (was told) not to stick his neck out.
He might have returned to work, but agreed to no external comms.
Interesting to watch Sam Altman talk about it here at timestamp 18:40:
Notably, this interview was on March 18th and is, afaik, the highest-level interview in which Altman has given his two cents since the incident. There's a transcript here. (There was also this podcast a couple of days ago.)
I think a Dwarkesh-Altman podcast would be more likely to arrive at more substance from Altman’s side of the story. I’m currently pretty confident that Dwarkesh and Altman are sufficiently competent to build enough trust to make sane and adequate pre-podcast agreements (e.g. don’t be an idiot who plays tons of one-shot games just because podcast cultural norms are more vivid in your mind than game theory), but I might be wrong about this; trailblazing the frontier of making-things-happen, like Dwarkesh and Altman are, is a lot harder than thinking about the frontier of making-things-happen.
There’s also this podcast from just yesterday. It’s really good. Sam continues to say all the right things; in fact, I think this is the most reassuring he’s ever been on taking the societal risks seriously, if not necessarily the existential risks. Which leaves me baffled. He’s certainly a skilled enough social engineer to lie convincingly, but he sounds so dang sincere. I’m weakly concluding for the moment that he just doesn’t think the alignment problem is that hard. I think that’s wrong, but the matter is curiously murky, so it’s not necessarily an irrational opinion to hold. Getting more meaningful discussion between optimistic and pessimistic alignment experts would help close that gap.
He has a stance towards risk that is a necessary condition for becoming the CEO of a company like OpenAI, but that doesn't give you a high probability of building a safe ASI:
https://blog.samaltman.com/what-i-wish-someone-had-told-me
“Inaction is a particularly insidious type of risk.”
https://blog.samaltman.com/how-to-be-successful
“Most people overestimate risk and underestimate reward.”
https://blog.samaltman.com/upside-risk
“Instead of downside risk [2], more investors should think about upside risk—not getting to invest in the company that will provide the return everyone is looking for.”
There are a couple of reasons why, even though Sam is surrounded by people like Ilya and Jan warning him of the consequences, he's currently unwilling to change course:
Staring down the possibility of AGI ruin means staring into a deeply scary abyss. Doubly so when Sam himself would be the one at fault.
Like many political leaders, he's unable to let go of power because he believes that he himself is the most capable of wielding it, even when that belief leads to actions worse than what either side would do independently if they weren't trying to wrest power away from each other.
More than most, Sam envisions the upside. He's a vegetarian and cares a lot about animal suffering ("someday eating meat is going to get cancelled, and we are going to look back upon it with horror. we have a collective continual duty to find our way to better morals"), and he sees AGI as having the power to end that suffering. He talks a lot about curing cancer and other objectively good technological progress. He probably sees it as: heads, I live forever and am remembered as the person who brought utopia; tails, I'm dead anyway, like I would be in 50 years.
Ultimately, I think Sama sees the coin flip (though he believes the utopia-AGI side is the more likely one) and, being the ever-optimist, is willing to flip it for the good outcomes, because it's too scary to believe the abyss side could actually happen.
Actually, as far as I know, this is wrong. He simply hasn’t been back to the offices but has been working remotely.
https://www.vox.com/future-perfect/2024/5/17/24158403/openai-resignations-ai-safety-ilya-sutskever-jan-leike-artificial-intelligence
This article goes into some detail and seems quite good.