As recently as early 2023 Eliezer was very pessimistic about AI policy efforts amounting to anything, to the point that he thought anyone trying to do AI policy was hopelessly naive and should first try to ban biological gain-of-function research just to understand how hard policy is. Given how influential Eliezer is, he loses a lot of points here (and I guess Hendrycks wins?)
Then Eliezer updated and started e.g. giving podcast interviews. Policy orgs spun up and there are dozens of safety-concerned people working in AI policy. But this is not reflected in the LW frontpage. Is this inertia, or do we like thinking about computer science more than policy, or is it something else?
It depends on the overall probability distribution. Previously Eliezer thought something like p(doom | trying to solve alignment) = 50% and p(doom | trying to get an AI ban without alignment) = 99%, and then updated to p(doom | trying to solve alignment) = 99% and p(doom | trying to get an AI ban without alignment) = 95%, which makes working on an AI ban worthwhile even though it is still pretty much doomed. But if you are, say, Alex Turner, you could start with the same probabilities but update towards p(doom | trying to solve alignment) = 10%, which makes publishing papers on steering vectors very reasonable.
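To make the decision logic explicit, here is a minimal sketch (the numbers are the illustrative ones from the paragraph above, and the function and strategy labels are mine, not anyone's actual estimates): whichever strategy has the lower p(doom | strategy) is the one worth working on, so the same rule picks different strategies under different updates.

```python
def best_strategy(p_doom_given: dict[str, float]) -> str:
    """Return the strategy with the lowest conditional probability of doom."""
    return min(p_doom_given, key=p_doom_given.get)

# Eliezer-style update: alignment looks nearly hopeless, so a ban wins despite long odds.
eliezer_2023 = {"solve alignment": 0.99, "AI ban without alignment": 0.95}

# Alex-Turner-style update: alignment looks tractable, so alignment research wins.
turner = {"solve alignment": 0.10, "AI ban without alignment": 0.99}

print(best_strategy(eliezer_2023))  # -> "AI ban without alignment"
print(best_strategy(turner))        # -> "solve alignment"
```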
Other reasons:
I expect the majority of policy people to be on the EA Forum, though maybe I am wrong;
Kat Woods has a long Twitter thread about how posting on Twitter is much more useful than posting on LW/AF/EAF in terms of public outreach.
Seems reasonable except that Eliezer’s p(doom | trying to solve alignment) in early 2023 was much higher than 50%, probably more like 98%. AGI Ruin was published in June 2022 and drafts existed since early 2022. MIRI leadership had been pretty pessimistic ever since AlphaGo in 2016 and especially since their research agenda collapsed in 2019.
I am talking about his belief state in ~2015, because everyone was already skeptical about the policy approach at that time.