Leon Lang comments on Leon Lang’s Shortform

Leon Lang 29 Aug 2024 5:22 UTC
5 points
0
40 min podcast with Anca Dragan who leads safety and alignment at google deepmind: https://youtu.be/ZXA2dmFxXmg?si=Tk0Hgh2RCCC0-C7q
- Zach Stein-Perlman 29 Aug 2024 7:08 UTC
  9 points
  0
  Parent
  I listened to it. I don’t recommend it. Anca seems good and reasonable but the conversation didn’t get into details on misalignment, scalable oversight, or DeepMind’s Frontier Safety Framework.
  - Neel Nanda 29 Aug 2024 8:16 UTC
    6 points
    1
    Parent
    My read is that the target audience is much more about explaining alignment concerns to a mainstream audience and that GDM takes them seriously (which I think is great!), than about providing non trivial details to a LessWrong etc audience
  - Leon Lang 29 Aug 2024 7:47 UTC
    4 points
    0
    Parent
    Agreed.
    
    I think the most interesting part was that she made a comment that one way to predict a mind is to be a mind, and that that mind will not necessarily have the best of all of humanity as its goal. So she seems to take inner misalignment seriously.