A month or two ago I did a podcast discussion with Undark, together with Arvind Narayanan of AI Snake Oil: https://undark.org/2024/11/11/podcast-will-artificial-intelligence-kill-us-all/
Also, I did this podcast with Dean Ball and Nathan Labenz: https://www.cognitiverevolution.ai/agi-lab-transparency-requirements-whistleblower-protections-with-dean-w-ball-daniel-kokotajlo/
For those reading this who take the time to listen to either of these: I’d be extremely interested in brutally honest reactions and criticism, and of course in general comments as well.
10 more years till interpretability? That’s crazy talk. What do you mean by that and why do you think it? (And if it’s a low bar, why do you have such a low bar?)
“Pre-AGI we should be comfortable with proliferation” Huh? Didn’t you just get done saying that pre-AGI AI is going to contribute meaningfully to research (such as AGI research)?
I don’t remember what I meant when I said that, but I think it’s a combination of a low bar and ‘10 years is a long time for an active, fast-moving field like ML.’ Low bar = we don’t need to have completely understood everything; we just need to be able to say, e.g., ‘This AI is genuinely trying to carry out our instructions, no funny business, because we’ve checked for various forms of funny business and our tools would notice if it was happening.’ Then we get the AIs to solve lots of alignment problems for us.
Yep, I think pre-AGI we should be comfortable with proliferation. I think it won’t substantially accelerate AGI research until we are leaving the pre-AGI period, shall we say, and need to transition to some sort of slowdown or pause. (I agree that when AGI R&D starts to 2x or 5x due to AI automating much of the process, that’s when we need the slowdown/pause.)
I also think that people used to think ‘there need to be fewer actors with access to powerful AI technology, because that’ll make it easier to coordinate,’ and that now seems almost backwards to me. There are enough actors racing to AI that adding additional actors (especially within the US) actually makes it easier to coordinate, because it’ll be the government doing the coordinating anyway (via regulation and executive orders), and the bottleneck is ignorance / lack of information.
I think it’s a high bar due to the nearest unblocked strategy problem and alienness.
“I agree that when AGI R&D starts to 2x or 5x due to AI automating much of the process, that’s when we need the slowdown/pause.”
If you only start stopping proliferation when you’re a year away from some runaway thing, then everyone has the tech that’s one year away from the thing. That makes it much harder to ensure that no one does the remaining research, compared to if the tech everyone has is 5 or 20 years away from the thing.
Did they have any points that you found especially helpful, surprising, or interesting? Anything you think folks in AI policy might not be thinking enough about?
(Separately, I hope to listen to these at some point & send reactions if I have any.)
Well, that went quite well. I think the two main changes I’d like to see are, first, a shift in attention from ‘AGI when?’ to more specific benchmarks/capabilities. Like: when will AI be able to replace 90% of the work of an AI researcher? (Can you say ‘SWE-bench saturated’? Maybe only in conversation with Arvind.)
And the second is to try to explicitly connect those benchmarks/capabilities directly to danger. Like, make the ol’ King Midas analogy, maybe? Or maybe just argue that high capabilities inherently bring instability and risk?