Signer comments on Thoughts on “AI is easy to control” by Pope & Belrose

Signer 10 Jan 2024 12:57 UTC
4 points
2

I think the fact that good humans have been able to keep rogue bad humans more-or-less under control for millennia is strong evidence that good AIs will be able to keep rogue AIs under control

Why? Like, what law of nature says that the trend, in these terms, should continue?
- Nora Belrose 10 Jan 2024 23:07 UTC
  −21 points
  −20
  Parent
  Game theory
  - Signer 11 Jan 2024 5:22 UTC
    1 point
    0
    Parent
    Yes, but the available strategies can differ for AIs vs. humans. Why assume they will be the same?
    
    Induction from history depends on its interpretation: we have more information than 1111111111 over {bad, not-so-bad}. It just feels like at the present point the crux between optimists and doomers is not about whether white-box access or the trained mind-space is better, but about how much it all updates you, and from what prior.