Rob Bensinger comments on AGI Ruin: A List of Lethalities

Rob Bensinger 12 Jun 2022 3:36 UTC
2 points
> most people in AI alignment think it’s possible that an AI could be trained to optimize for something like this.
I don’t think we have any idea how to do this. If we knew how to get an AGI system to reliably maximize the number of paperclips in the universe, that might be most of the (strawberry-grade) alignment problem solved right there.
- Evan R. Murphy 12 Jun 2022 7:32 UTC
  1 point
  You’re right, my mistake—of course we don’t know how to deliberately and reliably train a paperclip maximizer. I’ve updated the parent comment now to say:
  > most people in AI alignment think it’s possible that an AI like this could in principle emerge from training (though we don’t know how to reliably train one on purpose).