I (a member of Team Shard) think that there are plausible worlds where we don’t address any of the “core difficulties” of alignment, but survive anyways, just because making an AI aligned enough that it doesn’t literally want to kill everyone turns out to be a very low bar.
Great, glad to know it isn’t universal.
Same, and I think that this kind of “just don’t do anything actively stupid” world is actually reasonably likely (10%+).