Charbel-Raphaël comments on How could you possibly choose what an AI wants?

Charbel-Raphaël 8 May 2023 23:25 UTC
1 point
“None of these directly address what I’m calling The alignment stability problem, to give a name to what you’re addressing here.”
Maybe the alignment stability problem is the same thing as the sharp left turn?
- Seth Herd 9 May 2023 2:34 UTC
  1 point
  I don’t think so. That’s one breaking point for alignment, but I’m saying in that post that even if we avoid a sharp left turn and make it to an aligned, superintelligent AGI, its alignment may drift away from human values as it continues to learn. Learning may necessarily shift the meanings of existing concepts, including values.