Conditional on the AI never doing something like manipulating or deceiving[1] the humans into believing it is aligned so that it can later do things the humans don’t like, I am much more optimistic about the whole situation.
The AI could, on some level, not be “aware” that it was deceiving the humans, à la Deep Deceptiveness.