Donald Hobson comments on Why No Interesting Unaligned Singularity?

Donald Hobson 10 Aug 2022 10:35 UTC
2 points
Well if we screw up that badly with deceptive misalignment, that corresponds to crashing on the launchpad.
It is reasonably likely that humans will have some technique they use that is intended to minimize deceptive misalignment. Or that gradient descent shapes the goals to something similar to what we want before the AI is smart enough to be deceptive.

Donald Hobson comments on Why No *Interesting* Unaligned Singularity?