Yitz comments on “Fragility of Value” vs. LLMs

Yitz 13 Apr 2022 18:33 UTC
4 points
Even if the simulation is perfect, using human approval isn’t a solution to outer alignment, for reasons like deception and wireheading.
It honestly still seems far less likely to kill us all than a paperclip optimizer would, so if we’re pressed for time near the end, why shouldn’t we go with this suggestion over something else?