Quintin Pope comments on AGI Ruin: A List of Lethalities

Quintin Pope 6 Jun 2022 6:27 UTC
21 points
There’s shard theory, which aims to describe the process by which values form in humans. The eventual aim is to understand value formation well enough that we can reproduce it in an AI system. I also think figuring out human values, value reflection, and moral philosophy might actually be a lot easier than we assume. E.g., the continuous perspective on agency / values is pretty compelling to me and changes things a lot, IMO.