Uncertainty about metaethics seems like a serious source of risk in AI safety, and especially in AI alignment. I’ve written a paper detailing how we might approach such fundamental uncertainty: by analyzing candidate positions to find those that minimize risk, so that we don’t expose ourselves to unnecessary risk by making assumptions we need not make.