Anirandis comments on How easily can we separate a friendly AI in design space from one which would bring about a hyperexistential catastrophe?

Anirandis 10 Sep 2020 1:59 UTC
2 points
Mainly for brevity, but also because it seems to involve quite a drastic change in how the reward function/model as a whole functions. So it doesn’t seem particularly likely that it’ll be implemented.