Also, giving something more points for killing people than making cake sounds like a bad incentive scheme.
In the original cake-or-death example, the problem wasn’t that killing was awarded more points per act; it was that killing is easier, and hence accumulates more points over time. This reflects the fact that “true” human values are complex and difficult to maximise, while many other values are much easier to maximise.
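To make the "easier, hence more points over time" step concrete, here is a minimal sketch with made-up numbers. The function name, the one point per act, and the hours per act are all illustrative assumptions, not figures from the original example.

```python
def points_over_time(points_per_act: float, hours_per_act: float, hours: float) -> float:
    """Total points from repeating one kind of act for `hours` hours."""
    acts_completed = int(hours // hours_per_act)  # only finished acts count
    return points_per_act * acts_completed


# Assumption: both acts are worth the same number of points each.
POINTS_PER_ACT = 1.0

# Assumption: a cake takes 3 hours, the easier act takes 1 hour.
cake_total = points_over_time(POINTS_PER_ACT, hours_per_act=3.0, hours=12.0)
death_total = points_over_time(POINTS_PER_ACT, hours_per_act=1.0, hours=12.0)

print(cake_total, death_total)
```

Even with identical points per act, the easier act ends up with the larger total, so a maximiser would favour it without the scoring ever rating it higher.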