Why would we give the AI a utility function that assigns 0 utility to an outcome where we get everything we want but it never turns itself off?
The designer of that AI might have (naively?) thought this was a clever way of solving the friendliness problem: "Do the thing I want, and then make sure you never do anything again." Surely that won't lead to the whole universe being tiled with paperclips, etc.