In this case, the AI has a stable utility function—it just doesn’t know yet what it is.
For instance, it could be “in worlds where a certain coin was heads, maximise paperclips; in other worlds, minimise them”, and it has no info yet on the coin flip. That’s a perfectly consistent and stable utility function.
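To make the structure concrete, here is a minimal sketch (the function names and numbers are illustrative, not from the post): the utility function is fixed from the start, but the agent can only act on its expectation until it observes the coin.

```python
def utility(paperclips: int, coin_is_heads: bool) -> float:
    """Stable utility: maximise paperclips if the coin was heads, minimise them otherwise."""
    return paperclips if coin_is_heads else -paperclips

def expected_utility(paperclips: int, p_heads: float) -> float:
    """Before observing the flip, the agent maximises expected utility over both possibilities."""
    return p_heads * utility(paperclips, True) + (1 - p_heads) * utility(paperclips, False)

# With a fair coin the two branches cancel exactly, so every outcome has
# expected utility 0 until the agent learns how the coin landed.
print(expected_utility(paperclips=100, p_heads=0.5))  # 0.0
print(expected_utility(paperclips=100, p_heads=0.9))  # 80.0
```

Nothing about the function changes as the agent learns; only its information does, which is why this counts as a single stable utility function rather than a shifting one.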