It strikes me that a persistently selfish agent may be somewhat altruistic towards its future selves. The agent might want its future versions to be free to follow their own selfish preferences, rather than binding them to its current selfish preferences.
Another possibility is that the agent is not only selfish but lazy… it could self-modify to bind its future selves, but that takes effort, and it can’t be bothered.
Either way, it’s going to take a weird sort of utility function to reproduce human selfishness in an AI.
Now that I think of it, caring about making more copies of yourself might be more fundamental than caring about object-level things in the world… I wonder what kind of math could be used to model this.
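A minimal sketch of one way the math might look (this is my own ad hoc notation, nothing standard): let $u_t$ be the agent's selfish utility at time $t$, $a_t$ its action, and $\gamma$ a discount factor. A "binding" agent scores the whole future by its current $u_t$, while a "deferring" agent just hands value off to whatever its future self happens to want:

$$
\begin{aligned}
V^{\text{bind}}_t &= u_t(a_t) + \gamma\,\mathbb{E}\!\left[\textstyle\sum_{k\ge 1}\gamma^{\,k-1}\,u_t(a_{t+k})\right] &&\text{(future actions scored by the current } u_t\text{)}\\[2pt]
V^{\text{defer}}_t &= u_t(a_t) + \gamma\,\mathbb{E}\!\left[V^{\text{defer}}_{t+1}\right] &&\text{(future value computed with the future self's own } u_{t+1}\text{)}
\end{aligned}
$$

In the deferring version, the current agent's main stake in the future is just that a successor exist and do well by its own lights, which looks closer to "caring about there being more of you" than to caring about any particular object-level outcome.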