I think that the cumulative utility maximizer model might not be really appropriate if you allow for the agent to be forgetful.
Anyway, if you stick with it, then the agent picks option 1.
I’ve considered some variations of the model that pick option 2, but they seem all vulnerable to wireheading.
I think that the cumulative utility maximizer model might not be really appropriate if you allow for the agent to be forgetful. Anyway, if you stick with it, then the agent picks option 1.
I’ve considered some variations of the model that pick option 2, but they seem all vulnerable to wireheading.