In what way do you disagree with MPS/LPS? Is it that there are cyclical states, that it is impossible to rate states against each other, or something else?
Completely agree that preferences are not consistent over time, but I’m not sure how that is relevant here.
Agent actions definitely do not maximize over reachable state preferences. My only point there was that they make some attempt to improve states. If you disagree with that, what would be an example? Totally agree with your point that it can get very messy.