To get behavior you need preferences plus a temperature. That’s what I meant by saying there is a difference between wanting X a little and wanting X a lot.
I agree that the formulation I gave favors actions that generate a lot of entropy. Really you want to consider the causal entropy of your actions. I think that means P(τ) ∝ exp(E(U(τ))) for each sequence of actions τ. I agree that’s less elegant.
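As a rough sketch of what I have in mind (not a definitive implementation): the snippet below normalizes exp(E(U(τ)) / T) over a small set of action sequences. The explicit temperature T, the function name `sequence_distribution`, and the example utilities are all my own assumptions for illustration; with T = 1 it reduces to the formula above.

```python
import math

def sequence_distribution(expected_utility, temperature=1.0):
    """Return P(tau) proportional to exp(E[U(tau)] / temperature).

    expected_utility maps each action sequence tau to E[U(tau)].
    The temperature parameter is an assumption added for illustration.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the maximum before exponentiating for numerical stability;
    # this does not change the normalized probabilities.
    best = max(expected_utility.values())
    weights = {
        tau: math.exp((u - best) / temperature)
        for tau, u in expected_utility.items()
    }
    total = sum(weights.values())
    return {tau: w / total for tau, w in weights.items()}

# Hypothetical expected utilities for three action sequences.
expected_utility = {("a", "a"): 1.0, ("a", "b"): 0.0, ("b", "a"): 0.0}

# High temperature: the preference for ("a", "a") barely shows in behavior.
mild = sequence_distribution(expected_utility, temperature=10.0)
# Low temperature: the same preference dominates behavior.
strong = sequence_distribution(expected_utility, temperature=0.1)
```

Scaling every utility by a constant has the same effect as dividing the temperature by that constant, which is the sense in which wanting X a lot differs from wanting X a little.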