Thanks for the link! It does look somewhat relevant.
But I think the weighting by reward (or other significant variables) is pretty important, since it generates a goal to pursue: the measure then emphasizes things that can be achieved rather than just things that might randomly happen.
Though this makes me wonder whether there are natural variables in the state space that one could weight by, without using reward per se. E.g. the size of (s’ - s) in some natural embedding, or the variance in s’ over all the possible actions that could be taken. Hmm. 🤔
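
To make those two candidates concrete, here is a rough sketch of how they might be computed. Everything in it is an assumption for illustration: the toy 5x5 grid world, the `step` function, the `ACTIONS` list, and the choice of Euclidean norm and summed per-coordinate variance are all hypothetical stand-ins for "some natural embedding", not anything from the linked work.

```python
import numpy as np

def displacement_weight(s, s_next):
    """Size of (s' - s): Euclidean norm in the assumed embedding."""
    diff = np.asarray(s_next, dtype=float) - np.asarray(s, dtype=float)
    return float(np.linalg.norm(diff))

def action_variance_weight(s, actions, step):
    """Variance of s' over all actions available in s, summed over coordinates."""
    next_states = np.array([step(s, a) for a in actions], dtype=float)
    return float(next_states.var(axis=0).sum())

# Hypothetical toy environment: a point on a 5x5 grid; walls clip movement.
def step(s, a):
    return np.clip(np.asarray(s) + np.asarray(a), 0, 4)

ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]

center = action_variance_weight((2, 2), ACTIONS, step)
corner = action_variance_weight((0, 0), ACTIONS, step)
print(center, corner)
```

In this toy setup the variance weight comes out higher in the middle of the grid than in a corner, because in the corner two of the four actions are blocked and lead to the same s’. So it seems to pick up something like "how much the choice of action matters here", with no reward involved.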