tailcalled comments on tailcalled’s Shortform

tailcalled 29 Mar 2024 16:17 UTC
2 points
Thanks for the link! It does look somewhat relevant.
But I think the weighting by reward (or other significant variables) is pretty important, since it generates a goal to pursue, which makes the weighting emphasize things that can be achieved rather than just things that might randomly happen.
Though this makes me think about whether there are natural variables in the state space that could be weighted by, without using reward per se. E.g. the size of (s’ - s) in some natural embedding, or the variance in s’ over all the possible actions that could be taken. Hmm. 🤔
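To make the two candidate weightings concrete, here is a minimal sketch, not a worked-out proposal. It assumes a toy deterministic environment where the "natural embedding" is just the raw state vector; `step`, `displacement_weight`, and `action_variance_weight` are hypothetical names invented for this illustration.

```python
import numpy as np

# Hypothetical toy dynamics (an assumption for illustration only):
# action 0 is a no-op, actions 1-3 nudge the state along fixed directions,
# and any state with s[0] >= 3 is "stuck", so no action has any effect there.
DIRECTIONS = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])

def step(s, a):
    """Return the next state s' for state s and action index a."""
    if s[0] >= 3.0:
        return s.copy()
    return s + DIRECTIONS[a]

def displacement_weight(s, s_next):
    """Size of (s' - s), using the raw state vector as the embedding."""
    return float(np.linalg.norm(s_next - s))

def action_variance_weight(s, actions, step_fn):
    """Variance in s' over all available actions, summed across dimensions."""
    next_states = np.stack([step_fn(s, a) for a in actions])
    return float(next_states.var(axis=0).sum())

if __name__ == "__main__":
    free = np.array([0.0, 0.0])
    stuck = np.array([3.0, 0.0])
    actions = range(len(DIRECTIONS))
    print(displacement_weight(free, step(free, 1)))
    print(action_variance_weight(free, actions, step))
    print(action_variance_weight(stuck, actions, step))
```

In this toy setup the action-variance weight goes to zero in the stuck region, where nothing the agent does changes the outcome, so it tracks controllability without referring to reward. Whether either quantity behaves sensibly in a realistic state space is exactly the open question.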