Charlie Steiner comments on Finding the estimate of the value of a state in RL agents