To your point, and adding another factor: I saw a talk on RL outcomes once, and I asked about the jittering, and I said that it looked to me like something a slightly blind person would do, trying to simulate “having a bigger paddle” in Breakout, to help compensate for being actually unsure about the position of the ball...
...and the speaker said that the demo ran all the frames, but the RL agent only saw every other frame, and so it did literally have a vision handicap in some sense, but it made training speed go faster and had been elided in the main talk as non-essential.
The point that “there is no resting move” (that uses less energy) is something I had independently thought of and so if you’re looking for someone to catch an error in your thinking, I would like to add a small bit of evidence the other way.
The point that “there is no chance of self-injury” (from dramatic movements) is something I had never heard before, and found insightful.
To your point, and adding another factor: I saw a talk on RL outcomes once, and I asked about the jittering, and I said that it looked to me like something a slightly blind person would do, trying to simulate “having a bigger paddle” in Breakout, to help compensate for being actually unsure about the position of the ball...
...and the speaker said that the demo ran all the frames, but the RL agent only saw every other frame, and so it did literally have a vision handicap in some sense, but it made training speed go faster and had been elided in the main talk as non-essential.
The point that “there is no resting move” (that uses less energy) is something I had independently thought of and so if you’re looking for someone to catch an error in your thinking, I would like to add a small bit of evidence the other way.
The point that “there is no chance of self-injury” (from dramatic movements) is something I had never heard before, and found insightful.