Note the paper itself is from July 2019. (Not everything in the newsletter is the latest news.)
I also wonder whether different techniques do better on Atari vs. MuJoCo environments for “unprincipled” reasons, which would make apples-to-apples comparisons difficult for techniques developed by different groups.
That seems quite likely to me, but one would hope that a good method also works in situations it wasn’t designed for, so this still seems like a reasonable evaluation.