That Hartikainen et al. paper was really interesting! Unfortunately I don’t know enough about the state of the art for unsupervised exploration—they compare DDLUS to a 2018 paper (DIAYN), but I’m not sure how either of these compares to other prominent exploration techniques (e.g. something like NGU).
I also wonder if different techniques do better on atari vs. mujoco environments for “unprincipled” reasons that make apples to apples comparisons difficult for techniques developed by different groups.
Note the paper itself is from July 2019. (Not everything in the newsletter is the latest news.)
I also wonder if different techniques do better on atari vs. mujoco environments for “unprincipled” reasons that make apples to apples comparisons difficult for techniques developed by different groups.
That seems quite likely to me, but one would hope that a good method also works in situations it wasn’t designed for, so this still seems like a reasonable evaluation to me.
That Hartikainen et al. paper was really interesting! Unfortunately I don’t know enough about the state of the art for unsupervised exploration—they compare DDLUS to a 2018 paper (DIAYN), but I’m not sure how either of these compares to other prominent exploration techniques (e.g. something like NGU).
I also wonder if different techniques do better on atari vs. mujoco environments for “unprincipled” reasons that make apples to apples comparisons difficult for techniques developed by different groups.
Note the paper itself is from July 2019. (Not everything in the newsletter is the latest news.)
That seems quite likely to me, but one would hope that a good method also works in situations it wasn’t designed for, so this still seems like a reasonable evaluation to me.