Rohin Shah comments on [AN #120]: Tracing the intellectual roots of AI and AI alignment

Rohin Shah 7 Oct 2020 21:05 UTC
LW: 2 AF: 2
AF
they compare DDLUS to a 2018 paper (DIAYN)
Note the paper itself is from July 2019. (Not everything in the newsletter is the latest news.)
I also wonder if different techniques do better on atari vs. mujoco environments for “unprincipled” reasons that make apples to apples comparisons difficult for techniques developed by different groups.
That seems quite likely to me, but one would hope that a good method also works in situations it wasn’t designed for, so this still seems like a reasonable evaluation to me.