Charlie Steiner comments on [AN #120]: Tracing the intellectual roots of AI and AI alignment

Charlie Steiner 7 Oct 2020 19:09 UTC
LW: 4 AF: 3
That Hartikainen et al. paper was really interesting! Unfortunately I don’t know enough about the state of the art for unsupervised exploration—they compare DDLUS to a 2018 paper (DIAYN), but I’m not sure how either of these compares to other prominent exploration techniques (e.g. something like NGU).
I also wonder if different techniques do better on Atari vs. MuJoCo environments for “unprincipled” reasons that make apples-to-apples comparisons difficult for techniques developed by different groups.
- Rohin Shah 7 Oct 2020 21:05 UTC
  LW: 2 AF: 2
  > they compare DDLUS to a 2018 paper (DIAYN)
  Note the paper itself is from July 2019. (Not everything in the newsletter is the latest news.)
  > I also wonder if different techniques do better on Atari vs. MuJoCo environments for “unprincipled” reasons that make apples-to-apples comparisons difficult for techniques developed by different groups.
  That seems quite likely to me, but one would hope that a good method also works in situations it wasn’t designed for, so this still seems like a reasonable evaluation to me.