Past Account comments on What’s the difference between newer Atari-playing AI and the older Deepmind one (from 2014)?

Past Account 2 Nov 2021 21:58 UTC
1 point
0
[Deleted]
- Raemon 2 Nov 2021 22:37 UTC
  6 points
  0
  Parent
  Sorry, was being kinda lazy and hoping someone had already thought about this.
  This was the newer Deepmind one:
  https://www.lesswrong.com/posts/mTGrrX8SZJ2tQDuqz/deepmind-generally-capable-agents-emerge-from-open-ended?commentId=bosARaWtGfR836shY#bosARaWtGfR836shY
  I was motivated to post by this algorithm from China I heard about today:
  https://www.facebook.com/nellwatson/posts/10159870157893559
  I think this is the older deepmind paper:
  https://deepmind.com/research/publications/2019/playing-atari-deep-reinforcement-learning
  - axioman 4 Nov 2021 23:55 UTC
    3 points
    0
    Parent
    The first thing you mention does not learn to play Atari, and is in general trained quite differently from Atari-playing AI’s (as it relies on self-play to kind of automatically generate a curriculum of harder and harder tasks, at least for the some of the more competitive tasks in XLand).