Aiyen comments on Deepmind’s Gato: Generalist Agent

Aiyen 13 May 2022 2:28 UTC
2 points
When they talk about an expert benchmark, they’re talking about the performance of a human expert right? That’s what it sounds like, but it’s worth being sure.
- Noa Nabeshima 16 May 2022 21:48 UTC
  6 points
  Parent
  No, I’m pretty confident every expert is a neural network policy trained on the task. See “F. Data Collection Details” and the second paragraph of “3.3. Robotics—RGB Stacking Benchmark (real and sim)”
  - Aiyen 17 May 2022 8:37 UTC
    4 points
    Parent
    I read the paper and this is correct.
- Tristan Wegner 13 May 2022 9:35 UTC
  5 points
  Parent
  That is how I have seen the term expert performance used in papers. Yes.
  - evdoks 13 May 2022 11:19 UTC
    1 point
    Parent
    What would then 50% of expert performance mean? If 100% is a human expert and 0% is a random policy then what is the meaning of something in between?
    - Aiyen 13 May 2022 15:57 UTC
      1 point
      Parent
      Getting half the score, getting half as many questions right, etc.