npostavs comments on [linkpost] The final AI benchmark: BIG-bench

npostavs 10 Jun 2022 18:21 UTC
6 points

I think BIG-bench could be the final AI benchmark: if a language model surpasses the top human score on it, the model is an AGI. At this point, there is nowhere to move the goalposts.

But when you say:

the benchmark is still growing. The organizers keep it open for submissions.

Doesn’t that mean this benchmark is a set of moving goalposts?
- RomanS 11 Jun 2022 5:07 UTC
  2 points
  Parent
  Good catch! You’re right, if contributors continue to add harder and harder tasks to the benchmark, and do it fast enough, the benchmark could be forever ahead.
  I expect that some day the benchmark will be frozen. And even if it’s not frozen, new tasks are added only a few times per month these days, thus it’s not impossible to solve its current version.