[Question] How does OpenAI’s language model affect our AI timeline estimates?

jimrandomh15 Feb 2019 3:11 UTC

50 points

OpenAI recently announced progress in NLP, using a large transformer-based language model to tackle a variety of tasks and breaking performance records in many of them. It also generates synthetic short stories, which are surprisingly good.

How surprising are these results, given past models of how difficult language learning was and how far AI had progressed? Should we be significantly updating our estimates of AI timelines?

What links here?

Ruby's comment on Core Tag Examples [temporary] by Ruby (7 Apr 2020 19:24 UTC; 2 points)

jimrandomh15 Feb 2019 3:11 UTC

50 points

7 comments1 min readLW link

AI Timelines OpenAI Language Models

orthonormal 15 Feb 2019 22:29 UTC
15 points
It doesn’t move much probability mass to the very near term (i.e. 1 year or less), because both this and AlphaStar aren’t really doing consequentialist reasoning, they’re just able to get a surprising performance with simpler tricks (the very Markovian nature of human writing, a good position evaluation function) given a whole lot of compute.
However, it does shift my probabilities forward in time, in the sense that one new weird trick to do deductive or consequentialist reasoning, plus a lot of compute, might get you there really quickly.
Darmani 15 Feb 2019 7:43 UTC
10 points
Something you learn pretty quickly in academia: don’t trust the demos. Systems never work as well when you select the inputs freely (and, if they do, expect thorough proof). So, I wouldn’t read too deeply into this yet; we don’t know how good it actually is.
- james_t 15 Feb 2019 15:17 UTC
  21 points
  Parent
  Vis-a-vis selecting inputs freely: OpenAI also included a large dump of unconditioned text generation in their github repo.
- Vanessa Kosoy 15 Feb 2019 17:29 UTC
  18 points
  Parent
  They claim beating records on a range of standard tests (such as the Winograd schema), which is not something you can cheat by cherry-picking, assuming they are honest about the results.
- wizzwizz4 20 Oct 2019 18:32 UTC
  2 points
  Parent
  https://transformer.huggingface.co/ is a nice demonstration of GPT2 that allows you to select the inputs freely.

avturchin 15 Feb 2019 9:51 UTC
1 point
It lowers expected AI timing but not only because it is so great achievement, but also because it demonstrates that large part of human thinking could be just generating plausible continuation of the input text.
Pattern 15 Feb 2019 17:26 UTC
0 points
OpenAI’s “safety” move (not releasing the model) reduces the scrutiny it can receive, which makes its impact on forecasts conditional on how good you think it is, when you haven’t seen it.