gwern comments on Inverse scaling can become U-shaped

gwern 15 Nov 2022 23:34 UTC
LW: 5 AF: 3
0
AF

They’re also finding that inverse scaling on these tasks goes away with chain-of-thought prompting

So, like some of the Big-Bench PaLM results, these are more cases of ‘hidden scaling’ where quite simple inner-monologue approaches can show smooth scaling while the naive pre-existing benchmark claims that there are no gains with scale?
- Ethan Perez 16 Nov 2022 3:55 UTC
  LW: 1 AF: 1
  0
  AF Parent
  Yup