Lukas Finnveden comments on PaLM-2 & GPT-4 in “Extrapolating GPT-N performance”

Lukas Finnveden 1 Jun 2023 21:11 UTC
LW: 7 AF: 2
0
AF
I assume that’s from looking at the GPT-4 graph. I think the main graph I’d look at for a judgment like this is probably the first graph in the post, without PaLM-2 and GPT-4. Because PaLM-2 is 1-shot and GPT-4 is just 4 instead of 20+ benchmarks.

That suggests 90% is ~1 OOM away and 95% is ~3 OOMs away.

(And since PaLM-2 and GPT-4 seemed roughly on trend in the places where I could check them, probably they wouldn’t change that too much.)