I’m so torn on this paper -I think it makes a reasonable point that many claims of emergence are overrated and that it’s easy to massage metrics into a single narrative. But also, I think the title and abstract are overclaiming clickbait—obviously models have emergent abilities!! Chain of thought and few shot learning are just not a thing smaller models can do. Accuracy is sometimes the right metric, etc. It’s often overhyped, but this paper way overclaims
I’m so torn on this paper -I think it makes a reasonable point that many claims of emergence are overrated and that it’s easy to massage metrics into a single narrative. But also, I think the title and abstract are overclaiming clickbait—obviously models have emergent abilities!! Chain of thought and few shot learning are just not a thing smaller models can do. Accuracy is sometimes the right metric, etc. It’s often overhyped, but this paper way overclaims