sweenesm comments on Understanding Emergence in Large Language Models

sweenesm 30 Nov 2024 2:50 UTC
1 point
Thanks for the post. I think it’d be helpful if you could add links to references for some of your claims, such as:
For instance, between 10^10 and 10^11 parameters, models showed dramatic improvements in their ability to interpret emoji sequences representing movies.