Mateusz Bagiński comments on chinchilla’s wild implications

Mateusz Bagiński Aug 25, 2023, 1:11 PM
3 points
I think I remember seeing somewhere that LLMs learn more slowly on languages with “more complex” grammar (in the sense that their loss decreases more slowly over the same number of tokens), but I can’t find the source right now.