The data wall discussion in the podcast applies Chinchilla's 20-tokens-per-parameter rule too broadly and doesn't account for the repetition of data across training epochs. These issues partially cancel out, but new information about either ingredient would shift the amended argument in different ways. I wrote up the argument as a new post; a rough sketch of the cancellation is below.
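To make the cancellation concrete, here is a toy back-of-the-envelope sketch in Python. Everything in it is an illustrative assumption of mine, not a number from the podcast or the post: the unique-token stock and the overtraining ratio are placeholders, the ~4-useful-epochs figure loosely follows Muennighoff et al. (2023), and I'm reading "applies the ratio too broadly" as "actual frontier training uses far more tokens per parameter than 20". The point is only the direction of each correction, not an estimate of where the wall is.

```python
# Toy arithmetic, not the post's actual model: why the two corrections
# to the data-wall estimate partially cancel. All numbers are assumptions.

CHINCHILLA_TOKENS_PER_PARAM = 20    # compute-optimal ratio (Hoffmann et al. 2022)
OVERTRAINED_TOKENS_PER_PARAM = 150  # assumption: frontier runs train well past Chinchilla
UNIQUE_TOKENS = 30e12               # assumed stock of usable unique text tokens
USEFUL_EPOCHS = 4                   # assumption: repeating data up to ~4 epochs
                                    # costs little (cf. Muennighoff et al. 2023)

def largest_trainable_model(token_supply: float, tokens_per_param: float) -> float:
    """Model size (in parameters) at which the token supply is exhausted."""
    return token_supply / tokens_per_param

# Podcast-style estimate: Chinchilla ratio, unique data only.
naive = largest_trainable_model(UNIQUE_TOKENS, CHINCHILLA_TOKENS_PER_PARAM)

# Correction 1: overtraining raises data demand, pulling the wall closer...
overtrained = largest_trainable_model(UNIQUE_TOKENS, OVERTRAINED_TOKENS_PER_PARAM)

# Correction 2: ...while repetition multiplies effective supply, pushing it back.
amended = largest_trainable_model(UNIQUE_TOKENS * USEFUL_EPOCHS,
                                  OVERTRAINED_TOKENS_PER_PARAM)

for label, params in [("naive", naive), ("overtraining only", overtrained),
                      ("amended", amended)]:
    print(f"{label:>17}: ~{params:.1e} parameters")
```

The sketch also shows why new information moves the amended estimate differently: it scales linearly with the number of useful epochs but inversely with the tokens-per-parameter ratio, so updates to the two ingredients don't trade off one-for-one.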