(In fact, we know that the fraction of features extracted is probably quite small—for example, the 16M-latent GPT-4 autoencoder recovers only 10% of the downstream loss, measured in terms of equivalent pretraining compute.)