leogao comments on How to Better Report Sparse Autoencoder Performance

leogao Jun 3, 2024, 5:43 PM
2 points
0
I usually look at log(downstream loss—original LM loss). But more broadly, there’s nothing wrong with looking at log of some LM loss based term—all the scaling laws stuff does it.