I’m pretty concerned about the compute scaling of autoencoders to real models. I predict that both the data needed and the number of features scale superlinearly in d_model, which seems to scale badly to frontier models.
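To make the worry concrete, here’s a rough back-of-envelope sketch (all the constants — the 8x expansion factor, the tokens-per-dimension figure, the linear data scaling — are assumptions for illustration, not measured numbers):

```python
# Back-of-envelope SAE training cost, under assumed (not measured) scaling laws:
# dictionary size and training tokens both grow linearly with d_model.
def sae_training_flops(d_model, expansion=8, tokens_per_dim=2e6):
    n_features = expansion * d_model          # assumed dictionary size, linear in d_model
    n_tokens = tokens_per_dim * d_model       # assumed data requirement, linear in d_model
    fwd_flops = 2 * 2 * d_model * n_features  # encoder + decoder matmuls per token (multiply-adds)
    return 3 * fwd_flops * n_tokens           # ~3x forward cost to include the backward pass

# Under these assumptions total cost grows ~d_model^3,
# so a 10x wider model costs ~1000x more SAE training compute.
for d in (1024, 4096, 16384):
    print(d, f"{sae_training_flops(d):.2e} FLOPs")
```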
This doesn’t engage with (2): doing awesome work that attracts more researchers to this agenda is counterfactually more useful than directly working on lowering the compute cost now, since others (or you yourself) can tackle that compute bottleneck later.
Though honestly, if the results ended up giving a ~2x speedup, that’d be quite useful for faster feedback loops for me.
Yeah, I agree that doing work that gets other people excited about sparse autoencoders is arguably more impactful than marginal compute savings; I’m just arguing that compute savings do matter.