Logan Riggs comments on Taking features out of superposition with sparse autoencoders more quickly with informed initialization

Logan Riggs 24 Sep 2023 14:17 UTC
2 points
0
This doesn’t engage w/ (2) - doing awesome work to attract more researchers to this agenda is counterfactually more useful than directly working on lowering the compute cost now (since others, or yourself, can work on that compute bottleneck later).
Though honestly, if the results ended up in a ~2x speedup, that’d be quite useful for faster feedback loops for myself.
- Neel Nanda 24 Sep 2023 14:29 UTC
  6 points
  2
  Parent
  Yeah, I agree that doing work that gets other people excited about sparse autoencoders is arguably more impactful than marginal compute savings, I’m just arguing that compute savings do matter.