Even for a fairly small target model we might want to discover e.g. 100K features, and the input vectors might be e.g. 768D. That’s a lot of work to compute that matrix!
Hm. Okay, I remembered a better way to improve efficiency: neighbor lists. For each feature, remember a list of its closest neighbors, and compute your “closeness loss” using only the dot products within that list.
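To make that concrete, here is a minimal sketch in PyTorch (not anyone's actual code: it assumes the features are rows of a `(num_features, d)` tensor, that `neighbors` holds each feature's k nearest-neighbor indices, and the squared-cosine penalty is just one plausible choice of loss):

```python
import torch
import torch.nn.functional as F

def neighbor_closeness_loss(features: torch.Tensor, neighbors: torch.Tensor) -> torch.Tensor:
    """Closeness loss over stored neighbor lists only: O(N*k*d) instead of O(N^2*d).

    features:  (N, d) feature direction vectors
    neighbors: (N, k) integer indices of each feature's current nearest neighbors
    """
    f = F.normalize(features, dim=-1)             # unit-normalize so dot products are cosines
    neighbor_vecs = f[neighbors]                  # (N, k, d): gather each feature's neighbors
    sims = torch.einsum("nd,nkd->nk", f, neighbor_vecs)  # dot products within each list
    # Penalize features for sitting too close to their listed neighbors.
    return sims.pow(2).mean()
```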
The neighbor lists themselves can either be recomputed once in a while using the naive method, or the recomputation can be accelerated by keeping more coarse-grained track of where features sit in activation-space.
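And a similarly hedged sketch of the occasional brute-force refresh, chunked so the full N×N similarity matrix never has to be held in memory at once (the values of `k` and `chunk_size` are arbitrary assumptions, and the coarser-grained spatial tracking is left out):

```python
import torch
import torch.nn.functional as F

def recompute_neighbor_lists(features: torch.Tensor, k: int = 16,
                             chunk_size: int = 4096) -> torch.Tensor:
    """Brute-force top-k neighbor refresh, run only once in a while."""
    f = F.normalize(features, dim=-1)
    chunks = []
    for start in range(0, f.shape[0], chunk_size):
        block = f[start:start + chunk_size]       # (c, d) slice of features
        sims = block @ f.T                        # (c, N) similarities for this chunk
        # Mask self-similarity so a feature is never its own neighbor.
        rows = torch.arange(block.shape[0])
        sims[rows, rows + start] = float("-inf")
        chunks.append(sims.topk(k, dim=-1).indices)
    return torch.cat(chunks)                      # (N, k) neighbor indices
```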
Thanks, I mentioned this as a potential way forward for tackling quadratic complexity in my edit at the end of the post.