samshap comments on Stitching SAEs of different sizes

samshap 13 Jul 2024 22:12 UTC
1 point
0
This is great work. My recommendation: add a term in your loss function that penalizes features with high cosine similarity.

I think there is a strong theoretical underpinning for the results you are seeing.

I might try to reach out directly—some of my own academic work is directly relevant here.
- Bart Bussmann 13 Jul 2024 22:40 UTC
  1 point
  0
  Parent
  Interesting! I actually did a small experiment with this a while ago, but never really followed up on it.
  
  I would be interested to hear about your theoretical work in this space, so sent you a DM :)