Is the point that transformer MLPs blow up the hidden dimension in the middle?
Thanks for the catch; I deleted “Note that the hidden dimen”. Transformers do blow up the hidden dimension, but that’s not very relevant here: they have many more neurons than residual stream dimensions, and many more features than neurons (as shown in the recent Anthropic paper).
To clarify, I thought it was about superposition happening inside the projection afterwards.
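A minimal sketch of the dimensions being discussed, using assumed GPT-2-small-style sizes (the specific numbers are illustrative, not from this thread): the MLP up-projection gives each layer more neurons than the residual stream has dimensions, and the superposition claim is that the number of represented features exceeds even the neuron count.

```python
import torch
import torch.nn as nn

# Illustrative sizes (assumed, roughly GPT-2-small): the MLP hidden layer
# has 4x as many neurons as the residual stream has dimensions.
d_model = 768        # residual stream width
d_mlp = 4 * d_model  # 3072 MLP neurons per layer

# A standard transformer MLP block: project up, nonlinearity, project back down.
mlp = nn.Sequential(
    nn.Linear(d_model, d_mlp),  # "blow up" the hidden dimension
    nn.GELU(),
    nn.Linear(d_mlp, d_model),  # project back into the residual stream
)

x = torch.randn(1, d_model)
print(mlp(x).shape)  # torch.Size([1, 768])

# The superposition point goes one step further: the model can represent
# more features than it has MLP neurons, by storing them as non-orthogonal
# directions rather than as individual neurons.
```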