Wouldn’t “Neuron Polysemanticity is not ‘just’ Superposition” be a more fitting title?
I think the key takeaway I wanted people to get is that superposition is something novel and non-trivial, and isn’t just a standard polysemantic neuron thing. I wrote this post in response to two interactions where people assumed that superposition was just polysemanticity.
As it turned out, a substantial fraction of the post went the other way (i.e., discussing polysemanticity that doesn't arise from superposition), so maybe?