Wei Shi

Wei Shi Nov 19, 2024, 2:00 AM
in reply to: Neel Nanda’s comment on: Open Source Replication of Anthropic’s Crosscoder paper for model-diffing
I got it, thank you very much!

Wei Shi Nov 18, 2024, 9:07 AM
on: Open Source Replication of Anthropic’s Crosscoder paper for model-diffing
"We trained a crosscoder of width 16,384 on the residual stream activations from the middle layer of the Gemma-2 2B base and IT models."
I don’t understand the training process here, or the one described in Anthropic’s mini-paper. How do you train a single crosscoder on residual stream activations from two different models?
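
For readers with the same question, here is a minimal sketch of one reading of the setup. It assumes the architecture described in Anthropic's crosscoder write-up, with the two models taking the place of the layers: each model gets its own encoder and decoder weights, but both share a single latent vector. This is not the replication's actual code. The function names, the tiny dimensions, and the random arrays standing in for real residual stream activations are all illustrative assumptions.

```python
import numpy as np

def crosscoder_forward(acts, W_enc, b_enc, W_dec, b_dec):
    """acts: (batch, n_models, d_model) activations for the SAME tokens,
    one slice per model. Returns (latents, reconstructions)."""
    # Per-model encoder matrices; their outputs are summed into ONE shared
    # pre-activation, so each latent reads from both models at once.
    pre = np.einsum("bmd,mdh->bh", acts, W_enc) + b_enc
    latents = np.maximum(pre, 0.0)  # ReLU
    # Per-model decoder matrices; the same latents reconstruct each model.
    recon = np.einsum("bh,hmd->bmd", latents, W_dec) + b_dec
    return latents, recon

def crosscoder_loss(acts, latents, recon, W_dec, l1_coeff):
    # Reconstruction error, summed over models and hidden dimensions.
    mse = ((acts - recon) ** 2).sum(axis=-1).sum(axis=-1)
    # Sparsity penalty: each latent is weighted by the sum over models
    # of its decoder vector norms.
    dec_norms = np.linalg.norm(W_dec, axis=-1).sum(axis=-1)  # (width,)
    sparsity = (latents * dec_norms).sum(axis=-1)
    return (mse + l1_coeff * sparsity).mean()

# Toy sizes (assumed for illustration; the post uses width 16,384).
rng = np.random.default_rng(0)
batch, n_models, d_model, width = 4, 2, 8, 16

# Stand-in for middle-layer residual activations of the base and IT models,
# both run on the same token positions.
acts = rng.normal(size=(batch, n_models, d_model))

W_enc = rng.normal(scale=0.1, size=(n_models, d_model, width))
b_enc = np.zeros(width)
W_dec = rng.normal(scale=0.1, size=(width, n_models, d_model))
b_dec = np.zeros((n_models, d_model))

latents, recon = crosscoder_forward(acts, W_enc, b_enc, W_dec, b_dec)
loss = crosscoder_loss(acts, latents, recon, W_dec, l1_coeff=1e-3)
```

In training, the loss would be minimised by gradient descent over all encoder and decoder weights jointly. Under this reading, a latent whose decoder norm is large for one model and near zero for the other is a candidate model-specific feature, while similar norms suggest a shared one.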