Look inside an LLM. Goodfire trained sparse autoencoders on Llama 3 8B and built a tool for working with edited versions of Llama, where you tune individual features/concepts.
https://preview.goodfire.ai/
(I am loosely affiliated: another team at my current employer was involved in this.)