[Linkpost] Play with SAEs on Llama 3
We (Goodfire) just put our research preview live: you can play with Llama 3 and use sparse autoencoders to read from and write to its internal activations. This is a linkpost for:
The research preview.
Our blog post about building it.
Taking research and turning it into something you can actually use and play with has been great. It’s surprising how different iterating on something feels when you expect it to actually be used; I think it has definitely pushed the quality of what you can do with SAEs up a notch.