Depending on model size, I’m fairly confident we can train sparse autoencoders (SAEs) and see whether they find relevant features (feel free to DM me about this).