Probably a much better way of getting a sense of the long-term agenda than reading my comment is to look back at Chris Olah’s “Interpretability Dreams” post.
Our present research aims to create a foundation for mechanistic interpretability research. In particular, we’re focused on trying to resolve the challenge of superposition. In doing so, it’s important to keep sight of what we’re trying to lay the foundations for. This essay summarizes those motivating aspirations – the exciting directions we hope will be possible if we can overcome the present challenges.
Probably a much better way of getting a sense of the long-term agenda than reading my comment is to look back at Chris Olah’s “Interpretability Dreams” post.