For a work implementin this idea, see: https://www.anthropic.com/index/decomposing-language-models-into-understandable-components
For a work implementin this idea, see: https://www.anthropic.com/index/decomposing-language-models-into-understandable-components