On one hand, Olah et al.’s (2020) investigations find circuits which implement human-comprehensible functions.
At a higher level, they also find that different branches tend to contain different features when modularity is already enforced by the architecture.