Yeah sorry I should have been more precise. I think it’s so non-trivial that it plausibly contains most of the difficulty in the overall problem—which is a statement I think many people working on mechanistic interpretability would disagree with.
Yeah sorry I should have been more precise. I think it’s so non-trivial that it plausibly contains most of the difficulty in the overall problem—which is a statement I think many people working on mechanistic interpretability would disagree with.