I wrote a quick follow-up: Mechanistic Interpretability as Reverse Engineering (follow-up to “cars and elephants”) FYI.
I wrote a quick follow-up: Mechanistic Interpretability as Reverse Engineering (follow-up to “cars and elephants”) FYI.