Interesting. But CNNs were developed for a reason to begin with, and the MLP-Mixer paper does describe a rather specific architecture as well as "modern regularization techniques". I'd say all of that counts as baking some inductive biases into the model, though I agree it's a very light touch.