I didn’t say there’s no inductive bias at work in models. Merely that trying to impose your own inductive biases on models is probably doomed for reasons best stated in the Bitter Lesson. The effectiveness of pretraining / scaling suggests that inductive biases work best when they are arrived at organically by very expressive models training on very large amounts of data.
Our models are very general, but we still use e.g. a diffusion model for images that exploits (and is biased towards) the kind of local structure we expect of images, a transformer for text that exploits (and is biased towards) the kind of sequential pair-correlation you see in natural language, etc.
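(For concreteness, a minimal sketch of the locality bias being pointed at here, with illustrative layer sizes that are not taken from the comments above: a 3×3 convolution ties each output to a small shared-weight neighbourhood, while a dense layer over the same 32×32 image connects every pixel to every output.)

```python
# Illustrative comparison of a local, weight-shared conv layer vs. an
# unconstrained fully connected layer over the same 3x32x32 image.
import torch.nn as nn

conv = nn.Conv2d(3, 3, kernel_size=3, padding=1)    # local receptive field, shared weights
dense = nn.Linear(3 * 32 * 32, 3 * 32 * 32)         # every pixel connected to every output

print(sum(p.numel() for p in conv.parameters()))    # 84 parameters
print(sum(p.numel() for p in dense.parameters()))   # ~9.4 million parameters
```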
Sure, but it’s unclear whether these inductive biases are necessary. Concrete example: randomly initialised CNN weights can extract sufficiently good features to linearly classify MNIST. Another example: randomly initialised transformers with only embeddings learnable can do modular addition.

Edit: On reflection, the two examples above actually support the idea that the inductive bias of the architecture is immensely helpful to solving tasks (to the extent that the weights themselves don’t really matter). A better example of my point is that MLP-mixer models can be as good as CNNs on vision tasks despite having much smaller architectural inductive biases towards vision.
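(A minimal sketch of how one might reproduce the first concrete example: freeze a randomly initialised CNN, use it purely as a feature extractor, and fit a linear classifier on MNIST. The architecture and sizes here are illustrative assumptions, not the setup from the original result.)

```python
# Random, never-trained CNN as a fixed feature extractor; only a linear
# read-out is fit on top of the extracted features.
import torch
import torch.nn as nn
from torchvision import datasets, transforms
from sklearn.linear_model import LogisticRegression

torch.manual_seed(0)

# Randomly initialised convolutional stack; its weights are never updated.
random_cnn = nn.Sequential(
    nn.Conv2d(1, 32, 3, stride=2), nn.ReLU(),
    nn.Conv2d(32, 64, 3, stride=2), nn.ReLU(),
    nn.AdaptiveAvgPool2d(2), nn.Flatten(),
).eval()

def extract_features(loader):
    feats, labels = [], []
    with torch.no_grad():
        for x, y in loader:
            feats.append(random_cnn(x))
            labels.append(y)
    return torch.cat(feats).numpy(), torch.cat(labels).numpy()

tfm = transforms.ToTensor()
train = datasets.MNIST("data", train=True, download=True, transform=tfm)
test = datasets.MNIST("data", train=False, download=True, transform=tfm)
train_loader = torch.utils.data.DataLoader(train, batch_size=512)
test_loader = torch.utils.data.DataLoader(test, batch_size=512)

X_train, y_train = extract_features(train_loader)
X_test, y_test = extract_features(test_loader)

# Only the linear classifier is trained; the conv weights stay random.
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```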
My understanding is that they’re necessary even in principle, since there are an unbounded number of functions that fit any finite set of points. Even AIXI has a strong inductive bias, toward programs with the lowest Kolmogorov complexity.
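(A toy illustration of the underdetermination point: two functions that agree exactly on the same finite set of points but diverge everywhere else, so the data alone cannot choose between them. The specific functions are arbitrary.)

```python
# Two hypotheses that fit the same finite dataset perfectly yet make
# different predictions off the data.
import numpy as np

xs = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
ys = xs ** 2                      # the "true" labels: y = x^2

def f(x):
    return x ** 2                 # one hypothesis consistent with the data

def g(x):
    # Add a term that vanishes at every training point, so g fits the
    # same points exactly while behaving very differently elsewhere.
    bump = np.prod([x - xi for xi in xs], axis=0)
    return x ** 2 + 10.0 * bump

print(np.allclose(f(xs), ys), np.allclose(g(xs), ys))   # True True
print(f(0.5), g(0.5))             # same fit to the data, different predictions
```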
Yeah, I agree that some inductive bias is probably necessary. But not all inductive biases are equal; some are much more general than others, and in particular I claim that ‘narrow’ inductive biases (e.g. specializing architectures to match domains) probably have ~zero net benefit compared to biases learned from data.
Interesting. But CNNs were originally developed for a reason, and the MLP-mixer paper does mention a rather specific architecture as well as “modern regularization techniques”. I’d say all of that counts as baking some inductive biases into the model, though I agree it’s a very light touch.
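(To make “a rather specific architecture” concrete, a minimal sketch of one MLP-Mixer block, with illustrative dimensions: the image is split into patches, a token-mixing MLP is shared across channels, and a channel-mixing MLP is shared across patches. That patch grid and weight sharing is itself an inductive bias, just a lighter one than a convolution’s locality.)

```python
# Minimal MLP-Mixer block: token mixing across patches, then channel
# mixing across features, each with shared weights and a residual.
import torch
import torch.nn as nn

class MixerBlock(nn.Module):
    def __init__(self, num_patches: int, channels: int, hidden: int):
        super().__init__()
        self.norm1 = nn.LayerNorm(channels)
        # Token-mixing MLP: operates along the patch dimension.
        self.token_mlp = nn.Sequential(
            nn.Linear(num_patches, hidden), nn.GELU(), nn.Linear(hidden, num_patches)
        )
        self.norm2 = nn.LayerNorm(channels)
        # Channel-mixing MLP: operates along the channel dimension.
        self.channel_mlp = nn.Sequential(
            nn.Linear(channels, hidden), nn.GELU(), nn.Linear(hidden, channels)
        )

    def forward(self, x):  # x: (batch, num_patches, channels)
        y = self.norm1(x).transpose(1, 2)          # (batch, channels, num_patches)
        x = x + self.token_mlp(y).transpose(1, 2)  # mix information across patches
        return x + self.channel_mlp(self.norm2(x)) # mix information across channels

x = torch.randn(8, 196, 512)                 # e.g. 14x14 patches, 512 channels
print(MixerBlock(196, 512, 256)(x).shape)    # torch.Size([8, 196, 512])
```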