You can easily do the same with any affine transformation, no? Skew, translation (scale doesn’t matter for interpretability).
You can do this with any normalized, nonzero, invertible affine transformation. Otherwise you either get the zero function, get a function arbitrarily close to zero, or are unable to invert the transformation. I may end up doing this.
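For concreteness, here is a minimal numpy sketch of the invertible-affine version (names and shapes are illustrative, not from the original post): apply an invertible affine map to the hidden activations, push the map into the surrounding weights, and conjugate the nonlinearity into the new basis. The assert checks that the transformed network computes exactly the same function.

```python
import numpy as np

rng = np.random.default_rng(0)

# Original two-layer network: out = W2 @ relu(W1 @ x + b1) + b2
W1, b1 = rng.normal(size=(5, 4)), rng.normal(size=5)
W2, b2 = rng.normal(size=(3, 5)), rng.normal(size=3)
relu = lambda z: np.maximum(z, 0.0)

def original(x):
    return W2 @ relu(W1 @ x + b1) + b2

# Invertible affine change of basis on the hidden layer: h -> T @ h + t.
# T must be invertible; a singular T collapses directions and can't be
# undone -- the "unable to invert" failure mode mentioned above.
T = rng.normal(size=(5, 5))
t = rng.normal(size=5)
T_inv = np.linalg.inv(T)

# Push the affine map into the weights on either side...
W1p, b1p = T @ W1, T @ b1 + t                # pre-activations now in the new basis
W2p, b2p = W2 @ T_inv, b2 - W2 @ T_inv @ t   # next layer undoes the change of basis

# ...and conjugate the nonlinearity into the new basis. For a general
# affine map this is no longer an elementwise ReLU, which is exactly why
# moving out of the privileged basis changes what activations "look like".
def relu_in_new_basis(z):
    return T @ relu(T_inv @ (z - t)) + t

def transformed(x):
    return W2p @ relu_in_new_basis(W1p @ x + b1p) + b2p

x = rng.normal(size=4)
assert np.allclose(original(x), transformed(x))  # same function, new basis
```

As long as T is invertible the function is untouched, so the only question this transformation can answer is whether the new basis is easier to read, not whether the network computes something different.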
What would count as convincing evidence that this is valuable for a task: showing that the transformation increases the effectiveness of the best training methods.
This will not provide any improvement in training, for various reasons, but mainly because I anticipate there's a reason the network is not already in the interpretable basis. Putting a network in an interpretable basis does not actually make training more effective. The real test of this method will be in my attempts to use it to understand what my MNIST network is doing.
Wouldn’t your explainable rotated representation create a more robust model? Kind of like how Newton’s model of gravity was a better model than Copernicus’s nested epicycles or Kepler’s fitted ellipses. Your model might be immune to adversarial examples and might generalize outside of the training set.