This is the Jacobian taken at a single data point, right? Assuming so, you might want to try looking for a single rotation which makes the Jacobian sparse at many datapoints simultaneously. That would be more directly interpretable as factoring the net into a relatively low number of information channels.
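For what it's worth, here's a minimal PyTorch sketch of that shared-rotation idea, assuming the per-datapoint Jacobians are already computed and stacked into a batch; `fit_shared_rotation`, the skew-symmetric parametrization, and the L1 objective are all my own illustrative choices, not anything from the original method:

```python
import torch

def fit_shared_rotation(jacobians, steps=2000, lr=1e-2):
    """Find a single pair of orthogonal rotations (R_out, R_in) that
    sparsifies R_out @ J @ R_in.T across a whole batch of Jacobians.

    jacobians: tensor of shape (batch, d_out, d_in), one J per datapoint.
    """
    _, d_out, d_in = jacobians.shape
    # Parametrize each rotation as matrix_exp of a skew-symmetric matrix,
    # so it stays exactly orthogonal throughout optimization.
    A = torch.zeros(d_out, d_out, requires_grad=True)
    B = torch.zeros(d_in, d_in, requires_grad=True)
    opt = torch.optim.Adam([A, B], lr=lr)
    for _ in range(steps):
        R_out = torch.matrix_exp(A - A.T)
        R_in = torch.matrix_exp(B - B.T)
        rotated = R_out @ jacobians @ R_in.T  # broadcasts over the batch
        loss = rotated.abs().mean()  # L1 as a soft proxy for sparsity
        opt.zero_grad()
        loss.backward()
        opt.step()
    return torch.matrix_exp(A - A.T).detach(), torch.matrix_exp(B - B.T).detach()
```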
Another useful next step would be to take some part of the net which maps X → Y → Z, and compute the rotations which maximize sparsity for X → Y and Y → Z separately. Then, try to compose the rotations found. Do the “sparse output channels” of X → Y approximately match the “sparse input channels” of Y → Z?
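And a sketch of the composition check, continuing from `fit_shared_rotation` above and assuming it has been run on both halves of the net. Since channel order and sign are arbitrary, "matching" should mean close to a signed permutation rather than literally the identity:

```python
# R_y_out: output rotation fit for X -> Y; R_y_in: input rotation fit for Y -> Z.
M = R_y_in @ R_y_out.T  # change of basis between the two Y-bases
# If the bases agree, M is close to a signed permutation: each row's mass
# concentrates in one entry. Rows of an orthogonal matrix have unit norm,
# so a mean row-peak near 1.0 means near-perfect channel matching.
row_peaks = M.abs().max(dim=1).values
print(f"mean row peak: {row_peaks.mean().item():.3f} (1.0 = exact match)")
```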
I was planning on doing the first idea, and I do like the second idea! I’m slightly skeptical that the two rotations will be the same, but I did find that applying the method to the last layer of the model yields the identity matrix, which is some evidence in favor of the ‘rotations are the same’ prediction holding in general.
I think the idea is that if the rotated basis fundamentally “means” something important, rather than just making what’s happening easier for us humans to picture, we’d expect the basis computed for X → Y to mostly match the basis computed for Y → Z.
At least that’s the sort of thing I’d expect to see in such a world.
Yup, this is why I’m skeptical there will be a positive result. I did not try to derive a principled, meaningful basis. I tried the most obvious thing to do which nobody else seems to have done. So I expect this device will be useful and potentially the start of something fundamental, but not fundamental itself.