In your example, I think even adding just one more node, h3, to the hidden layer would suffice to connect the two solutions. One node per dimension of input suffices to learn the function, but two nodes can also share a task between them, with the share each node carries varying continuously from 0 to 1. So just have h3 take over x2 from h2, then h2 take over x1 from h1, and finally h1 take over x2 from h3, ending at the permuted solution.
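To make the handoff concrete, here is a minimal sketch of that path (my own construction, not from the comment: I assume a one-hidden-layer linear network with output weights fixed at 1, computing f(x) = x1 + x2). Each handoff slides a sharing fraction t from 0 to 1 while the column sums of the weight matrix, and hence the computed function, stay exactly fixed:

```python
import numpy as np

# Toy model of the argument above. Solution A uses h1 for x1 and h2 for x2;
# the endpoint is the permuted copy with h1 and h2 swapped. Along the way,
# every intermediate point computes exactly the same function.

v = np.ones(3)                      # output weights fixed at 1 for simplicity

def f(W, x):
    return v @ (W @ x)              # linear hidden layer, so shares just add

def handoff(W, col, src, dst, t):
    """Move fraction t of input column `col` from node `src` to node `dst`."""
    W = W.copy()
    total = W[src, col] + W[dst, col]
    W[src, col] = (1 - t) * total
    W[dst, col] = t * total
    return W

W = np.array([[1., 0.],             # h1 reads x1
              [0., 1.],             # h2 reads x2
              [0., 0.]])            # h3 idle
x = np.array([0.3, 0.7])

outputs = []
steps = [(1, 1, 2), (0, 0, 1), (1, 2, 0)]   # (column, from-node, to-node)
for col, src, dst in steps:                 # h3<-h2 on x2, h2<-h1 on x1, h1<-h3 on x2
    for t in np.linspace(0.0, 1.0, 11):
        outputs.append(f(handoff(W, col, src, dst, t), x))
    W = handoff(W, col, src, dst, 1.0)

print(np.allclose(outputs, x.sum()))        # True: zero loss along the entire path
print(W)                                    # permuted solution: h1 reads x2, h2 reads x1
```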
Yeah, but you'd have the same problem if you were using all three of the nodes in the hidden layer.
Actual trained neural networks are known to have redundant parameters, as demonstrated by how heavily they can be pruned without hurting accuracy.
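As a toy illustration of that redundancy (my own sketch; the weights and the 2/3 pruning fraction are made up, not measured from a real network), a hidden node carrying only tiny weights can be zeroed out with almost no effect on the computed function:

```python
import numpy as np

def prune_smallest(W, fraction):
    """Zero out the lowest-magnitude `fraction` of the entries of W."""
    k = int(round(fraction * W.size))
    if k == 0:
        return W.copy()
    thresh = np.sort(np.abs(W), axis=None)[k - 1]
    return np.where(np.abs(W) <= thresh, 0.0, W)

v = np.ones(3)
x = np.array([0.3, 0.7])
W = np.array([[0.0,   1.0],     # h1 reads x2
              [1.0,   0.0],     # h2 reads x1
              [0.001, 0.001]])  # h3 nearly idle: redundant

W_pruned = prune_smallest(W, fraction=2 / 3)
print(v @ (W @ x))          # 1.001
print(v @ (W_pruned @ x))   # 1.0 -- pruning 2/3 of the weights barely matters
```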