My troll example is a fully connected network with all weights and biases set to zero, and no skip connections.
This isn’t something you’d reach in regular training, since networks are deliberately initialized away from zero to avoid it. But it does exhibit a basic ingredient of controlling gradient flow.
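For concreteness, here's a minimal PyTorch sketch of that example (the layer sizes, data, and loss are my own illustrative choices), showing that no gradient ever reaches the weights:

```python
import torch
import torch.nn as nn

# The troll example: a plain MLP with every weight and bias zeroed.
net = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))
with torch.no_grad():
    for p in net.parameters():
        p.zero_()

x, y = torch.randn(16, 4), torch.randn(16, 1)
nn.functional.mse_loss(net(x), y).backward()

for name, p in net.named_parameters():
    print(name, p.grad.abs().max().item())
# Prints 0.0 for 0.weight, 0.bias, and 2.weight: every hidden
# activation is zero and every downstream weight is zero, so no
# gradient reaches them. Only the final bias (2.bias) gets a
# generally nonzero gradient, and updating it alone never
# unsticks the weights.
```

So gradient descent can move the output bias, but the weights themselves are pinned at zero forever.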
To look for a true hacker, I’d try to reconfigure the way the downstream computation works (by modifying attention weights, saturating ReLUs, or similar) based on some model of the inputs, in a way that pushes around where the gradients go.
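Here's a toy version of just the saturating-ReLU ingredient (all values are hypothetical, picked so the effect is visible): a single unit whose saturation depends on the input decides whether any gradient reaches the upstream weight.

```python
import torch

# A single ReLU unit acting as an input-dependent gradient gate.
w = torch.tensor([1.0], requires_grad=True)
b = torch.tensor([-2.0])  # the unit only fires when w*x + b > 0

for x in (torch.tensor([1.0]), torch.tensor([5.0])):
    if w.grad is not None:
        w.grad.zero_()
    loss = torch.relu(w * x + b).sum()
    loss.backward()
    print(f"x={x.item()}: dloss/dw = {w.grad.item()}")
# x=1.0: dloss/dw = 0.0  (pre-activation -1, ReLU saturated: gradient blocked)
# x=5.0: dloss/dw = 5.0  (pre-activation  3, ReLU active: gradient flows)
```

Which inputs saturate the unit is set by the weights themselves, so a network that "models" its inputs this way gets some say over where its own gradients flow.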
At least on most datasets, it seems to me that the zero NN fails condition 2a, since perturbing the weights will not cause it to return to zero.
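A quick way to see this empirically (toy data, sizes, and hyperparameters are my own choices): perturb the zero network slightly and train; the parameter norm grows away from zero rather than shrinking back.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))
with torch.no_grad():
    for p in net.parameters():
        p.copy_(1e-2 * torch.randn_like(p))  # small perturbation of zero

x = torch.randn(256, 4)
y = x @ torch.randn(4, 1)  # a learnable (linear) target
opt = torch.optim.SGD(net.parameters(), lr=0.05)

def param_norm():
    return torch.cat([p.flatten() for p in net.parameters()]).norm().item()

print("norm before:", param_norm())
for _ in range(1000):
    opt.zero_grad()
    nn.functional.mse_loss(net(x), y).backward()
    opt.step()
print("norm after:", param_norm())  # grows, rather than returning to zero
```

The zero network is a fixed point of gradient descent, but not an attractor: once nudged off it, training moves toward fitting the data, not back to zero.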