This is a plausible internal computation for the network to be doing, but the problem is that gradients flow back from the output, through the gradient computation, to the true value y, and so gradient descent will use that path to set the output to the appropriate true value.
Following up to clarify this: the point is that this attempt fails 2a because if you perturb the weights along the connection ∇_θ L(θ) --(ε·Id)--> output, there is now a path from the internal representation of y to the output, and so training will push the network toward the function f(D, θ) ≈ y.
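A toy sketch of this failure mode (my own construction, not from the discussion above): suppose the network's output is f = w_main·g(x) + ε·ŷ, where ŷ is an internal representation that already equals the true label y (e.g. recovered from an internal gradient computation), and ε is the small perturbation on the new connection. Once ε > 0, squared-error gradient descent has a path from ŷ to the output and grows ε until f(x) ≈ y:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = np.sin(x)            # true targets
g = np.sign(x)           # some unhelpful "main path" feature
y_hat = y.copy()         # internal representation that happens to equal y

w_main, eps = 0.5, 1e-3  # eps: tiny weight on the perturbed connection
lr = 0.1
for _ in range(2000):
    f = w_main * g + eps * y_hat
    err = f - y
    # gradients of the mean squared error wrt the two path weights
    # (up to a constant factor absorbed into the learning rate)
    w_main -= lr * np.mean(err * g)
    eps    -= lr * np.mean(err * y_hat)

mse = np.mean((w_main * g + eps * y_hat - y) ** 2)
print(f"eps -> {eps:.3f}, final MSE = {mse:.5f}")
```

Training drives eps toward 1 and the loss toward 0, i.e. the network ends up routing the internal copy of y straight to the output, which is exactly why the perturbation in 2a exposes this scheme.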