But gradient descent doesn't modify a neural network one weight at a time.
Sure, but the gradient component that is associated with a given weight is still zero if updating that weight alone would not affect loss.
What do you think the gradient of min(x, y) is?
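For concreteness, here is a quick illustrative check (my own sketch; the thread itself doesn't mention any framework, so the use of JAX here is an assumption): the gradient of min(x, y) is 1 with respect to the smaller argument and 0 with respect to the larger one, i.e. the component for the input whose lone change would not move the output is exactly zero, which is the case being described above.

```python
import jax
import jax.numpy as jnp

# Illustrative sketch: partial derivatives of min(x, y) with respect to both arguments.
grad_min = jax.grad(lambda x, y: jnp.minimum(x, y), argnums=(0, 1))

print(grad_min(2.0, 5.0))  # expect (1.0, 0.0): only the smaller input carries gradient
print(grad_min(5.0, 2.0))  # expect (0.0, 1.0)
```

(At x == y the function is not differentiable, and autodiff frameworks return a subgradient there; away from the tie, the flat direction really does get a zero gradient component.)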