adamShimi comments on Gradient hacking

adamShimi 2 Jan 2021 14:10 UTC
LW: 2 AF: 1
Hum, I would say that your logic is probably redundant, and thus might end up being removed for simplicity reasons? Whereas I expect deceptive logic to include very useful things like knowing how the optimization process works, which would definitely help achieve better performance.
But to be honest, how SGD could create gradient hacking (if it’s even possible) is a completely open research problem.
- Ofer 2 Jan 2021 14:32 UTC
  LW: 1 AF: 1
  My point was that there’s no reason for SGD to create “deceptive logic” specifically, because “deceptive logic” is not privileged over any other logic that involves modeling the base objective and acting according to it. But I now think this isn’t always true; see the edit block I just added.