Ofer comments on Thoughts on gradient hacking

Ofer 27 Apr 2022 20:03 UTC
LW: 1 AF: 1
AF
Suppose that each subnetwork does general reasoning and thus up until some point during training the subnetworks are useful for minimizing loss.
- Not Relevant 27 Apr 2022 20:19 UTC
  1 point
  Parent
  Are you saying that such a mechanism occurs by coincidence, or that it’s actively constructed? It seems like for all the intermediate steps, all consumers of the almost-identical subnetworks would naturally just pick one and use that one, since it was slightly closer to what the consumer needed.