p.b. comments on Are human imitators superhuman models with explicit constraints on capabilities?

p.b. 23 May 2022 7:41 UTC
1 point
If this model is supposed to explain double descent the question is why the model at the first local minimum isn’t more intelligent than later models with lower loss? Shouldn’t it have learned the simple model of the data without the deviations?