jsteinhardt comments on Future ML Systems Will Be Qualitatively Different

jsteinhardt 14 Jan 2022 0:00 UTC
2 points
Okay I think I get what you’re saying now—more SGD steps should increase “effective model capacity”, so per the double descent intuition we should expect the validation loss to first increase then decrease (as is indeed observed). Is that right?