Perhaps; I don’t think they tried that (though I haven’t read the paper in detail).
If by distillation you mean “train a smaller student net using the current net”, I’d expect that you’d still have some robustness, but less of it. (But I’d expect removing 30 random neurons would still not make much of a difference, unless you distilled down to a really small model.)
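For concreteness, here’s a rough sketch of what I have in mind by “distill into a smaller student, then zero out 30 random neurons and see what happens.” The layer sizes, the choice of distilling via softened logits, and all the helper names are my own illustrative assumptions, not anything from the paper:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_mlp(hidden):
    # Toy MLP; sizes are arbitrary placeholders.
    return nn.Sequential(nn.Linear(784, hidden), nn.ReLU(), nn.Linear(hidden, 10))

teacher = make_mlp(hidden=2048)   # the "current net" (assume already trained)
student = make_mlp(hidden=256)    # smaller student to distill into

def distill_step(x, optimizer, T=2.0):
    """One distillation step: match the teacher's softened logits via KL divergence."""
    with torch.no_grad():
        t_logits = teacher(x)
    s_logits = student(x)
    loss = F.kl_div(F.log_softmax(s_logits / T, dim=-1),
                    F.softmax(t_logits / T, dim=-1),
                    reduction="batchmean") * T * T
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

def ablate_random_neurons(model, k=30):
    """Zero k random hidden units (rows of the first Linear layer) in place."""
    first_linear = model[0]
    idx = torch.randperm(first_linear.out_features)[:k]
    with torch.no_grad():
        first_linear.weight[idx] = 0.0
        first_linear.bias[idx] = 0.0
    return idx
```

My guess is that running the ablation on the 2048-unit teacher barely moves accuracy, while on the 256-unit student (where 30 neurons is a much larger fraction of the layer) you’d start to see a real drop.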