I would be more impressed if he had used the information bottleneck as a simple example of a varying training condition, instead of authoritatively declaring it The Difference, accompanied with its own just so story to explain discrepancies in implementation that haven’t even been demonstrated. I’m not even sure the analogy is correct; is the 7.5MB storing training parameters or the python code?
I would be more impressed if he had used the information bottleneck as a simple example of a varying training condition, instead of authoritatively declaring it The Difference, accompanied with its own just so story to explain discrepancies in implementation that haven’t even been demonstrated. I’m not even sure the analogy is correct; is the 7.5MB storing training parameters or the python code?