gjm comments on Rethinking Batch Normalization