By the way, if you look at Filan et al.ās paper āClusterability in Neural Networksā there is a lot of variance in their results but generally speaking they find that L1 regularization leads to slightly more clusterability than L2 or dropout.
By the way, if you look at Filan et al.ās paper āClusterability in Neural Networksā there is a lot of variance in their results but generally speaking they find that L1 regularization leads to slightly more clusterability than L2 or dropout.