gwern comments on What is an appropriate sample size when surveying billions of data points?