Not sure why we are having this discussion. Statistics is a discipline with certain themes, like “intelligently using data for conclusions we want.” These themes are sufficient to give it its own character, and make it both an applied and theoretical discipline. I don’t think you are a statistician, right? Why are you talking about this?
Statistics is as much an applied discipline as physics.
You can post about whatever you want. I have objections if you start mischaracterizing what statistics is about for fun on the internet. Fun on the internet is great, being snarky on the internet is ok, misleading people is not.
edit: In fact, you can view this whole recent “data science” thing that statisticians are so worried about as a reaction to the statistics discipline becoming too theoretical and divorced from actual data analysis problems. [This is a controversial opinion, I don’t think I share it, quite.]
I don’t believe I’m mischaracterizing statistics. My original point was an observation that, in my experience, good mathematicians and good statisticians are different. Their brains work differently. To use an imperfect analogy, good C programmers and good Lisp programmers are also quite different. You just need to think in a very different manner in Lisp compared to C (and vice versa). That, of course, doesn’t mean that a C programmer can’t be passably good in Lisp.
I understand that in the academia statistics departments usually focus on theoretical statistics. That’s fine—I don’t in particular care about “official” discipline boundaries. For my purposes I would like to draw a divide between theoretical statistics and, let’s call it practical statistics. I find it useful to classify theoretical statistics as applied math, and practical statistics as something different from that.
Data science is somewhat different from traditional statistics, but I’m not sure its distinction lies on the theoretical-practical divide. As a crude approximation, I’d say that traditional statistics is mostly concerned with extracting precise and “provable” information out of small data sets, and data science tends to drown in data and so loves non-parametric models and ML in particular.
Not sure why we are having this discussion. Statistics is a discipline with certain themes, like “intelligently using data for conclusions we want.” These themes are sufficient to give it its own character, and make it both an applied and theoretical discipline. I don’t think you are a statistician, right? Why are you talking about this?
Statistics is as much an applied discipline as physics.
Because I’m interested in the subject. Do you have objections?
You can post about whatever you want. I have objections if you start mischaracterizing what statistics is about for fun on the internet. Fun on the internet is great, being snarky on the internet is ok, misleading people is not.
edit: In fact, you can view this whole recent “data science” thing that statisticians are so worried about as a reaction to the statistics discipline becoming too theoretical and divorced from actual data analysis problems. [This is a controversial opinion, I don’t think I share it, quite.]
I don’t believe I’m mischaracterizing statistics. My original point was an observation that, in my experience, good mathematicians and good statisticians are different. Their brains work differently. To use an imperfect analogy, good C programmers and good Lisp programmers are also quite different. You just need to think in a very different manner in Lisp compared to C (and vice versa). That, of course, doesn’t mean that a C programmer can’t be passably good in Lisp.
I understand that in the academia statistics departments usually focus on theoretical statistics. That’s fine—I don’t in particular care about “official” discipline boundaries. For my purposes I would like to draw a divide between theoretical statistics and, let’s call it practical statistics. I find it useful to classify theoretical statistics as applied math, and practical statistics as something different from that.
Data science is somewhat different from traditional statistics, but I’m not sure its distinction lies on the theoretical-practical divide. As a crude approximation, I’d say that traditional statistics is mostly concerned with extracting precise and “provable” information out of small data sets, and data science tends to drown in data and so loves non-parametric models and ML in particular.
Ok, I am not interested in wasting more time on this, all I am saying is:
This is misleading. Theoretical statistics is not applied math, either. I think you don’t know what you are talking about, re: this subject.
So we disagree :-)