The descriptive math part was very good, thanks—and that’s why I resisted downvoting the post. My problem is that the conclusion omits the hugely important factor that categories are useful for specific goals, and the kind of techniques you are suggesting (essentially unsupervised techniques) are context-free.
E.g. is a dead cow more similar to a dead (fixed from ‘live’) horse or to a live cow? (It clearly depends what you want to do with it)
I tend to find techniques attractive when they’re generalizable, and context-free, unsupervised techniques fit the bill. You can automate them. You can apply them to a range of projects. But you’re right—sometimes specific knowledge about a specific application matters, and you can’t generalize it away.
The two aren’t mutually exclusive, of course. You can use specific knowledge about a particular problem to make your machine learning methods work better, sometimes.
I read a paper the other day about predicting the targets of a particular type of small nucleolar RNA, which are an important part of the machinery that regulates gene expression. One of the methods they used was to run an SVM classifier on a number of features about the RNA in question. SVM classifiers are one of those nice general-purpose easily-automated methods, but the authors used their knowledge of the specific problem to pick out what features it would use for its classification. Things like the length of particular parts of the RNA—stuff that would occur to molecular biologists, but could be prohibitively expensive for a purely automatic machine learning algorithm to discover if you just gave it all the relevant data.
(More bio nerdery: they combined this with a fast approximation of the electrostatic forces at work, and ended up getting remarkably good accuracy and speed. The paper is here, if anyone’s interested.)
Belatedly, I remembered a relevant tidbit of wisdom I once got from a math professor.
When a theorist comes up with a new algorithm, it’s not going to outperform existing algorithms used commercially in the “real world.” Not even if, in principle the new algorithm is more elegant or faster or whatever. Why? Because in the real world, you don’t just take a general-purpose algorithm off the page, you optimize the hell out of it. Engineers who work with airplanes will jimmy their algorithms to accommodate all the practical “common knowledge” about airplanes. A mere mathematician who doesn’t know anything about airplanes can’t compete with that.
If you’re a theorist trying to come up with a better general method your goal is to give evidence that your algorithm will do better than the existing one after you optimize the hell out of them equally.
The descriptive math part was very good, thanks—and that’s why I resisted downvoting the post. My problem is that the conclusion omits the hugely important factor that categories are useful for specific goals, and the kind of techniques you are suggesting (essentially unsupervised techniques) are context-free.
E.g. is a dead cow more similar to a dead (fixed from ‘live’) horse or to a live cow? (It clearly depends what you want to do with it)
That’s a good point.
I tend to find techniques attractive when they’re generalizable, and context-free, unsupervised techniques fit the bill. You can automate them. You can apply them to a range of projects. But you’re right—sometimes specific knowledge about a specific application matters, and you can’t generalize it away.
The two aren’t mutually exclusive, of course. You can use specific knowledge about a particular problem to make your machine learning methods work better, sometimes.
I read a paper the other day about predicting the targets of a particular type of small nucleolar RNA, which are an important part of the machinery that regulates gene expression. One of the methods they used was to run an SVM classifier on a number of features about the RNA in question. SVM classifiers are one of those nice general-purpose easily-automated methods, but the authors used their knowledge of the specific problem to pick out what features it would use for its classification. Things like the length of particular parts of the RNA—stuff that would occur to molecular biologists, but could be prohibitively expensive for a purely automatic machine learning algorithm to discover if you just gave it all the relevant data.
(More bio nerdery: they combined this with a fast approximation of the electrostatic forces at work, and ended up getting remarkably good accuracy and speed. The paper is here, if anyone’s interested.)
Belatedly, I remembered a relevant tidbit of wisdom I once got from a math professor.
When a theorist comes up with a new algorithm, it’s not going to outperform existing algorithms used commercially in the “real world.” Not even if, in principle the new algorithm is more elegant or faster or whatever. Why? Because in the real world, you don’t just take a general-purpose algorithm off the page, you optimize the hell out of it. Engineers who work with airplanes will jimmy their algorithms to accommodate all the practical “common knowledge” about airplanes. A mere mathematician who doesn’t know anything about airplanes can’t compete with that.
If you’re a theorist trying to come up with a better general method your goal is to give evidence that your algorithm will do better than the existing one after you optimize the hell out of them equally.
Your example doesn’t quite make sense to me. Did you mean “is a dead cow more similar to a dead horse or to a live cow”, or …?