I wouldn’t generalize too much from a forecasting competition.
Per Wolpert’s No Free Lunch theorems, algorithm performance depends on fit to the problem domain. The winner is likely someone who lucked out: the choice of performance evaluation happened to fit his algorithm better than it fit his competitors’. It doesn’t mean he’ll win the next competition, and it doesn’t mean he isn’t good, but it likely means he was good and lucky.
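To make the “lucked out on the evaluation” point concrete, here is a toy sketch in Python (made-up error numbers, not from any actual competition) where the ranking of two forecasters flips depending on whether the organizers score by mean absolute error or root mean squared error:

    import numpy as np

    # Two forecasters' absolute errors on the same five test points
    # (hypothetical numbers, chosen only to make the point).
    errors_a = np.array([1.0, 1.0, 1.0, 1.0, 1.0])  # steady, medium errors
    errors_b = np.array([0.0, 0.0, 0.0, 0.0, 3.0])  # usually exact, one big miss

    for name, e in [("A", errors_a), ("B", errors_b)]:
        mae = e.mean()                    # mean absolute error
        rmse = np.sqrt((e ** 2).mean())   # root mean squared error
        print(f"{name}: MAE={mae:.2f}  RMSE={rmse:.2f}")

    # A: MAE=1.00  RMSE=1.00
    # B: MAE=0.60  RMSE=1.34
    # B wins under MAE, A wins under RMSE: same forecasts, different champion.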
How do we judge the potential and promise of the new, complicated forecasting method?
Theory and judgment play a part.
When I first saw the Deep Learning method presented by Hinton, I was confident it would be good before I ever saw the results: it looked like a great theoretical approach, one that attacked the problem the right way.
Same thing with Wolpert and Stacked Generalization; a minimal sketch of stacking follows below.
What to bet on? Things that look good theoretically but are currently computationally cost-prohibitive. As computers improve, there is an algorithmic land grab, with researchers rushing into the areas that become computationally tractable.
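For readers who haven’t seen it, here is a minimal sketch of Wolpert-style stacked generalization (scikit-learn, with placeholder data and arbitrary base-model choices; none of this comes from the competition under discussion):

    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.linear_model import LinearRegression, Ridge
    from sklearn.model_selection import cross_val_predict
    from sklearn.neighbors import KNeighborsRegressor

    # Placeholder data; any regression problem works here.
    X, y = make_regression(n_samples=500, n_features=10, noise=5.0, random_state=0)

    # Level-0 generalizers, chosen to have different inductive biases.
    level0 = [Ridge(alpha=1.0), KNeighborsRegressor(n_neighbors=7)]

    # Wolpert's key move: build the level-1 training set from
    # out-of-fold predictions, so the combiner never sees a prediction
    # a level-0 model made on its own training data.
    meta_features = np.column_stack(
        [cross_val_predict(m, X, y, cv=5) for m in level0]
    )

    # The level-1 generalizer learns how to combine the level-0 outputs.
    combiner = LinearRegression().fit(meta_features, y)

    # At prediction time, refit level-0 on all the data and feed their
    # predictions through the combiner.
    for m in level0:
        m.fit(X, y)
    new_preds = np.column_stack([m.predict(X[:5]) for m in level0])
    stacked = combiner.predict(new_preds)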
Aren’t all these forecasting competitions using real data from real-world problems, making NFL irrelevant?
NFL not relevant to the real world? Would you like to elaborate?
Real-world problems are not a random sampling from all possible problems and there’s plenty of structure to exploit, so invoking NFL in this context seems odd to me.
A real-world competition isn’t a random sample of anything. It’s a selection of some problems, with some data. The performance of any algorithm will depend on fit to those problems, with those data.
My takeaways from the NFL theorems: the problems in the real world are some structured subset of all possible problems, and the performance of any generalizer on a problem will depend on its fit to that problem.
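For reference, the supervised-learning form of the theorem says, roughly in Wolpert’s notation, that for any two learning algorithms $a_1$ and $a_2$, off-training-set error averaged uniformly over all target functions $f$ is the same:

\[ \sum_f P(c \mid f, m, a_1) = \sum_f P(c \mid f, m, a_2) \]

where $c$ is the off-training-set cost and $m$ the training-set size. Uniform averaging over all $f$ is exactly the assumption the real world violates, which is the point of the “structured subset” takeaway.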
That’s not chopped liver.