[Linkpost] Growth in FLOPS used to train ML models
This is a linkpost for https://shape-of-code.com/2022/03/13/growth-in-flops-used-to-train-ml-models/
Given the ongoing history of continually increasing compute power, what is the maximum compute power that might be available to train ML models in the coming years?
Speaking of compute and experience curves, Karpathy just posted about replicating Le Cun’s 1989 pre-MNIST digit-classification results and what difference compute & methods make: https://karpathy.github.io/2022/03/14/lecun1989/
Thanks, an interesting read until the author peers into the future. Moore’s law is on its last legs, so the historical speed-ups will soon be just that: something that once happened. There are still some performance improvements to come from special-purpose CPUs, and half-precision floating point will reduce memory traffic (which can then be traded for CPU performance).
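A quick illustration of that last point (a minimal sketch using NumPy, with an arbitrary weight-matrix size chosen only for the example): the same parameters stored in half precision occupy half as many bytes as in single precision, so each pass over them moves half the data, and that saved memory bandwidth can be spent keeping the compute units busy.

```python
import numpy as np

# Hypothetical 4096 x 4096 weight matrix, stored first in single
# precision (fp32) and then in half precision (fp16).
weights_fp32 = np.zeros((4096, 4096), dtype=np.float32)
weights_fp16 = weights_fp32.astype(np.float16)

print(weights_fp32.nbytes)  # 67108864 bytes (~64 MiB)
print(weights_fp16.nbytes)  # 33554432 bytes (~32 MiB): half the memory traffic per pass
```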
Thanks for writing! I don’t see an actual answer to the question asked at the beginning: “Given the ongoing history of continually increasing compute power, what is the maximum compute power that might be available to train ML models in the coming years?” Did I miss it?