Yep, I think you’re right that both views are compatible. On the performance comparison: the architectures are quite different, so while raw floating-point throughput gives you a rough idea of a device’s capabilities, performance on specific benchmarks can vary a lot. Optimization adds another dimension entirely; NVIDIA, for example, ships highly optimized DNN libraries that achieve a very impressive fraction of raw floating-point performance on their GPU hardware. AFAIK nobody is putting that kind of effort (teams of engineers over several months) into optimizing deep learning models on CPUs these days, because the return on investment isn’t there.
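Just to make the "fraction of raw floating-point performance" idea concrete, here's a rough sketch (not a real benchmark): time a naive matmul, count its 2*N^3 flops, and divide by an assumed peak figure. The 100 GFLOP/s peak below is a made-up placeholder, not any real device's spec.

```c
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define N 512

static double A[N][N], B[N][N], C[N][N];

int main(void) {
    /* fill inputs with random data */
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++) {
            A[i][j] = (double)rand() / RAND_MAX;
            B[i][j] = (double)rand() / RAND_MAX;
        }

    struct timespec t0, t1;
    clock_gettime(CLOCK_MONOTONIC, &t0);
    /* naive triple loop: 2*N^3 floating-point operations */
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            for (int k = 0; k < N; k++)
                C[i][j] += A[i][k] * B[k][j];
    clock_gettime(CLOCK_MONOTONIC, &t1);

    double secs = (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) * 1e-9;
    double achieved = 2.0 * N * N * N / secs;  /* achieved FLOP/s */
    double peak = 100e9;                       /* assumed peak FLOP/s (placeholder) */
    printf("achieved %.2f GFLOP/s, ~%.1f%% of assumed peak\n",
           achieved / 1e9, 100.0 * achieved / peak);
    return 0;
}
```

Heavily tuned libraries push that percentage close to the hardware's real peak; naive code like the above usually sits far below it.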
Thanks! To make sure I’m following, does optimization help just by improving utilization?
Yeah, pretty much. If you think about mapping something like a matrix multiply onto a specific hardware device, details like how the data is laid out in memory, using the cache hierarchy effectively, and moving data around the system efficiently all matter a lot for performance.
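Here's a rough sketch of the data-layout/cache point, assuming row-major C arrays: both loop nests below do the same 2*N^3 work, but the i-k-j order walks B and C along rows (unit stride), so it typically runs several times faster than the column-striding i-j-k order, with identical flop counts.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

#define N 768

static double A[N][N], B[N][N], C[N][N];

static double seconds(struct timespec a, struct timespec b) {
    return (b.tv_sec - a.tv_sec) + (b.tv_nsec - a.tv_nsec) * 1e-9;
}

int main(void) {
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++) {
            A[i][j] = (double)rand() / RAND_MAX;
            B[i][j] = (double)rand() / RAND_MAX;
        }

    struct timespec t0, t1;

    /* i-j-k: the inner loop strides down a column of B, so almost
     * every access touches a different cache line */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++) {
            double sum = 0.0;
            for (int k = 0; k < N; k++)
                sum += A[i][k] * B[k][j];
            C[i][j] = sum;
        }
    clock_gettime(CLOCK_MONOTONIC, &t1);
    printf("i-j-k order: %.3f s\n", seconds(t0, t1));

    memset(C, 0, sizeof C);

    /* i-k-j: the inner loop walks a row of B and a row of C sequentially,
     * so consecutive iterations reuse data already sitting in cache */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < N; i++)
        for (int k = 0; k < N; k++) {
            double a = A[i][k];
            for (int j = 0; j < N; j++)
                C[i][j] += a * B[k][j];
        }
    clock_gettime(CLOCK_MONOTONIC, &t1);
    printf("i-k-j order: %.3f s\n", seconds(t0, t1));
    return 0;
}
```

Real libraries go much further than reordering loops (tiling, vectorization, prefetching, parallelism), but this is the flavor of what "utilizing the cache hierarchy effectively" buys you.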