sairjy comments on Open & Welcome Thread—August 2020

sairjy 13 Aug 2020 12:33 UTC
3 points
They seem focused on inferencing, which requires a lot less compute than training a model. Example: GPT-3 required thousands of GPUs for training, but it can run on less than 20 GPUs.
Microsoft built an Azure supercluster for OpenAI and it has 10,000 GPUs.
- ChristianKl 13 Aug 2020 12:47 UTC
  2 points
  Parent
  There will be models trained with a lot more compute then GPT-3 and the best models that are out there will be build on those huge billion dollar models. Renting out those billion dollar models in a software as a service way makes sense as a business model. The big cloud providers will all do it.