I think you probably underrate the effect of having both a large number & a high concentration of very high-quality researchers & engineers (more than OpenAI now, I think, and I wouldn't be too surprised if the concentration of high-quality researchers were higher than at GDM), of being free from corporate cruft, and of many of those researchers believing (perhaps correctly, I don't know) that they're value-aligned with the overall direction of the company. Nvidia rate-limiting the large labs' GPU purchases to keep competition among the AI companies alive probably also plays a role.
All of this is compounded by smart models enabling better data curation and RLAIF (given quality researchers & a lack of cruft), which in turn yields even better models (this is the big reason I think Llama had to be so big to be SOTA, and why Gemini isn't even SOTA), which of course leads to money in the future even if they have no money now.
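Concretely, the loop I have in mind looks something like this toy sketch (the judge below is a crude lexical heuristic standing in for "prompt the current best model to score a sample", and none of the names reflect any lab's actual pipeline):

```python
def judge_quality(sample: str) -> float:
    """Stand-in for an RLAIF-style judge: in the real loop this would be
    the current best model scoring a candidate training sample."""
    # Toy proxy so the sketch runs: reward lexical diversity.
    return min(1.0, len(set(sample.split())) / 10)

def curate(corpus: list[str], threshold: float = 0.5) -> list[str]:
    """Keep only samples the judge rates above the threshold; the curated
    set then trains the next model, which becomes the next judge."""
    return [s for s in corpus if judge_quality(s) >= threshold]

corpus = [
    "the the the the the",
    "a genuinely varied, information dense sentence about model training",
]
print(curate(corpus))  # only the higher-quality sample survives
```

Each turn of that flywheel makes the judge smarter, which makes the corpus cleaner, which makes the next model smarter; that's the compounding I mean.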
How many parameters do you estimate for other SOTA models?
Mistral had like 150B parameters or something.