nostalgebraist comments on [Link] Training Compute-Optimal Large Language Models
nostalgebraist
1 Apr 2022 20:08 UTC
2 points
It’s strongly implied to be 2048, as in Gopher.