This makes sense; I think you could be right. Llama 4 should give us more evidence on numerical precision and the scaling of experts.