Archimedes comments on LLMs Look Increasingly Like General Reasoners

Archimedes 9 Nov 2024 17:02 UTC
7 points
Additional data points:
o1-preview and the new Claude 3.5 Sonnet both significantly improved over prior models on SimpleBench.
The math, coding, and science benchmarks in the o1 announcement post: