p.b. comments on Jesse Hoogland’s Shortform

p.b. 11 Dec 2024 8:17 UTC
4 points
2
You are skipping over a very important component: Evaluation.
Which is exactly what we don’t know how to do well enough outside of formally verifiable domains like math and code, which is exactly where o1 shows big performance jumps.