ozziegooen comments on $300 Fermi Model Competition

ozziegooen 3 Feb 2025 21:22 UTC
2 points
0
By the way—I imagine you could do a better job with the evaluation prompts by having another LLM pass, where it formalizes the above more and adds more context. For example, with an o1/R1 pass/Squiggle AI pass, you could probably make something that considers a few more factors with this and brings in more stats.