Wolfram have updated their LLM benchmarks since you posted this, and they now show Llama3.1-405b-instruct in first place.