cubefox comments on “AI achieves silver-medal standard solving International Mathematical Olympiad problems”

cubefox 26 Jul 2024 1:43 UTC
2 points
−3
The diagram actually says it uses the AlphaZero algorithm. Which obviously doesn’t involve an LLM.
- gjm 26 Jul 2024 9:49 UTC
  8 points
  0
  Parent
  The AlphaZero algorithm doesn’t obviously not involve an LLM. It has a “policy network” to propose moves, and I don’t know what that looks like in the case of AlphaProof. If I had to guess blindly I would guess it’s an LLM, but maybe they’ve got something else instead.