In 2021, I predicted math to be basically solved by 2023 (using the kind of reinforcement learning on formally checkable proofs that DeepMind is using).
It’s been slower than expected, and I wouldn’t have guessed that a less formal setting like o1 would go relatively well, but since then I just nod along to these kinds of results.
(Not sure what to think of that claimed 95% number though: wouldn’t that kind of imply they’d blown past the IMO Grand Challenge?
EDIT: There were significant time limits on the human participants, see Qumeric’s comment.)