tailcalled comments on LLM Generality is a Timeline Crux

tailcalled 25 Jun 2024 5:32 UTC
10 points
3
I always feel like self-play on math with a proof checker like Agda or Coq is a promising way to make LLMs superhuman on these areas. Do we have any strong evidence that it’s not?
- eggsyntax 25 Jun 2024 10:34 UTC
  3 points
  0
  Parent
  Do you mean as a (presumably RL?) training method to make LLMs themselves superhuman in that area, or that the combined system can be superhuman? I think AlphaCode is some evidence for the latter, with the compiler in the role of proof-checker.
  - tailcalled 25 Jun 2024 10:44 UTC
    2 points
    0
    Parent
    The former