Daniel Tan comments on Daniel Tan’s Shortform

Daniel Tan Jan 2, 2025, 5:36 PM
It turns out that “train the model such that its CoT is predictable by a different model” is exactly the idea in prover-verifier games (PVGs). That’s very exciting! Do PVGs reduce introspection or steganography?