To me, this comparison to humans doesn’t seem to answer why the o1 training ended up producing this result.
Convergence. Humans and LLMs with deliberation are doing the same kind of thing, so they end up making the same class of errors.