Neel Nanda comments on Zach Stein-Perlman’s Shortform

Neel Nanda 12 Dec 2024 16:19 UTC
6 points
2
It seems unlikely that OpenAI is truly following the "test the model" plan. They keep, e.g., putting new experimental versions onto LMSYS, which presumably differ mostly in their post-training, and it seems pretty expensive to redo all the dangerous capability (DC) evals on each new version. (And I think it’s pretty reasonable to assume that a bit of further post-training hasn’t made things much more dangerous.)