Rushed because of DeepSeek?
Similar opinion here, which also notes that they didn’t run the red-teaming and persuasion evals on the actually-final-version:
https://x.com/teortaxesTex/status/1885401111659413590
Asking for this is a bit pointless: even after the actually-final-version there will be a next update for which the non-automated evals won’t be redone. So running the non-automated evals only on some earlier version is just as reasonable as running them on the actually-final one.