Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Ollie J comments on
[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Ollie J
13 Jun 2024 12:15 UTC
2
points
0
Fixed, thanks for flagging
Back to top
Fixed, thanks for flagging