Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Christopher King comments on
Alignment Faking in Large Language Models
Christopher King
18 Dec 2024 21:13 UTC
1
point
2
Now see if you can catch sandbagging in the scratchpad!
Back to top
Now see if you can catch sandbagging in the scratchpad!