Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
ryan_greenblatt comments on
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
ryan_greenblatt
16 Jan 2024 2:12 UTC
3
points
0
See TurnTrout’s shortform
here
for some more discussion.
Back to top
See TurnTrout’s shortform here for some more discussion.