Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Kola Ayonrinde comments on
On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback
[ ]
[deleted]
Back to top
Kola Ayonrinde comments on On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback