Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Kola Ayonrinde comments on
Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback
[ ]
[deleted]
Back to top
Kola Ayonrinde comments on Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback