ReaderM
Karma: 107
Large Language Models can Strategically Deceive their Users when Put Under Pressure.
ReaderM, 15 Nov 2023 16:36 UTC, 89 points, 9 comments, 2 min read, LW link, 1 review (arxiv.org)