the gears to ascension comments on
Large Language Models can Strategically Deceive their Users when Put Under Pressure.
the gears to ascension
17 Nov 2023 4:11 UTC
3 points
0
it could be, but it smells of RLHF to my brain.