Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
evhub comments on
Modulating sycophancy in an RLHF model via activation steering
evhub
9 Aug 2023 23:44 UTC
LW: 2 AF: 2
0
AF
(Moderation note: added to the Alignment Forum from LessWrong.)
Back to top
(Moderation note: added to the Alignment Forum from LessWrong.)