Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
James Payor comments on
In the Short-Term, Why Couldn’t You Just RLHF-out Instrumental Convergence?
James Payor
16 Sep 2023 21:32 UTC
1
point
1
See also:
LLMs Sometimes Generate Purely Negatively-Reinforced Text
Back to top
See also: LLMs Sometimes Generate Purely Negatively-Reinforced Text