dmz comments on
High-stakes alignment via adversarial training [Redwood Research report]
dmz
9 May 2022 17:21 UTC
LW: 3 AF: 2
Indeed. (Well, holding the quality degradation fixed, which causes a small change in the threshold.)