Stuart_Armstrong comments on "Benchmark for successful concept extrapolation/avoiding goal misgeneralization"
Stuart_Armstrong
7 Jul 2022 13:24 UTC
I’d say do two challenges: one at a mix rate of 0.5, one at a mix rate of 0.1.