Ramana Kumar answers
Is there any rigorous work on using anthropic uncertainty to prevent situational awareness / deception?
Ramana Kumar
26 Sep 2024 9:21 UTC
Perhaps vaguely related is the work on Decoupled Approval: https://arxiv.org/abs/2011.08827