Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
tailcalled comments on
Don’t design agents which exploit adversarial inputs
tailcalled
18 Nov 2022 19:33 UTC
2
points
0
Maybe if you have a good measure of being in-distribution, which itself is a nontrivial problem.
Back to top
Maybe if you have a good measure of being in-distribution, which itself is a nontrivial problem.