paulfchristiano comments on Reliability amplification

paulfchristiano 2 Feb 2019 20:56 UTC
2 points
0
Yes, when I say:
Given a distribution A over policies that ε-close to a benign policy for some ε ≪ 1, can we implement a distribution A⁺ over policies which is δ-close to a benign policy of similar capability, for some δ ≪ ε?
a “benign” policy has to be benign for all inputs. (See also security amplification, stating the analogous problem where a policy is “mostly” benign but may fail on a “small” fraction of inputs.)