Charlie Steiner comments on Open Problems in Negative Side Effect Minimization

Charlie Steiner 6 May 2022 20:22 UTC
LW: 6 AF: 2
0
AF
There’s definitely a tension here between avoiding bad disruptive actions and doing good disruptive actions.
It seems to me like you’re thinking about SEM more like a prior that starts out dominant but can get learned away over time. Is that somewhat close to how you’re thinking about this tension?
- Fabian Schimpf 13 May 2022 8:39 UTC
  2 points
  0
  AF Parent
  Starting more restrictive seems sensible; this could be, as you say, learned away, or one could use human feedback to sign off on high-impact actions. The first problem reminds me of finding regions of attractions in nonlinear control where the ROA is explored without leaving the stable region. The second approach seems to hinge on humans being able to understand the implications of high-impact actions and the consequences of a baseline like inaction. There are probably also other alternatives that we have not yet considered.
- [ ]
  [deleted]