Stuart_Armstrong comments on A probabilistic off-switch that the agent is indifferent to

Stuart_Armstrong 26 Sep 2018 8:44 UTC
LW: 3 AF: 2
Interesting. I’ll think about whether this works and can be generalised. It doesn’t provide reflective stability, since creating u-maximising subagents is still allowed and doesn’t directly hurt the agent, but it might improve the situation.
What links here?
- A probabilistic off-switch that the agent is indifferent to by Ofer (25 Sep 2018 13:13 UTC; 11 points)