I think the property of handling shutdown depends on the choice of absolute value or truncation at 0 in the deviation measure, not the choice of the core part of the deviation measure. RR doesn’t handle shutdown because by default it is set to only penalize reductions in reachability (using truncation at 0). I would expect that replacing the truncation with absolute value (thus penalizing increases in reachability as well) would result in handling shutdown (but break the asymmetry property from the RR paper). Similarly, AUP could be modified to only penalize reductions in goal-achieving ability by replacing the absolute value with truncation, which I think would make it satisfy the asymmetry property but not handle shutdown.
Yeah, I agree with that. I had to cut out a lot of interesting thoughts about it to keep it short, but re-reading the summary I wish I had included a link to your comment, which I found quite helpful. I’ll probably add a note to the next newsletter about it.
Thanks Rohin for a great summary as always!
I think the property of handling shutdown depends on the choice of absolute value or truncation at 0 in the deviation measure, not the choice of the core part of the deviation measure. RR doesn’t handle shutdown because by default it is set to only penalize reductions in reachability (using truncation at 0). I would expect that replacing the truncation with absolute value (thus penalizing increases in reachability as well) would result in handling shutdown (but break the asymmetry property from the RR paper). Similarly, AUP could be modified to only penalize reductions in goal-achieving ability by replacing the absolute value with truncation, which I think would make it satisfy the asymmetry property but not handle shutdown.
More thoughts on independent design choices here.
Yeah, I agree with that. I had to cut out a lot of interesting thoughts about it to keep it short, but re-reading the summary I wish I had included a link to your comment, which I found quite helpful. I’ll probably add a note to the next newsletter about it.