Ofer comments on Which counterfactuals should an AI follow?

Ofer 8 Apr 2021 10:03 UTC
LW: 3 AF: 3
AF
Maybe “logical counterfactuals” are also relevant here (in the way I’ve used them in this post). For example, consider a reward function that depends on whether the first 100 digits after the $10^{100}$ th digit in the decimal representation of $π$ are all 0. I guess this example is related to the “closest non-expert model” concept.