Stuart_Armstrong comments on Best utility normalisation method to date?

Stuart_Armstrong 7 Sep 2019 23:09 UTC
LW: 2 AF: 1
AF
Hum… It seems that we can stratify here. Let $X$ represent the values of a collection of variables that we are uncertain about, and that we are stratifying on.

When we compute the normalising factor for utility $U$ under two policies $π$ and $π^{'}$ , we normally do it as:
- $U \to U / N_{U}$ , with $N_{U} = \sum_{x} P (X = x) (E_{π, X = x} U - E_{π^{'}, X = x} U)$ .
And then we replace $U$ with $U / N_{U}$ .

Instead we might normalise the utility $U$ separately for each value of $x$ :
- Conditional on $X = x$ , then $U \to U / N_{U, x}$ , with $N_{U, x} = E_{π, X = x} U - E_{π^{'}, X = x} U$ .
The problem is that, since we’re dividing by the $N$ , the expectation of $U / N_{U, x}$ is not the same $U / N_{U}$ .

Is there an obvious improvement on this?

Note that here, total utilitarianism get less weight in large universes, and more in small ones.

I’ll think more...