Math appendix for: "Why you must maximize expected utility"

This is a mathematical appendix to my post “Why you must maximize expected utility”, giving precise statements and proofs of some results about von Neumann-Morgenstern utility theory without the Axiom of Continuity. I wish I had the time to make this post more easily readable, giving more intuition; the ideas are rather straight-forward and I hope they won’t get lost in the line noise!

The work here is my own (though closely based on the standard proof of the VNM theorem), but I don’t expect the results to be new.

I represent preference relations as total preorders $≼$ on a simplex $Δ_{N}$ ; define $≺$ , $\sim$ , $≽$ and $≻$ in the obvious ways (e.g., $x \sim y$ iff both $x ≼ y$ and $y ≼ x$ , and $x ≺ y$ iff $x ≼ y$ but not $y ≼ x$ ). Write $e^{i}$ for the $i$ ’th unit vector in $R^{N}$ .

In the following, I will always assume that $≼$ satisfies the independence axiom: that is, for all $x, y, z \in Δ_{N}$ and $p \in (0, 1]$ , we have $x ≺ y$ if and only if $p x (1 - p) z ≺ p y (1 - p) z$ . Note that the analogous statement with weak preferences follows from this: $x ≼ y$ holds iff $y ⊀ x$ , which by independence is equivalent to $p y (1 - p) z ⊀ p x (1 - p) z$ , which is just $p x (1 - p) z ≼ p y (1 - p) z$ .

Lemma 1 (more of a good thing is always better). If $x ≺ y$ and $0 \leq p < q \leq 1$ , then $(1 - p) x p y ≺ (1 - q) x q y$ .

Proof. Let $r := q - p$ . Then, $(1 - p) x p y = ((1 - q) x p y) r x$ and $(1 - q) x q y = ((1 - q) x p y) r y$ . Thus, the result follows from independence applied to $x$ , $y$ , $\frac{1}{1 - r} ((1 - q) x p y)$ , and $r$ . $□$

Lemma 2. If $x ≼ y ≼ z$ and $x ≺ z$ , then there is a unique $p \in [0, 1]$ such that $(1 - q) x q z ≺ y$ for $q \in [0, p)$ and $y ≺ (1 - q) x q z$ for $q \in (p, 1]$ .

Proof. Let $p$ be the supremum of all $r \in [0, 1]$ such that $(1 - r) x r z ≼ y$ (note that by assumption, this condition holds for $r = 0$ ). Suppose that $0 \leq q < p$ . Then there is an $r \in (q, p]$ such that $(1 - r) x r z ≼ y$ . By Lemma 1, we have $(1 - q) x q z ≺ (1 - r) x r z$ , and the first assertion follows.

Suppose now that $p < q \leq 1$ . Then by definition of $p$ , we do not have $(1 - q) x q z ≼ y$ , which means that we have $(1 - q) x q z ≻ y$ , which was the second assertion.

Finally, uniqueness is obvious, because if both $p$ and $p^{'}$ satisfied the condition, we would have $y ≺ (1 - \frac{p p^{'}}{2}) x \frac{p p^{'}}{2} z ≺ y$ . $□$

Definition 3. $x$ is much better than $y$ , notation $x ≻_{*} y$ or $y ≺_{*} x$ , if there are neighbourhoods $U$ of $x$ and $V$ of $y$ (in the relative topology of $Δ_{N}$ ) such that we have $x^{'} ≻ y^{'}$ for all $x^{'} \in U$ and $y^{'} \in V$ . (In other words, the graph of $≻_{*}$ is the interior of the graph of $≻$ .) Write $x ≼_{*} y$ or $y ≽_{*} x$ when $x ⊁_{*} y$ ( $x$ is not much better than $y$ ), and $x \sim_{*} y$ ( $x$ is about as good as $y$ ) when both $x ≼_{*} y$ and $x ≽_{*} y$ .

Theorem 4 (existence of a utility function). There is a $u \in R^{N}$ such that for all $x, y \in Δ_{N}$ ,

$\sum_{i} x_{i} u_{i} < \sum_{i} y_{i} u_{i} ⟺ x ≺_{*} y ⟹ x ≺ y .$

Unless $x \sim y$ for all $x$ and $y$ , there are $i, j \in {1, \dots, N}$ such that $u_{i} \neq u_{j}$ .

Proof. Let $i$ be a worst and $j$ a best outcome, i.e. let $i, j \in {1, \dots, N}$ be such that $e^{i} ≼ e^{k} ≼ e^{j}$ for all $k \in {1, \dots, N}$ . If $e^{i} \sim e^{j}$ , then $e^{i} \sim e^{k}$ for all $k$ , and by repeated applications of independence we get $x \sim e^{i} \sim y$ for all $x, y \in Δ_{N}$ , and therefore $x \sim_{*} y$ again for all $x, y \in Δ_{N}$ , and we can simply choose $u = 0$ .

Thus, suppose that $e^{i} ≺ e^{j}$ . In this case, let $u$ be such that for every $k \in {1, \dots, N}$ , $u_{k}$ equals the unique $p$ provided by Lemma 2 applied to $e^{i} ≼ e^{k} ≼ e^{j}$ and $e^{i} ≺ e^{j}$ . Because of Lemma 1, $u_{i} = 0 \neq 1 = u_{j}$ . Let $f (r) := (1 - r) e^{i} r e^{j}$ .

We first show that $p := \sum_{k} x_{k} u_{k} < \sum_{k} y_{k} u_{k} =: q$ implies $x ≺ y$ . For every $k$ , we either have $u_{k} < 1$ , in which case by Lemma 2 we have $e^{k} ≺ f (u_{k} ϵ_{k})$ for arbitrarily small $ϵ_{k} > 0$ , or we have $u_{k} = 1$ , in which case we set $ϵ_{k} := 0$ and find $e^{k} ≼ e^{j} = f (u_{k} ϵ_{k})$ . Set $ϵ := \sum_{k} x_{k} ϵ_{k}$ . Now, by independence applied $N - 1$ times, we have $x = \sum_{k} x_{k} e^{k} ≼ \sum_{k} x_{k} f (u_{k} ϵ_{k}) = f (p ϵ)$ ; analogously, we obtain $y ≽ f (q - δ)$ for arbitrarily small $δ > 0$ . Thus, using $p < q$ and Lemma 1, $x ≼ f (p ϵ) ≺ f (q - δ) ≼ y$ and therefore $x ≺ y$ as claimed. Now note that if $\sum_{k} x_{k} u_{k} < \sum_{k} y_{k} u_{k}$ , then this continues to hold for $x^{'}$ and $y^{'}$ in a sufficiently small neighbourhood of $x$ and $y$ , and therefore we have $x ≺_{*} y$ .

Now suppose that $\sum_{k} x_{k} u_{k} \geq \sum_{k} y_{k} u_{k}$ . Since we have $u_{i} = 0$ and $u_{j} = 1$ , we can find points $x^{'}$ and $y^{'}$ arbitrarily close to $x$ and $y$ such that the inequality becomes strict (either the left-hand side is smaller than one and we can increase it, or the right-hand side is greater than zero and we can decrease it, or else the inequality is already strict). Then, $x^{'} ≻ y^{'}$ by the preceding paragraph. But this implies that $x ⊀_{*} y$ , which completes the proof. $□$

Corollary 5. $≼_{*}$ is a preference relation (i.e., a total preorder) that satisfies independence and the von Neumann-Morgenstern continuity axiom.

Proof. It is well-known (and straightforward to check) that this follows from the assertion of the theorem. $□$

Corollary 6. $u$ is unique up to affine transformations.

Proof. Since $u$ is a VNM utility function for $≼_{*}$ , this follows from the analogous result for that case. $□$

Corollary 7. Unless $x \sim y$ for all $x, y \in Δ_{N}$ , for all $r \in R$ the set ${x \in Δ_{N} : \sum_{i} x_{i} u_{i} = r}$ has lower dimension than $Δ_{N}$ (i.e., it is the intersection of $Δ_{N}$ with a lower-dimensional subspace of $R^{N}$ ).

Proof. First, note that the assumption implies that $N \geq 2$ . Let $v \in R^{N}$ be given by $v_{i} = 1$ , $\forall i$ , and note that $Δ_{N}$ is the intersection of the hyperplane $A := {x \in R^{N} : x \cdot v = 1}$ with the closed positive orthant