Donald Hobson comments on Self-Reference Breaks the Orthogonality Thesis

Donald Hobson 20 Feb 2023 11:20 UTC
2 points
Yes, an integral is exactly what I intended to communicate.
Think of hypothesis space: a vast abstract space of all possibilities. Each hypothesis $x$ has a probability $P(x)$ of being true, and a utility $U_{a}(x)$ for action $a$ if it is true.
To really evaluate an action, you need to calculate $\int P(x) U_{a}(x) \, dx$, an integral over all hypotheses.
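As a rough illustration of that integral, here is a minimal sketch on a toy one-dimensional hypothesis space. Everything in it is an assumption made for the example: the Gaussian `posterior`, the two hypothetical actions `a1` and `a2`, their quadratic utilities, and the use of a Riemann sum on a grid in place of a real integral over hypotheses.

```python
import numpy as np

# Hypothetical 1-D hypothesis space, discretized on a fine grid.
xs = np.linspace(-10.0, 10.0, 20001)
dx = xs[1] - xs[0]

def posterior(x, mean=1.0, std=0.5):
    """Assumed P(x): a normalized Gaussian density over hypotheses."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2 * np.pi))

# Assumed utilities U_a(x) for two made-up actions.
utilities = {
    "a1": lambda x: -(x - 1.0) ** 2,
    "a2": lambda x: -(x + 1.0) ** 2,
}

def expected_utility(u):
    """Riemann-sum approximation of the integral of P(x) * U_a(x) dx."""
    return float(np.sum(posterior(xs) * u(xs)) * dx)

scores = {a: expected_utility(u) for a, u in utilities.items()}
best_action = max(scores, key=scores.get)
print(scores, best_action)
```

In this toy setup the action whose utility peaks where the probability mass sits comes out ahead. A real hypothesis space is far too large to grid like this, which is why the integral is expensive in practice.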
If you don’t want to behave with maximum intelligence, just pretty good intelligence, then you can run gradient descent on $-P(x)$ to find a point $X$ that (at least locally) maximizes $P(x)$. Then you can calculate $U_{a}(X)$ to compare actions. More sophisticated methods would sum over several points.
This is partly using the known structure of the problem. If you have good evidence, then the function $P(x)$ is basically 0 almost everywhere. So if $U_{a}(x)$ changes fairly slowly over the region where $P(x)$ is significantly nonzero, then evaluating $U_{a}$ at any point in that region gives a good estimate of the integral, since $P(x)$ integrates to 1.
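A small sketch of the point-estimate shortcut, under stated assumptions: `P(x)` is taken to be a sharply peaked Gaussian (standing in for "good evidence"), the two actions and their slowly varying utilities are made up, and the optimizer climbs $\log P(x)$ rather than $P(x)$ itself. The log has the same maximizer and avoids numerical underflow far from the peak.

```python
import numpy as np

MEAN, STD = 2.0, 0.05  # assumed sharply peaked P(x)

def grad_log_p(x):
    """Analytic gradient of log P(x) for the assumed Gaussian."""
    return -(x - MEAN) / STD**2

# Gradient ascent on log P(x) to find the high-probability point X.
X = 0.0
learning_rate = 1e-3  # must stay below 2 * STD**2 for stability
for _ in range(200):
    X += learning_rate * grad_log_p(X)

# Assumed utilities that change slowly on the scale of STD.
utilities = {
    "a1": np.cos,
    "a2": lambda x: 0.1 * x,
}

# Cheap estimate: evaluate U_a at the single point X.
point_estimates = {a: float(u(X)) for a, u in utilities.items()}

# Reference value: Riemann sum of P(x) * U_a(x) on a fine grid.
xs = np.linspace(0.0, 4.0, 40001)
dx = xs[1] - xs[0]
p = np.exp(-0.5 * ((xs - MEAN) / STD) ** 2) / (STD * np.sqrt(2 * np.pi))
integrals = {a: float(np.sum(p * u(xs)) * dx) for a, u in utilities.items()}

print(X, point_estimates, integrals)
```

With these choices the single-point values and the integrals agree closely. Widening `STD`, or picking a utility that swings sharply near the peak, makes the two drift apart, which is the case where summing several points would help.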