Probably confused noob question:
It seems like your core claim is that we can reinterpret expected-utility maximizers as expected-number-of-bits-needed-to-describe-the-world-using-M2 minimizers, for some appropriately chosen model of the world M2.
If so, then it seems like something weird is happening, because typical utility functions (e.g. "pleasure minus pain" or "paperclips") are unbounded above and below, whereas bits are bounded below, meaning a bit-minimizer is like a utility function that's bounded above: there's a best possible state the world could be in according to that bit-minimizer.
Or are we using a version of expected utility theory that says utility must be bounded above and below? (In that case, I might still ask, isn’t that in conflict with how number-of-bits is unbounded above?)
The core conceptual argument is: the higher your utility function can go, the bigger the world must be, and so the more bits it must take to describe it in its unoptimized state under M2, and so the more room there is to reduce the number of bits.
If you could only ever build 10 paperclips, then maybe it takes 100 bits to specify the unoptimized world, and 1 bit to specify the optimized world.
If you could build 10^100 paperclips, then the world must be humongous and it takes 10^101 bits to specify the unoptimized world, but still just 1 bit to specify the perfectly optimized world.
If you could build ∞ paperclips, then the world must be infinite, and it takes ∞ bits to specify the unoptimized world. Infinities are technically challenging, and John’s comment goes into more detail about how you deal with this sort of case.
For more intuition, notice that exp(x) is a bijective function from (-∞, ∞) to (0, ∞), so it goes from something unbounded on both sides to something unbounded on one side. That’s exactly what’s happening here, where utility is unbounded on both sides and gets mapped to something that is unbounded only on one side.
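If it helps to see this concretely: here's a tiny numerical sketch of the finite case, using the $P[X|M_2] \propto e^{\alpha u(X)}$ form from John's reply below. The range of paperclip counts, the utilities, and the value of $\alpha$ are just illustrative picks on my part, not anything from the original post.

```python
import numpy as np

# Toy finite world: X = number of paperclips, 0..10.
# Utilities and alpha below are illustrative choices, not from the post.
X = np.arange(11)
u = X - 5.0            # utilities run from -5 to +5
alpha = 1.0            # trade-off parameter in P[X|M2] proportional to exp(alpha * u(X))

P = np.exp(alpha * u)
P /= P.sum()           # normalizable, because the world is finite
bits = -np.log2(P)     # cost of encoding each state under M2

# Utilities go below zero, but every encoding cost is >= 0, and the
# highest-utility state is exactly the fewest-bits state:
for x, ui, b in zip(X, u, bits):
    print(f"X={x:2d}  u={ui:+.1f}  bits={b:6.2f}")

assert X[np.argmax(u)] == X[np.argmin(bits)]
```

So in the finite case, maximizing expected utility and minimizing expected encoding cost pick out the same states; the interesting part is what happens as the world grows.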
Ahh, thanks!
Awesome question! I spent about a day chewing on this exact problem.
First, if our variables are drawn from finite sets, then the problem goes away (as long as we don’t have actually-infinite utilities). If we can construct everything as limits from finite sets (as is almost always the case), then that limit should involve a sequence of world models.
The more interesting question is what that limit converges to. In general, we may end up with an improper distribution (conceptually, we have to carry around two infinities which cancel each other out). That’s fine—improper distributions happen sometimes in Bayesian probability, we usually know how to handle them.
Thanks for the reply, but I might need you to explain/dumb-down a bit more.
--I get how if the variables which describe the world can only take a finite combination of values, then the problem goes away. But this isn’t good enough because e.g. “number of paperclips” seems like something that can be arbitrarily big. Even if we suppose they can’t get infinitely big (though why suppose that?) we face problems, see below.
--What does it mean in this context to construct everything as limits from finite sets? Specifically, consider someone who is a classical hedonistic utilitarian. It seems that their utility is unbounded above and below, i.e. for any setting of the variables, there is a setting which is a zillion times better and a setting which is a zillion times worse. So how can we interpret them as minimizing the bits needed to describe the variable-settings according to some model M2? For any M2 there will be at least one minimum-bit variable-setting, which contradicts what we said earlier about every variable-setting having something which is worse and something which is better.
I’ll answer the second question, and hopefully the first will be answered in the process.
First, note that $P[X|M_2] \propto e^{\alpha u(X)}$, so arbitrarily large negative utilities aren't a problem—they get exponentiated, and yield probabilities arbitrarily close to 0. The problem is arbitrarily large positive utilities. In fact, they don't even need to be arbitrarily large, they just need to have an infinite exponential sum; e.g. if $u(X)$ is 1 for any whole number of paperclips $X$, then to normalize the probability distribution we need to divide by $\sum_{X=0}^{\infty} e^{\alpha \cdot 1} = \infty$. The solution to this is to just leave the distribution unnormalized. That's what "improper distribution" means: it's a distribution which can't be normalized, because it sums to $\infty$.
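As a quick sanity check on both halves of that claim, here's a small sketch (the value of $\alpha$ is just an illustrative pick):

```python
import numpy as np

alpha = 1.0   # illustrative value

# Arbitrarily negative utilities are harmless: exp just pushes them toward 0.
print(np.exp(alpha * -1000.0))        # ~0.0, still a perfectly good weight

# But with u(X) = 1 for every whole number of paperclips X, the normalizer
# sum_{X>=0} exp(alpha * 1) has no finite value -- partial sums just grow:
for n in (10, 1_000, 100_000):
    partial_Z = sum(np.exp(alpha * 1.0) for _ in range(n))
    print(n, partial_Z)

# So we keep the unnormalized weights exp(alpha * u(X)) and only ever
# compare states through ratios, i.e. through differences in bits.
```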
The main question here seems to be “ok, but what does an improper distribution mean in terms of bits needed to encode X?”. Basically, we need infinitely many bits in order to encode X, using this distribution. But it’s “not the same infinity” for each X-value—not in the sense of “set of reals is bigger than the set of integers”, but in the sense of “we constructed these infinities from a limit so one can be subtracted from the other”. Every X value requires infinitely many bits, but one X-value may require 2 bits more than another, or 3 bits less than another, in such a way that all these comparisons are consistent. By leaving the distribution unnormalized, we’re effectively picking a “reference point” for our infinity, and then keeping track of how many more or fewer bits each X-value needs, compared to the reference point.
In the case of the paperclip example, we could have a sequence of utilities $u_n(X)$ which each assigns utility $X$ to any number of paperclips $X < n$ (i.e. 1 util per clip, up to $n$ clips), and then we take the limit $n \to \infty$. Then our $n$-th unnormalized distribution is $P_{\text{unnorm}}[X|M_n] = e^{\alpha X} \, I[X < n]$, and the normalizing constant is $Z_n = \frac{1 - e^{\alpha n}}{1 - e^{\alpha}}$, which grows like $O(e^{\alpha n})$ as $n \to \infty$. The number of bits required to encode a particular value $X < n$ is

$$-\log \frac{P_{\text{unnorm}}[X|M_n]}{Z_n} = \log \frac{1 - e^{\alpha n}}{1 - e^{\alpha}} - \alpha X$$

Key thing to notice: the first term, $\log \frac{1 - e^{\alpha n}}{1 - e^{\alpha}}$, is the part which goes to $\infty$ with $n$, and it does not depend on $X$. So, we can take that term to be our "reference point", and measure the number of bits required for any particular $X$ relative to that reference point. That's exactly what we're implicitly doing if we don't normalize the distribution: ignoring normalization, we compute the number of bits required to encode $X$ as

$$-\log P_{\text{unnorm}}[X|M_n] = -\alpha X$$
… which is exactly the “adjustment” from our reference point.
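To check this numerically, here's a small sketch of the paperclip example (again with an arbitrary illustrative $\alpha$): the absolute code lengths grow with $n$, but the gap between any two $X$-values is fixed and matches the unnormalized computation.

```python
import numpy as np

alpha = 0.2   # illustrative value

def Z(n):
    # normalizing constant for P_unnorm[X|M_n] = exp(alpha*X) * I[X < n]
    return (1 - np.exp(alpha * n)) / (1 - np.exp(alpha))

def code_length(X, n):
    # -log( P_unnorm[X|M_n] / Z_n ) = log(Z_n) - alpha*X   (in nats)
    return np.log(Z(n)) - alpha * X

X1, X2 = 3, 10
for n in (20, 50, 100):
    # absolute code lengths blow up with n, but the gap between two
    # X-values doesn't depend on n at all:
    gap = code_length(X1, n) - code_length(X2, n)
    print(f"n={n:3d}  len(X1)={code_length(X1, n):7.2f}  gap={gap:.3f}")

# ...and that gap is exactly what the unnormalized -log P_unnorm gives:
print("unnormalized gap:", alpha * (X2 - X1))
```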
(Side note: this is exactly how information theory handles continuous distributions. An infinite number of bits is required to encode a real number, so we pull out a term $\log dx$ which diverges in the limit $dx \to 0$, and we measure everything relative to that. Equivalently, we measure the number of bits required to encode up to precision $dx$, and as long as the distribution is smooth and $dx$ is small, the number of bits required to encode the rest of $x$ using the distribution won't depend on the value of $x$.)
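A minimal sketch of that side note, using a standard normal density purely for illustration:

```python
import numpy as np

def gauss_pdf(x):
    # standard normal density; any smooth density would do here
    return np.exp(-0.5 * x**2) / np.sqrt(2 * np.pi)

def bits_to_precision(x, dx):
    # cost of encoding x to precision dx: -log2(p(x) * dx)
    #   = -log2 p(x) - log2 dx
    # The -log2(dx) piece diverges as dx -> 0 but is the same for every x,
    # so it plays the same "reference point" role as log(Z_n) above.
    return -np.log2(gauss_pdf(x) * dx)

x1, x2 = 0.0, 1.0
for dx in (1e-2, 1e-4, 1e-6):
    b1, b2 = bits_to_precision(x1, dx), bits_to_precision(x2, dx)
    print(f"dx={dx:.0e}  bits(x1)={b1:6.2f}  diff={b1 - b2:+.3f}")
```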
Does this make sense? Should I give a different example/use more English?