Agents can make themselves immune to all possible money-pumps for Completeness by acting in accordance with the following policy: ‘if I previously turned down some option X, I will not choose any option that I strictly disprefer to X.’
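For concreteness, here's a minimal sketch (mine, not from the post) of how that policy blocks the standard single-souring money pump. The option names and the setup are illustrative assumptions: A+ is strictly preferred to A, and B sits in a preferential gap with both.

```python
# Illustrative sketch: an agent with a preferential gap (B vs. A and A+)
# following the quoted policy. The options and the single-souring money
# pump are assumed for the example.

STRICT_PREF = {("A+", "A")}  # (x, y) means x is strictly preferred to y

def strictly_prefers(x, y):
    return (x, y) in STRICT_PREF

def choose(options, turned_down):
    """Choose an option not strictly dispreferred to any previously
    turned-down option, then record the options turned down here."""
    permissible = [o for o in options
                   if not any(strictly_prefers(t, o) for t in turned_down)]
    choice = permissible[0]  # take the first permissible option
    turned_down.update(o for o in options if o != choice)
    return choice

# Money pump: the agent holds A+, is offered B, then offered A.
turned_down = set()
holding = choose(["B", "A+"], turned_down)    # gap, so taking B is permissible
holding = choose(["A", holding], turned_down) # A < A+ (turned down), so A is blocked
print(holding)  # -> "B": the agent never ends up strictly worse off at A
```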
Plus some other assumptions (capable of backwards induction, knowing trades in advance), right?
I’m curious whether these assumptions are actually stronger than, or related to, completeness.
so if we care about whether agents will be (or will appear to be) expected utility maximizers, we have to care about whether they will be representable as expected utility maximizers.
Both sets (representable and not) are non-empty. The question remains which set the interesting agents are in. I think that CCT + VNM, money-pump arguments, etc. strongly hint, but do not prove, that the EU maximizers are the interesting ones.
Also, I personally don’t find the question itself particularly interesting, because it seems like one can move between these sets in a relatively shallow way (I’d be interested in seeing counterexamples, though). Perhaps that’s what Yudkowsky means by not caring about representability?
Plus some other assumptions (capable of backwards induction, knowing trades in advance), right?
Yep, that’s right!
I’m curious whether these assumptions are actually stronger than, or related to, completeness.
Since the Completeness assumption is about preferences while the backward-induction and knowing-trades-in-advance assumptions are not, they don’t seem very closely related to me. The assumption that the agent’s strict preferences are transitive is more closely related, but it’s not stronger than Completeness in the sense of implying Completeness.
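For reference, here are the two axioms in standard notation (my formulation, not quoted from anywhere in the thread), which makes it clear that transitivity leaves room for incomparable pairs:

```latex
% Standard formulations; \succsim is weak preference, \succ strict preference.
\begin{align*}
\text{Completeness: } & \forall X, Y:\ X \succsim Y \ \lor\ Y \succsim X \\
\text{Transitivity: } & \forall X, Y, Z:\ (X \succ Y \ \land\ Y \succ Z) \implies X \succ Z
\end{align*}
% Transitivity does not imply Completeness: a strict preference relation can
% be transitive while leaving some pair X, Y incomparable (a preferential gap).
```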
Can you say a bit more about what you mean by ‘interesting agents’?
From your other comment:
That is, if you try to construct / find / evolve the most powerful agent that you can, without a very precise understanding of agents / cognition / alignment, you’ll probably get something very close to an EU maximizer.
I think this could well be right. The main thought I want to argue against is more like:
Even if you initially succeed in creating a powerful agent that doesn’t maximize expected utility, VNM/CCT/money-pump arguments make it likely that this powerful agent will later become an expected utility maximizer.
I meant stronger in a loose sense: you argued that “completeness doesn’t come for free”, but it seems more like what you’ve actually shown is that not-pursuing-dominated-strategies is the thing that doesn’t come for free.
You either need a bunch of assumptions about preferences, or you need one fewer of those assumptions, plus a few other assumptions about knowing trades in advance, backward induction, and adherence to a specific policy.
And even given all these other assumptions, the proposed agent with a preferential gap seems like it’s still only epsilon-different from an actual EU maximizer. To me this looks like a strong hint that these assumptions actually do point at a core of something simple which one might call “coherence”, which I expect to show up in (all minus epsilon) advanced agents, even if there are pathological points in advanced-agent-space which don’t have these properties (and even if expected utility theory as a whole isn’t quite correct).
You either need a bunch of assumptions about preferences, or you need one fewer of those assumptions, plus a few other assumptions about knowing trades in advance, backward induction, and adherence to a specific policy.
I see. I think this is right.
the proposed agent with a preferential gap seems like it’s still only epsilon-different from an actual EU maximizer.
I agree with this too, but note that the agent with a single preferential gap is just an example. Agents can have arbitrarily many preferential gaps and still avoid pursuing dominated strategies. And agents with many preferential gaps may behave quite differently from expected utility maximizers.
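As a quick illustration (details assumed for the example, not part of the original argument): take options to be pairs of numbers, with x strictly preferred to y only under coordinatewise dominance, so most pairs sit in preferential gaps. The same policy still guarantees the agent never ends up holding something strictly dispreferred to an option it turned down.

```python
# Sketch: many preferential gaps, same policy. Options are 2-D vectors;
# x beats y only under coordinatewise dominance, so most pairs are incomparable.
import random

def strictly_prefers(x, y):
    return all(a >= b for a, b in zip(x, y)) and x != y

def run_trades(options, trades, seed=0):
    rng = random.Random(seed)
    turned_down = []
    holding = rng.choice(options)
    for _ in range(trades):
        offer = rng.choice(options)
        pair = [holding, offer]
        # Permissible: not strictly dispreferred to anything turned down,
        # nor to the other currently available option.
        permissible = [o for o in pair
                       if not any(strictly_prefers(t, o)
                                  for t in turned_down + pair)]
        choice = rng.choice(permissible)  # gaps leave multiple permissible picks
        turned_down += [o for o in pair if o != choice]
        holding = choice
    # The policy's guarantee: the final holding is never strictly
    # dispreferred to any option the agent previously turned down.
    assert not any(strictly_prefers(t, holding) for t in turned_down)
    return holding

options = [(i, j) for i in range(5) for j in range(5)]
print(run_trades(options, trades=100))
```

Because choices within gaps can be resolved either way, two such agents can make systematically different picks without either pursuing a dominated strategy, which is one way their behavior can diverge from that of any fixed expected utility maximizer.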