Houshalter comments on Open thread, Mar. 2 - Mar. 8, 2015

Houshalter 3 Mar 2015 8:35 UTC
0 points
The first condition is not true, since it gives a consistent value to any probability distribution of utilities. The second condition is not true other since the median function is not merely a transform of the mean function.

I’m not sure what the “ugly” behavior you describe is, and I bet it rests on some assumption that’s too strong. I already mentioned how inconsistent behavior can be fixed by allowing it to predetermine it’s actions.
- Kindly 3 Mar 2015 13:40 UTC
  4 points
  Parent
  You can find the Von Neumann—Morgenstern axioms for yourself. It’s hard to say whether or not they’re too strong.
  
  The problem with “allowing [the median algorithm] to predetermine its actions” is that in this case, I no longer know what the algorithm outputs in any given case. Maybe we can resolve this by considering a case when the median algorithm fails, and you can explain what your modification does to fix it. Here’s an example.
  
  Suppose I roll a single die.
  - Bet A loses you $5 on a roll of 1 or 2, but wins you $1 on a roll of 3, 4, 5, or 6.
  - Bet B loses you $5 on a roll of 5 or 6, but wins you $1 on a roll of 1, 2, 3, or 4.
  Bet A has median utility of U($1), as does bet B. However, combined they have a median utility of U(-$4).
  
  So the straightforward median algorithm pays money to buy Bet A, pays money to buy Bet B, but will then pay money to be rid of their combination.
  - Houshalter 3 Mar 2015 23:49 UTC
    1 point
    Parent
    I think I’ve found the core of our disagreement. I want an algorithm that considers all possible paths through time. It decides on a set of actions, not just for the current time step, but for all possible future time steps. It chooses such that the final probability distribution of possible outcomes, at some point in the future, is optimal according to some metric. I originally thought of median, but it can work with any arbitrary metric.
    
    This is a generalization of expected utility. The VNM axioms require an algorithm to make decisions independently and Just In Time. Whereas this method lets it consider all possible outcomes. It may be less elegant than EU, but I think it’s closer to what humans actually want.
    
    Anyway your example is wrong, even without predetermined actions. The algorithm would buy bet A, but then not buy bet B. This is because it doesn’t consider bets in isolation like EU, but considers it’s entire probability distribution of possible outcomes. Buying bet B would decrease it’s expected median utility, so it wouldn’t take it.
    What links here?
    Houshalter's comment on Open thread, Mar. 2 - Mar. 8, 2015 by MrMind (5 Mar 2015 3:55 UTC; 0 points)
    - Douglas_Knight 5 Mar 2015 18:53 UTC
      5 points
      Parent
      
      The VNM axioms require an algorithm to make decisions independently and Just In Time.
      
      No, they don’t.
      - Houshalter 7 Mar 2015 10:03 UTC
        −2 points
        Parent
        Assuming the bet has a fixed utility, then EU gives it a fixed estimate right away. Whereas my method considers it along with all other bets that it’s made or expects to make, and it’s estimate can change over time. I should have said that it’s not independent or fixed, but that is what I meant.
        Kindly 7 Mar 2015 15:11 UTC
        2 points
        Parent
        In the VNM scheme where expected utility is derived at a consequence of the axioms, the way that a bet’s utility changes over time is that its utility is not fixed. Nothing at all stops you from changing the utility you attach to a 50:50 gamble of getting a kitten versus $5 if your utility for a kitten (or for $5) changes: for example, if you get another kitten or win the lottery.
        
        Generalizing to allow the value of the bet to change when the value of the options did not change seems strange to me.
    - IlyaShpitser 5 Mar 2015 19:07 UTC
      2 points
      Parent
      
      This is a generalization of expected utility.
      
      I am lost, this is just EU in a longitudinal setting? You can average over lots of stuff. Maximizing EU is boring, it’s specifying the right distribution that’s tricky.
      - Houshalter 7 Mar 2015 9:51 UTC
        0 points
        Parent
        It’s not EU, since it can implement arbitrary algorithms to specify the desired probability distribution of outcomes. Averaging utility is only one possibility, another I mentioned was median utility.
        
        So you would take the median utility of all the possible outcomes. And then select the action (or series of actions in this case) that leads to the highest median utility.
        
        No method of specifying utilities would let EU do the same thing, but you can trivially implement EU in it, so it’s strictly more general than EU.
    - Vaniver 5 Mar 2015 14:56 UTC
      1 point
      Parent
      
      I think I’ve found the core of our disagreement. I want an algorithm that considers all possible paths through time. It decides on a set of actions, not just for the current time step, but for all possible future time steps.
      
      So, I think you might be interested in UDT. (I’m not sure what the current best reference for that is.) I think that this requires actual omniscience, and so is not a good place to look for decision algorithms.
      
      (Though I should add that typically utilities are defined over world-histories, and so any decision algorithm typically identifies classes of ‘equivalent’ actions, i.e. acknowledges that this is a thing that needs to be accepted somehow.)
      - Douglas_Knight 5 Mar 2015 18:56 UTC
        4 points
        Parent
        UDT is overkill. The idea that all future choices can be collapsed into a single choice appears in the work of von Neumann and Morgenstern, but is probably much older.
    - Kindly 4 Mar 2015 3:02 UTC
      1 point
      Parent
      Oh, I see. I didn’t take that problem into account, because it doesn’t matter for expected utility, which is additive. But you’re right that considering the entire probability distribution is the right thing to do, and under than assumption we’re forced to be transitive.
      
      The actual VNM axiom violated by median utility is independence: If you prefer X to Y, then a gamble of X vs Z is preferable to the equivalent gamble of Y vs Z. Consider the following two comparisons:
      
      Taking bet A, as above, versus the status quo.
      
      A ²⁄₃ chance of taking bet A and a ¹⁄₃ chance of losing $5, versus a ²⁄₃ chance of the status quo and a ¹⁄₃ chance of losing $5.
      
      In the first case, bet A has median utility U($1) and the status quo has U($0), so you pick bet A. In the second case, a gamble with a possibility of bet A has median utility U(-$5) and a gamble with a possibility of the status quo still has U($0), so you pick the second gamble.
      
      Of course, independence is probably the shakiest of the VNM axioms, and it wouldn’t surprise me if you’re unconvinced by it.