M. Y. Zuo comments on Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

M. Y. Zuo 1 Oct 2022 0:28 UTC
1 point
0
They approach the Pareto frontier, but never in fact reach it, from what I understand. So wouldn’t this just move the problem into the meta level? What’s stopping the Players from going through the threat escalation spiral to force their ideal ROSE point?
Sure, these may be very tiny slices of utility in contention, but it only takes one obstinate Player threatening all utility to ensure they lose nothing.
- Diffractor 1 Oct 2022 2:24 UTC
  2 points
  0
  What I mean is, the players hit each other up and are like “yo, let’s decide on which ROSE point in the object-level game we’re heading towards.”
  Of course, they don’t manage to settle on what equilibrium to go for in the resulting bargaining game, because, again, multiple ROSE points might show up in the bargaining game.
  But, the ROSE points in the bargaining game are in a restricted enough zone (what with that whole “must be better than the random-dictator point” thing) to seriously constrain the possibilities in the object-level game. “Worst ROSE point for Alice in the object-level-game” is a whole lot worse for her than “Worst ROSE point for Alice in the bargaining game about what to do in the object-level-game”.
  
  So, the players should be able to ratchet up their disagreement point and go “even if the next round of bargaining fails, at least we can promise that everyone does this well, right? Sure, everyone’s going for their own idea of fairness, but even if Alice ends up with her worst ROSE point in the bargaining game, her utility is going to be at least this high, and similar for everyone else.”
  
  And so, each step of bargaining that successfully happens ratchets the disagreement point up closer to the Pareto frontier, in a way that should quickly converge. If someone disagrees on step 3, then the step-3 disagreement point gets played, which isn’t that far short of the Pareto frontier. And if someone doesn’t have time for all this bargaining, they can just break things off at step 10 or something, since that’s just about as good as going all the way to infinity.
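
  A toy sketch of that ratcheting, to make the "breaking off early is almost as good" point concrete. This is not ROSE bargaining itself: it simply assumes that every successful round closes a fixed fraction (`shrink`, a made-up parameter) of each player's remaining gap to some target point on the Pareto frontier. The function names, the target point, and the geometric rate are all hypothetical, and nothing here is proved about the real process.

```python
def ratchet(disagreement, frontier_target, shrink=0.5, steps=10):
    """Return the disagreement point after each successful bargaining round.

    ASSUMPTION (not from the thread): each round closes a fixed fraction
    `shrink` of every player's remaining gap to `frontier_target`.
    """
    current = list(disagreement)
    points = [tuple(current)]  # step 0 is the original disagreement point
    for _ in range(steps):
        current = [d + shrink * (t - d) for d, t in zip(current, frontier_target)]
        points.append(tuple(current))
    return points


def shortfall(point, frontier_target):
    """Largest utility any single player is still missing at `point`."""
    return max(t - p for p, t in zip(point, frontier_target))


if __name__ == "__main__":
    target = (100.0, 100.0)
    points = ratchet((0.0, 0.0), target, shrink=0.5, steps=10)
    for step in (0, 3, 10):
        print(step, points[step], shortfall(points[step], target))
```

  Under these assumptions, a breakdown at step 3 leaves each player 12.5 short of the target out of an initial gap of 100, and stopping at step 10 leaves them under 0.1 short. Whether real ROSE points shrink the contested region at anything like a fixed rate is exactly the unproven part.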
  
  Or at least, it should work like this. I haven’t proved that it does, and it depends on things like “what does ROSE bargaining look like for n players” and “does the random-dictator-point-dominance thing still hold in the n-player case” and “what’s the analogue of that strategy where you block your foe from getting more than X utility, when there are multiple foes?”. But this disagreement-point ratcheting is a strategy that addresses your worries with “ever-smaller pieces of the problem live on higher meta-levels, so the process of adding layers of meta actually converges to solving the problem, and breaking it off early solves most of the problem.”
  
  Regarding your last comment, yes, you could always just have a foe that’s a jerk, but you can at least act so they don’t gain from being a jerk, in a way that’s robust against you and a foe having slightly different definitions of “jerk”.
  - M. Y. Zuo 1 Oct 2022 14:52 UTC
    1 point
    0
    Regarding your last comment, yes, you could always just have a foe that’s a jerk, but you can at least act so they don’t gain from being a jerk, in a way that’s robust against you and a foe having slightly different definitions of “jerk”.
    How would someone ensure that the jerk does not gain from their threats regarding the choice of ROSE points, assuming Players cannot exit the game?
    - Diffractor 1 Oct 2022 16:28 UTC
      2 points
      0
      With this.
      - M. Y. Zuo 1 Oct 2022 16:53 UTC
        1 point
        0
        Reading Clarifications 1, 2, and 3 seems to imply that it would not be useful or applicable in this scenario.
        Clarification 1: Yes, utilities are invariant up to a positive affine transformation so there’s no canonical way to split utilities evenly. Hence the part about “Assume a magical solution N which gives us the fair division.” If we knew the exact properties of how to implement this magical solution, taking it at first for magical, that might give us some idea of what N should be, too.
        Clarification 2: The way this might work is that you pick a series of increasingly unfair-to-you, increasingly worse-for-the-other-player outcomes whose first element is what you deem the fair Pareto outcome: (100, 100), (98, 99), (96, 98). Perhaps stop well short of Nash if the skew becomes too extreme. Drop to Nash as the last resort. The other agent does the same, starting with their own ideal of fairness on the Pareto boundary. Unless one of you has a completely skewed idea of fairness, you should be able to meet somewhere in the middle. Both of you will do worse against a fixed opponent’s strategy by unilaterally adopting more self-favoring ideas of fairness. Both of you will do worse in expectation against potentially exploitive opponents by unilaterally adopting looser ideas of fairness. This gives everyone an incentive to obey the Galactic Schelling Point and be fair about it. You should not be picking the descending sequence in an agent-dependent way that incentivizes, at cost to you, skewed claims about fairness.
        Clarification 3: You must take into account the other agent’s costs and other opportunities when ensuring that the net outcome, in terms of final utilities, is worse for them than the reward offered for ‘fair’ cooperation. Offering them the chance to buy half as many paperclips at a lower, less fair price, does no good if they can go next door, get the same offer again, and buy the same number of paperclips at a lower total price.
        Can you show how this would lead to a ‘jerk’ never gaining in the aforementioned scenario?