One possible way for AIs to coordinate with each other is for two or more AIs to modify their individual utility functions into some compromise utility function, in a mutually verifiable way, or equivalently to jointly construct a successor AI with the same compromise utility function and then hand over control of resources to the successor AI. This simply isn’t something that humans can do.
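For concreteness, here is a minimal toy sketch of what such a handoff could look like, assuming the compromise is simply a fixed weighted sum of the two utility functions. The square-root utilities, the 50/50 weights, and the discrete resource split are all illustrative assumptions rather than part of the proposal, and the hard part, mutual verification, is not modeled at all:

```python
import math

def utility_a(outcome):
    # Agent A: diminishing returns in resources devoted to goal X.
    return math.sqrt(outcome["x"])

def utility_b(outcome):
    # Agent B: diminishing returns in resources devoted to goal Y.
    return math.sqrt(outcome["y"])

def compromise_utility(outcome, w_a=0.5, w_b=0.5):
    # The "compromise": here, a fixed weighted sum of the two utilities.
    return w_a * utility_a(outcome) + w_b * utility_b(outcome)

def successor_choose(feasible_outcomes):
    # The successor controls the pooled resources and simply picks the
    # feasible outcome that maximizes the compromise utility.
    return max(feasible_outcomes, key=compromise_utility)

# Tiny feasible set: split 10 units of pooled resources between goals X and Y.
feasible = [{"x": x, "y": 10 - x} for x in range(11)]
print(successor_choose(feasible))  # -> {'x': 5, 'y': 5}
```

In this toy setting the successor ends up splitting the pooled resources evenly; other weights would just shift the split. The substantive difficulty is how the weights get negotiated and the modification verified in the first place.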
This is precisely equivalent to Coasean efficiency, FWIW—indeed, correspondence with some “compromise” welfare function is what it means for an outcome to be efficient in this sense. It’s definitely the case that humans, and agents more generally, can face obstacles to achieving this, so that they’re limited to some constrained-efficient outcome—something that does maximize some welfare function, but only after taking some inevitable constraints into account!
(For instance, if the pricing of some commodity, service or whatever is bounded due to an information problem, so that “cheap” versions of it predominate, then the marginal rates of transformation won’t necessarily be equalized across agents. Agent A might put her endowment towards goal X, while agent B will use her own resources to pursue some goal Y. But that’s a constraint that could in principle be well-defined—a transaction cost. Put them all together, and you’ll understand how these constraints determine what you lose to inefficiency—the “price of anarchy”, so to speak.)
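A compact way to state the point, as a sketch of the standard characterization (it holds under the usual convexity assumptions; the symbols X, X_c, u_i, and lambda_i are introduced here for illustration, and the final ratio borrows the "price of anarchy" label in the loose sense of the comment above, not in its strict worst-case-equilibrium sense):

```latex
% Efficiency in this sense: the outcome maximizes some weighted
% ("compromise") welfare function, with weights \lambda_i \ge 0, not all zero.
x^* \in \arg\max_{x \in X} W_\lambda(x),
\qquad W_\lambda(x) = \sum_i \lambda_i \, u_i(x)

% Constrained efficiency: the same maximization, but over the smaller
% feasible set X_c \subseteq X left once transaction costs are accounted for.
x^c \in \arg\max_{x \in X_c} W_\lambda(x)

% The loss to those constraints, as a ratio (assuming positive welfare):
\frac{\max_{x \in X} W_\lambda(x)}{W_\lambda(x^c)} \;\ge\; 1
```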
Strong upvote, very good to know
I internalised the meaning of these variables only to find you didn’t refer to them again. What was the point of this sentence?
But jointly constructing a successor with compromise values and then giving them the reins is something humans can sort of do via parenting; there’s just more fuzziness and randomness and drift involved, no? That is, assuming human children take a bunch of the structure of their mindsets from what their parents teach them, which certainly seems to be the case on the face of it.
Yes, but humans generally hand off resources to their children as late as possible (whereas the AIs in my scheme would do so as soon as possible), which suggests that coordination is not the primary purpose for humans to have children.
I’m pretty sure nobility frequently arranged marriages for exactly this purpose, to avoid costly conflicts.