Assuming all of the following are possible: what would happen if every person had a superintelligent AI whose utility function was that person’s idealized extrapolated utility function?
One crazy nihilist with a destructive utility function would ruin the whole thing, by building a nuke or something. Offense wins decisively over defense.
Is it likely there would be a cooperative equilibrium among unmerged AIs?
Only if they were filtered to add restrictions or remove certain types of utility functions. And probably not even then, since AIs with evil utility functions could crop up randomly in that environment, from botched self-modifications or damage.
How would that compare to a scenario with a single AI embodying a successful calculation of CEV?
A single AI would be much better, since it could resolve all prisoners’ dilemmas, coordination games, and ultimatum games in a way that’s optimal, rather than merely Pareto efficient.
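To make the optimal-versus-Pareto-efficient distinction concrete, here is a minimal sketch in Python (a toy example of my own, not part of the original exchange): in a one-shot prisoner’s dilemma with made-up payoffs, several outcomes are Pareto efficient, but only one maximizes joint utility, where "optimal" is assumed, for illustration only, to mean the largest summed payoff.

```python
# Toy sketch (illustrative, not from the original exchange): in a one-shot
# prisoner's dilemma, several outcomes are Pareto efficient, but only one
# maximizes joint utility. "Optimal" is taken here to mean maximum summed
# utility, which is an assumption made for this example.

# Payoffs as (row player, column player) for each pair of moves.
payoffs = {
    ("cooperate", "cooperate"): (3, 3),
    ("cooperate", "defect"):    (0, 5),
    ("defect",    "cooperate"): (5, 0),
    ("defect",    "defect"):    (1, 1),
}

def is_pareto_efficient(outcome):
    """True if no other outcome makes one player better off without hurting the other."""
    a, b = payoffs[outcome]
    return not any(
        x >= a and y >= b and (x, y) != (a, b)
        for o, (x, y) in payoffs.items() if o != outcome
    )

pareto = [o for o in payoffs if is_pareto_efficient(o)]
optimal = max(payoffs, key=lambda o: sum(payoffs[o]))

print("Pareto efficient outcomes:", pareto)   # three of the four outcomes
print("Joint-utility maximizer:", optimal)    # only (cooperate, cooperate)
```

Three of the four outcomes are Pareto efficient, but only mutual cooperation maximizes the sum; separately bargaining AIs could end up at any of the efficient points, whereas a single AI weighing both utility functions can pick out the joint maximizer directly.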
Is it possible to create multiple AIs such that one AI does not prevent others from being created, such as by releasing equally powerful AIs simultaneously?
Releasing equally powerful AIs simultaneously is very risky, because it gives them an incentive to rush their self-improvements through, rather than take their time to check them for errors. Also, one of the AIs would probably succeed in destroying the others; cybersecurity so far has been a decisive win for offense.
What would be different if one person, or a few people, did not have a superintelligence valuing what they would value, so that many people, but not everyone, had their own AI?
Most people’s utility functions include some empathy, which would partly cover for the many people excluded from counting directly. However, if a person doesn’t have a superintelligence valuing what they would value, then some of their values will be excluded if no one else approves of them. This is mostly a good thing, since the values excluded this way would probably be destructive ones. However, people who were not included directly would lose out in any contentions over scarce resources, which could become a serious problem for them if resources do become scarce.
A more convenient possible world was alluded to when I asked about excluding some individuals.
“equilibrium”
“Only if”
No merging?
“A single AI would be much better”
Maybe, but I had also asked about the relative difficulty of calculating CEV and DEV. If DEV is easier (perhaps possible rather than impossible), that’s an advantage for it.
“one of the AIs would probably succeed in destroying the others; cybersecurity so far has been a decisive win for offense.”
War is a risk; it includes the possibility of mutual destruction, particularly if offense is more powerful. You don’t think they’d merge resources and values instead of risking it?
“empathy...lose out in any contentions over scarce resources”
That’s the most likely scenario, I agree, but still less than probable.
“War is a risk; it includes the possibility of mutual destruction, particularly if offense is more powerful. You don’t think they’d merge resources and values instead of risking it?”
Cyberwar is different from regular war in that all competently performed attacks are inherently anonymous. Attacks performed very competently are also undetectable. This is very destabilizing. And it gets worse: while AIs might try to get around this by all merging together, none of them would be able to prove they hadn’t hidden a copy of themselves somewhere.
I don’t think undetectability solves things. Offensive subsystems could survive their creator’s demise, like two people in a grenade-lobbing fight.
Suppose they all hid a copy: the merged AI would still be more powerful than any hidden copy, and if it were destroyed, everyone would be back to being a small copy again. If there were many AIs, an individual attacker would be banking on its ability to defeat a much larger entity. Offense is more powerful at most scales and technological levels, but not by incomprehensible orders of magnitude.