People have been using CEV to refer to both “Personal CEV” and “Global CEV” for a long time—e.g., in the 2013 MIRI paper “Ideal Advisor Theories and Personal CEV.”
I don’t know of any cases of Eliezer using “CEV” in a way that’s clearly inclusive of “Personal” CEV; he generally seems to be building into the notion of “coherence” the idea of coherence between different people. On the other hand, it seems a bit arbitrary to say that something should count as CEV if two human beings are involved, but shouldn’t count as CEV if one human being is involved, given that human individuals aren’t perfectly rational, integrated, unitary agents. (And if two humans is too few, it’s hard to say how many humans should be required before it’s “really” CEV.)
Eliezer’s original CEV paper did on one occasion use “coherence” to refer to intra-agent conflicts:
When people know enough, are smart enough, experienced enough, wise enough, that their volitions are not so incoherent with their decisions, their direct vote could determine their volition. If you look closely at the reason why direct voting is a bad idea, it’s that people’s decisions are incoherent with their volitions.
See also Eliezer’s CEV Arbital article:
(This isn’t an unrealistic example. Numerous experiments in behavioral economics demonstrate exactly this sort of circular preference. For instance, you can arrange 3 items such that each pair of them brings a different salient quality into focus for comparison.)
One may worry that we couldn’t ‘coherently extrapolate the volition’ of somebody with these pizza preferences, since these local choices obviously aren’t consistent with any coherent utility function. But how could we help somebody with a pizza preference like this?
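To make the “not consistent with any coherent utility function” point concrete, here is a small illustrative sketch (the item names and the particular cycle are placeholders, not taken from Eliezer’s text): it checks every possible strict ranking of three items and finds that none of them is compatible with the cyclic choices A over B, B over C, C over A.

```python
from itertools import permutations

items = ["A", "B", "C"]

# Cyclic pairwise choices: A over B, B over C, C over A.
prefers = {("A", "B"), ("B", "C"), ("C", "A")}

def consistent(utility):
    """True if every choice of x over y is matched by utility[x] > utility[y]."""
    return all(utility[x] > utility[y] for x, y in prefers)

# Any real-valued utility function over a finite set induces a ranking of the
# items, so it is enough to check every strict ranking of the three items.
rankings = (
    {item: rank for rank, item in enumerate(order)}
    for order in permutations(items)
)

print(any(consistent(r) for r in rankings))  # False: no utility assignment fits the cycle
```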
I think that absent more arguing about why this is a bad idea, I’ll probably go on using “CEV” to refer to several different things, mostly relying on context to make it clear which version of “CEV” I’m talking about, and using “Personal CEV” or “Global CEV” when it’s really essential to disambiguate.
On the other hand, it seems a bit arbitrary to say that something should count as CEV if two human beings are involved, but shouldn’t count as CEV if one human being is involved, given that human individuals aren’t perfectly rational, integrated, unitary agents. (And if two humans is too few, it’s hard to say how many humans should be required before it’s “really” CEV.)
Conversely, it seems odd to me to select / construct our terminology on the basis of questionable—and, more importantly, controversial—frameworks/views like the idea that it makes sense to view a human as some sort of multiplicity of agents.
The standard, “naive” view is that 1 person = 1 agent. I don’t see any reason not to say, nor anything odd about saying, that the concept of “CEV” applies when, and only when, we’re talking about two or more people. One person: personal extrapolated volition. Two people, three people, twelve million people, etc.: coherent extrapolated volition.
Anything other than this, I’d call “arbitrary”.
You could think of CEV applied to a single unitary agent as a special case where achieving coherence is trivial. It’s an edge case where the problem becomes easier, rather than an edge case where the concepts threaten to break.
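One way to picture this is a toy sketch in which CEV is “extrapolate each person’s volition, then reconcile the results into something coherent”; with a single unitary agent the reconciliation step simply has nothing to do. The extrapolate and reconcile functions below are hypothetical placeholders, not a claim about how either step would actually work.

```python
# Toy sketch only: extrapolate() and reconcile() are hypothetical placeholders,
# not a real proposal for how either step would be implemented.

def extrapolate(volition: str) -> str:
    # Stand-in for "what this person would want if they knew more, thought faster, etc."
    return f"idealized({volition})"

def reconcile(extrapolated: list[str]) -> list[str]:
    # Stand-in for resolving conflicts between different people's extrapolated volitions.
    if len(extrapolated) <= 1:
        return extrapolated            # one unitary agent: coherence is trivially satisfied
    return sorted(set(extrapolated))   # placeholder for some genuine conflict-resolution step

def cev(volitions: list[str]) -> list[str]:
    return reconcile([extrapolate(v) for v in volitions])

print(cev(["Alice's volition"]))                    # reconciliation step has nothing to do
print(cev(["Alice's volition", "Bob's volition"]))  # here it actually has work to do
```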
This terminology does make it harder, though, to talk about several agents who each separately have their own extrapolated volition (as you were trying to do in your original comment in this thread). And replacing it with Personal Extrapolated Volition only helps a little if we also want to talk about several separate groups that each have their own within-group extrapolated volition (coherent within each group but not between groups).
Yes, as you noted, I used “personal extrapolated volition” because the use case called for it. It seems to me that the existence of use cases that call for a term (in order to have clarity) is, in fact, the reason to have that term.
If it were up to me, I’d use “CEV” to refer to the proposal Eliezer calls “CEV” in his original article (which I think could be cashed out either in a way where applying the concept to subselves makes sense or in a way where that does not make sense), use “extrapolated volition” to refer to the more general class of algorithms that extrapolate people’s volitions, and use something like “true preferences” or “ideal preferences” or “preferences on reflection” when the algorithm for finding those preferences isn’t important, like in the OP.
If I’m not mistaken, “CEV” originally stood for “Collective Extrapolated Volition”, but then Eliezer changed the name when people interpreted it in more of a “tyranny of the majority” way than he intended.