Bob wants the AI to create as close an approximation to hell as possible and throw you into it forever, because he is a fundamentalist Christian.
Are you sure you want Bob to get what he wants?
Most fundamentalist Christians, although they believe that there is a hell and that people like me are destined for it, and although they want their religion to be right, probably would not want an approximation of their religion created conditional on it not already being right. An AI cannot make Bob right.
That being said, there are probably some people who would want me thrown into hell anyway, even if the religion stipulating that fate were not right in the first place. So I should amend my statement: I want people to get what they want in ways that do not conflict, or conflict only minimally, with what other people want. Also, the possibility that there are a great many people like Bob (as I said, I’m not sure how many fundamentalists would want their religion made true even if it isn’t) is a very good reason not to use the average human utility function for the CEV. As you said, I do not want Bob to get what he wants, and I suspect that you don’t either. So why would you want to create an FAI with a CEV that is inclined to accommodate Bob’s wish, which greatly conflicts with what other people want, if that wish proves especially popular?
CEV doesn’t just average people’s wishes. It extrapolates what people would want if they were better informed. Even if Bob wants to create a hell right now, his extrapolated volition may be for something else.
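To make that distinction concrete, here is a minimal sketch in Python. Everything in it is hypothetical: the `extrapolated_volition` function is a made-up stand-in for the hard, unspecified step of working out what someone would want if they were better informed; it is not how CEV is actually defined. The only point is that the wish an averaging process would count can differ from the wish an extrapolating process would count.

```python
# Toy illustration only: what someone wants *now* and what they would
# want *if better informed* can differ.  The rule below is a made-up
# stand-in for extrapolation, not CEV itself.

def extrapolated_volition(current_wish, better_informed_beliefs):
    """Hypothetical: return what the person would want given corrected
    beliefs, rather than their current wish."""
    if (current_wish == "create hell for unbelievers"
            and not better_informed_beliefs["my religion is literally true"]):
        # With the factual premise removed, the wish may not survive.
        return "seek beliefs that are actually true"
    return current_wish

bob_now = "create hell for unbelievers"
bob_extrapolated = extrapolated_volition(
    bob_now, {"my religion is literally true": False})

print(bob_now)           # what averaging raw wishes would count
print(bob_extrapolated)  # what an extrapolate-first process would count
```

Whether real extrapolation would actually change Bob’s wish is exactly the open question; the sketch only shows where in the pipeline that change would have to happen.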
I wouldn’t.
Well, I suppose we can reliably expect that there are not enough people like Bob, and that my getting tortured removes much more utility from me than it gives Bob, but that’s missing the point.
Imagine yourself in a world in which the vast majority of people want to subject a certain minority group to eternal torture. The majority wanting that is so vast that an FAI whose CEV is based on the average human utility function would be likely to subject the members of that minority group to eternal torture. You can create an FAI with a CEV based on the average human utility function, one based on your personal utility function, or no FAI at all. What do you do?
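(A back-of-the-envelope sketch of why the averaging FAI in that scenario would approve: the numbers below are entirely made up and serve only to show that billions of small gains can swamp a few enormous losses when you average.)

```python
# Made-up numbers: a straight average of utilities can endorse
# torturing a small minority once the majority is large enough.

majority_size   = 7_000_000_000   # people who each gain a little from the torture
gain_per_person = 1.0             # small utility gain for each of them
minority_size   = 1_000           # victims
loss_per_victim = 1_000_000.0     # enormous disutility for each victim

total_utility = majority_size * gain_per_person - minority_size * loss_per_victim
average_utility = total_utility / (majority_size + minority_size)

# A positive average means the averaging FAI "approves" of the torture,
# because billions of tiny gains outweigh a thousand enormous losses.
print(average_utility > 0)   # True with these numbers
```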
With my personal utility function, of course, which would, by my definition of the term “right”, always do the right thing.
Silly me, I thought that we were arguing about whether using a personal utility function is a better substitute, and I was rather confused at what appeared to be a sudden concession. Looking at the comments above, I notice that you in fact only disputed my claim that the results would be very similar.
I want Bob to think he gets what he wants.