Let me try to rephrase: correct FAI theory shouldn’t have dangerous ideas. If we find that the current version does have dangerous ideas, then this suggests that we are on the wrong track. The “Friendly” in “Friendly AI” should mean friendly.
Pretty much correct in this case. Roko’s original post was, in fact, wrong; correctly programmed FAIs should not be a threat.
(FAIs shouldn’t be a threat, but a theory for creating a FAI will obviously have at least the potential to be used to create uFAIs. FAI theory will have plenty of dangerous ideas.)
I want to highlight at this point how you think about similar scenarios:
I do think that TORTURE is the obvious option, and I think the main instinct behind SPECKS is scope insensitivity.
That isn’t very reassuring. I believe that if you had the choice of either letting a paperclip maximizer burn the cosmic commons or torturing 100 people, you’d choose to torture 100 people. Wouldn’t you?
...correctly programmed FAIs should not be a threat.
They are always a threat to some beings: for example, beings who oppose CEV, or other AIs. Any FAI that ran a human version of CEV would be a potential existential risk to any alien civilisation. If you accept all this possible oppression in the name of what is subjectively friendly, how can I be sure that you don’t favor torture for some humans that support CEV, in order to ensure it? After all, you already allow for the possibility that many beings will be oppressed or possibly killed.
This seems to be true and obviously so.
Narrowness. You can parry almost any statement like this, by posing a context outside its domain of applicability.
Another pointless flamewar. This part makes me curious though:
There are two ways I can interpret your statement:
a) you know a lot more about decision theory than you’ve disclosed so far (here, in the workshop and elsewhere);
b) you don’t have that advanced knowledge, but won’t accept as “correct” any decision theory that leads to unpalatable consequences like Roko’s scenario.
Which is it?
From my point of view, and as I discussed in the post (that discussion got banned along with the rest, although it’s not exactly on this topic), the problem here is the notion of “blackmail”. I don’t know how to formally distinguish it from any other kind of bargaining, and the way in which Roko’s post could be wrong, as I remember it, required this distinction to be made (it could be wrong in other ways, but those I didn’t notice at the time and don’t care to revisit).
(The actual content was edited out and posted as a top-level post.)
(I seem to have a talent for writing stuff, then deleting it, and then getting interesting replies. Okay. Let it stay as a little inference exercise for onlookers! And please nobody think that my comment contained interesting secret stuff; it was just a dumb question to Eliezer that I deleted myself, because I figured out on my own what his answer would be.)
Thanks for verbalizing the problems with “blackmail”. I’ve been thinking about these issues in the exact same way, but made no progress and never cared enough to write it up.
Perhaps the reason you are having trouble coming up with a satisfactory characterization of blackmail is that you want a definition with the consequence that it is rational to resist blackmail and therefore not rational to engage in blackmail.
Pleasant though this might be, I fear the universe is not so accommodating.
Elsewhere VN asks how to unpack the notion of a status quo, and tries to characterize blackmail as a threat which forces the recipient to accept less utility than she would have received in the status quo. I don’t see any reason in game theory why such threats should be treated any differently from other threats. But it is easy enough to define the ‘status quo’.
The status quo is the solution to a modified game—modified in such a way that the time between moves increases toward infinity and the current significance of those future moves (be they retaliations or compensations) is discounted toward zero. A player who lives in the present and doesn’t respond to delayed gratification or delayed punishment is pretty much immune to threats (and to promises).
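A minimal numerical sketch of that last point, under toy assumptions (the one-shot game, the `demand`, `harm`, and `delta` parameters, and the `victim_best_response` helper are all hypothetical, not anything from the thread): if the threatened harm only arrives later and the victim discounts that future move by a factor delta, then as delta goes toward zero refusing dominates paying, which is the sense in which a player who ignores delayed punishment is immune to threats.

```python
# Toy one-shot blackmail game, a sketch only: made-up payoffs, not from the thread.
# The blackmailer demands `demand` now; if the victim refuses, the threatened
# harm `harm` lands later and is therefore discounted by the victim's factor `delta`.

def victim_best_response(demand: float, harm: float, delta: float) -> str:
    """Best response of a victim who assumes the threat is always carried out."""
    pay_utility = -demand           # pay up now, avoid the future harm
    refuse_utility = -delta * harm  # refuse now, eat the discounted future harm
    return "pay" if pay_utility > refuse_utility else "refuse"

if __name__ == "__main__":
    demand, harm = 10.0, 100.0
    for delta in (1.0, 0.5, 0.05, 0.0):
        print(f"delta={delta}: {victim_best_response(demand, harm, delta)}")
    # As delta -> 0 (a player who "lives in the present"), the victim refuses,
    # so the threat loses its force; the same logic neutralizes delayed promises.
```

On these assumptions the victim pays only when delta exceeds demand / harm, so driving the current significance of future moves toward zero is exactly what strips threats (and promises) of their force.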
On RW it’s called Headless Chicken Mode: the community appears to go nuts for a time. It generally resolves itself once people have gotten the yelling out of their system.
The trick is not to make any decisions based on the fact that things have gone into Headless Chicken Mode. It’ll pass.
[The comment this is in reply to was innocently deleted by the poster, but not before I made this comment. However, I think I’m making a useful point here, so would prefer to keep this comment.]
This is certainly the case with regard to the kind of decision theoretic thing in Roko’s deleted post. I’m not sure if it is the case with all ideas that might come up while discussing FAI.
Wrong and stupid.
FYI, this is an excellent example of contempt.
And so it was, but not an example for other times when it wasn’t. A rare occurrence. I’m pretty sure it didn’t lead to any errors though, in this simple case.
(I wonder why Eliezer pitched in the way he did, with only weak disambiguation between the content of Tetronian’s comment and commentary on correctness of Roko’s post.)
I got the impression that you responded to “FAI Theory” as our theorizing and Eliezer responded to it as the theory making its way to the eventual FAI.
Ok...but why?
Edit: If you don’t want to say why publicly, feel free to PM me.