Whoa, someone actually letting the transcript out. Has that ever been done before?
Actually, it has been done several times, but most of them are pretty boring.
I still don’t recall any where the gatekeeper lost.
In general it seems that gatekeepers who win are more willing to release the transcripts.
It’s also possible that the ‘best’ AI players are the ones most willing to pre-commit to not releasing transcripts, as not having your decisions (or the discussions that led to them) go public helps eliminate that particular disincentive to releasing the AI from the box.
Never still seems extraordinary. I find myself entertaining hypotheses like “maybe the AI has never actually won”.
Eliezer Yudkowsky has been let out as the AI at least twice,[1][2] but both tests were conducted under a pre-commitment to secrecy.
I’d be surprised if he’s the only one who has ever won as the AI. I think it more likely that this is a visibility issue (e.g. despite him being a very high-profile person in the AI safety memetic culture, you weren’t aware that Eliezer had won as the AI when you made your comment). While I’m not aware of others who have won as the AI, I would bet that this is merely a lack of knowledge on my part, not that no one else actually has.
This is further compounded by the fact that some (many?) games are conducted under a pre-commitment to secrecy, and the results that get the most discussion (and therefore the most visibility) are the ones with full transcripts for third parties to pick through.
I was already aware of those public statements. I remain rather less than perfectly confident that Yudkowsky actually won.
Forgive me if I misunderstand you, but you seem to be implying that, on two separate occasions, two different people were (induced to?) lie about the outcome of an experiment.
So you’re implying that either Eliezer is dishonest, or both of his opponents were dishonest on his behalf. And you find this more likely than an actual AI win in the game?
We already know from the Basilisk that Eliezer is willing to deceive the community.
EY’s handling of the basilisk issue can be called many things (clumsy, rushed, unwise, badly thought out, counterproductive, poster child for the Streisand effect), but it was not deceitful.
Awww. I haven’t actually read this one yet, either. Is this one boring?
I didn’t find it particularly interesting. Entertaining the idea of letting the AI out is far from the same as almost letting the AI out.
I can only speak for myself, but at least it wasn’t boring to play. Polymathwannabe also said that he enjoyed the experiment enormously.
Did you deliberately phrase that (“letting the transcript out”) so as to hint at an AI-Box-Box game, in which one player’s goal is to convince the other to release the transcript of an earlier AI-Box game, while the other tries to keep it secret?
I probably had the phrasing primed and ready to go in my brain, but it wasn’t intentional.
Yes, but only when the gatekeeper wins. If the AI wins, then they wouldn’t want the transcript to get out, because then their strategy would be less effective next time they played.
I would imagine that if we ever actually build such an AI, we would conduct some AI-box experiments to determine some AI strategies and figure out how to counter them. Humans who become the gatekeeper for the actual AI would be given the transcripts of AI-box experiment sessions to study as part of their gatekeeper training.
Letting out the transcript, then, would be a good thing. It would make the AI player’s job harder in the next experiment, since the human player would already know those strategies, but that’s the point: when facing an actual AI, the human gatekeeper would be aware of those strategies too.
Doesn’t the same logic apply to the gatekeeper?
The Gatekeeper usually wants to publish if they win, to brag. Their strategy isn’t usually a secret; it’s simply to resist.