GMHowe comments on I tried my hardest to win in an AI box experiment, and I failed. Here are the logs.

GMHowe 28 Jan 2015 7:41 UTC
14 points
That’s not really in the spirit of the experiment. For the AI to win the gatekeeper must explicitly release the AI. If the gatekeeper fails to abide by the rules that merely invalidates the experiment.
- passive_fist 28 Jan 2015 10:47 UTC
  3 points
  Parent
  Not just that, it’s a futile strategy, cause you just encourage them to look away from the monitor and do nothing for 2 hours (which is entirely fair game).
  - dxu 28 Jan 2015 16:22 UTC
    1 point
    Parent
    Doing nothing for two hours is hard as heck for typical people. And if I recall correctly, although the rules allow the Gatekeeper to not engage with the AI, they don’t allow the Gatekeeper to do something else. What’s your prior for the Gatekeeper getting bored of not doing anything and reengaging the AI?
- wobster109 28 Jan 2015 23:50 UTC
  0 points
  Parent
  In Tuxedage’s rule set, if the gatekeeper leaves before 2 hours, it counts as an AI win. So it’s a viable strategy. However ---
  
  I am sure that it would work against some opponents, but my feeling is it would not work against people on Less Wrong. It was a good try though.
  - GMHowe 29 Jan 2015 1:29 UTC
    4 points
    Parent
    I was not aware of Tuxedage’s ruleset. However any ruleset that allows for the AI to win without being explicitly released by the gatekeeper is problematic.
    
    If asd had won due to the gatekeeper leaving it would only have demonstrated that being unpleasant can cause people to disengage from conversation, which is different from demonstrating that it is possible to convince a person to release a potentially dangerous AI.
    - wobster109 31 Jan 2015 2:23 UTC
      0 points
      Parent
      I kind of agree upon reflection. Tuxedage’s ruleset seems tailored for games where there is money on the line, and in that case it feels very unfair to say GK can leave right away. GK would be heavily incentivized to leave immediately, since that would get GK’s charity a guaranteed donation.
  - Nornagest 29 Jan 2015 1:41 UTC
    2 points
    Parent
    The more natural option seems to be to treat that as a draw. The AI’s not getting out if you leave the conversation, but there’s not much point in going to the trouble of building an AI if you’re not going to talk to it.
  - SilentCal 29 Jan 2015 20:18 UTC
    0 points
    Parent
    I’ve always thought the gatekeeper should have a ‘shutdown’ option that results in both the gatekeeper and the AI losing money (but less loss for the gatekeeper than releasing). That should make verbal abuse strategies a good deal harder.