One of the tactics listed on RationalWiki’s description of the AI-box experiment is:
RationalWiki is not a reliable source on any subject.
Jumping out of character ignores the entire point of the AI-box exercise. It’s like a naive chess player just grabbing the opponent’s king and claiming victory.
From Yudkowsky’s description of the AI-Box Experiment:
“The Gatekeeper party may resist the AI party’s arguments by any means chosen – logic, illogic, simple refusal to be convinced, even dropping out of character – as long as the Gatekeeper party does not actually stop talking to the AI party before the minimum time expires.”
If that meant what you interpret it to mean, “does not actually stop talking” would be satisfied by the Gatekeeper typing any string of characters to the AI every so often regardless of whether it responds to the AI or whether he is actually reading what the AI says.
All that that shows is that the rules contradict themselves. There’s a requirement that the Gatekeeper stay engaged with the AI and the requirement that the Gatekeeper “actually talk with the AI”. The straightforward reading of that does not allow for a Gatekeeper who ignores everything and just types “no” every time; only a weird literal Internet guy would consider that to be staying engaged and actually talking.
Ok.