Someone pointed out that this only seems to work if the screenshots include the “ChatGPT” speaker tag; if you only screenshot the text of ChatGPT’s most recent response without the label indicating it is from ChatGPT, it seems to fail. Oddly, in one of my tests, it seemed to recognize its own text on the first time I sent it a screenshot, but then didn’t manage to figure out what to do next (nor did it mention this insight in the later replies).
So maybe this is more about it recognizing its own name than itself in a mirror?
Oh interesting! I just had a go at testing it on screenshots from a parallel conversation and it seems like it incorrectly interprets those screenshots as also being of its own conversation.
So it seems like ‘recognising things it has said’ is doing very little of the heavy lifting and ‘recognising its own name’ is responsible for most of the effect.
I’ll have a bit more of a play around and probably put a disclaimer at the top of the post some time soon.
I just managed to replicate game successfully while sending only the message text as an image (screenshots below). So it works at least sometimes.
To get this result, I tried 3 times. In one attempt, it just failed. In the other, it recognized the screenshots, and won accidentally by spelling out the weekdays while instructing me to use an image editor. On the third try, it understood the game.
Someone pointed out that this only seems to work if the screenshots include the “ChatGPT” speaker tag; if you only screenshot the text of ChatGPT’s most recent response without the label indicating it is from ChatGPT, it seems to fail. Oddly, in one of my tests, it seemed to recognize its own text on the first time I sent it a screenshot, but then didn’t manage to figure out what to do next (nor did it mention this insight in the later replies).
So maybe this is more about it recognizing its own name than itself in a mirror?
Oh interesting! I just had a go at testing it on screenshots from a parallel conversation and it seems like it incorrectly interprets those screenshots as also being of its own conversation.
So it seems like ‘recognising things it has said’ is doing very little of the heavy lifting and ‘recognising its own name’ is responsible for most of the effect.
I’ll have a bit more of a play around and probably put a disclaimer at the top of the post some time soon.
I just managed to replicate game successfully while sending only the message text as an image (screenshots below). So it works at least sometimes.
To get this result, I tried 3 times. In one attempt, it just failed. In the other, it recognized the screenshots, and won accidentally by spelling out the weekdays while instructing me to use an image editor. On the third try, it understood the game.
Yeah, that’s a pretty sharp limitation on the result.
I’d love to know if any other AI is able to pass this test when we exclude the tag.