GPT-4o Can In Some Cases Solve Moderately Complicated Captchas

Here are several examples; I found these captchas via the web rather than generating them anew, but none of them came attached to solutions so I’m not sure their presence in the training data would affect things in any case. (That said, it’s possible that the lower resolution of the latter two degraded the adversarial perturbation; I would appreciate a source of higher-resolution captchas if anyone happens to know one.)

It clearly couldn’t see all the objects, but the owl was in fact the correct answer
Entertaining failure at basic numerals while nonetheless answering correctly here
This one surprised me: I expected the image to be too low-resolution to be comprehensible, but 8/9 are correct here (the middle left image is a chair with an unusually low back)