Is anyone else baffled by this ranking? To my eye(/ear) Bard’s attempt is clearly the worst, and the gap between Claude and GPT-4 is small enough to come down to subjective judgment. (I prefer Claude’s rhythm, and its content seems more on-topic and less generic.)
Is anyone else baffled by this ranking? To my eye(/ear) Bard’s attempt is clearly the worst, and the gap between Claude and GPT-4 is small enough to come down to subjective judgment. (I prefer Claude’s rhythm, and its content seems more on-topic and less generic.)