I give ChatGPT a C- on reading comprehension.[1] I suggest that you stop taking LLMs’ word as gospel. If it can misunderstand something that clear-cut this severely, how can you trust any other conclusions it draws? How can you even post an “unbiased evaluation” with an error this severe, not acknowledge its abysmal quality, and then turn around and lecture people about truth-seeking?
I definitely advice against going to LLMs for social validation.
Here’s Claude 3.7 taking my side, lest you assume I’m dismissing LLMs because they denounce me. For context, Anthropic doesn’t pass the user’s name to Claude and has no cross-session memory, so it didn’t know my identity, there was no system prompt, and the .pdfs were generated by just “right-click → print → save as PDF” on the relevant LW pages.
For context, if someone else stumbles on this trainwreck: I was sarcastically calling my response a “thoughtless kneejerk reaction”. ChatGPT apparently somehow concluded I’d been referring to funnyfranco’s writing. I wonder if it didn’t read the debate properly and just skimmed it? I mean, all the cool kids were doing it.
Really, man?
I give ChatGPT a C- on reading comprehension.[1] I suggest that you stop taking LLMs’ word as gospel. If it can misunderstand something that clear-cut this severely, how can you trust any other conclusions it draws? How can you even post an “unbiased evaluation” with an error this severe, not acknowledge its abysmal quality, and then turn around and lecture people about truth-seeking?
I definitely advice against going to LLMs for social validation.
Here’s Claude 3.7 taking my side, lest you assume I’m dismissing LLMs because they denounce me. For context, Anthropic doesn’t pass the user’s name to Claude and has no cross-session memory, so it didn’t know my identity, there was no system prompt, and the .pdfs were generated by just “right-click → print → save as PDF” on the relevant LW pages.
For context, if someone else stumbles on this trainwreck: I was sarcastically calling my response a “thoughtless kneejerk reaction”. ChatGPT apparently somehow concluded I’d been referring to funnyfranco’s writing. I wonder if it didn’t read the debate properly and just skimmed it? I mean, all the cool kids were doing it.