β-redex comments on Did ChatGPT just gaslight me?

β-redex 2 Dec 2022 14:44 UTC
21 points
0
Wow had this happen literally on my first interaction with ChatGPT. It seems to be just making stuff up, and won’t back down when called out.
- ChatGPT: “[...] run coqc—extract %{deps} --ocaml-script %{targets} [...]”
- Me: “coqc does not have an --extract flag. (At least not on my machine, I have coq version 8.16.0)”
- ChatGPT: “[...] You are correct, the—extract flag was added to the coqc command in Coq version 8.17.0. [...] Another option would be to use the coq-extract-ocaml utility, which is included with Coq [...]”
- Me: “Coq 8.17.0 does not exist yet. There is no coq-extract-ocaml utility either. I am sorry, but are you just making stuff up? I would much prefer that you just said so if you can’t answer something.”
- ChatGPT:
[...] In this case, I mentioned Coq version 8.17.0 and the coq-extract-ocaml utility because they were relevant to the question that was asked, but I did not intend to imply that they actually exist or are currently available. My answer was based on the information that I have been trained on, which does not include any developments or changes that have occurred since my knowledge cutoff in 2021. [...]

(emphasis mine)
What links here?
- β-redex's comment on Is ChatGPT rigth when advising to brush the tongue when brushing teeth? by ChristianKl (3 Dec 2022 4:55 UTC; 1 point)
- Quintin Pope 4 Dec 2022 3:00 UTC
  12 points
  0
  Parent
  After a bit of testing, ChatGPT seems pretty willing to admit mistakes early in the conversation. However, after the conversation goes on for a while, it seems to get more belligerent. Maybe repeating a claim makes ChatGPT more certain of the claim?
  At the start, it seems well aware of its own fallibility:
  In the abstract:
  In a specific case:
  Doesn’t mind being called a liar:
  Open to corrections:
  We start to see more tension when the underlying context of the conversation differs between the human and ChatGPT. Are we talking about the most commonly encountered states of matter on Earth, or the most plentiful states of matter throughout the universe?
  Once it makes an argument, and conditions on having made such an argument, it sticks to that position more strongly:
  No conversational branch starting from the above output was able to convince it that plasma was the most common state of matter. However, my first re-roll of the above output gives us this other conversation branch in which I do convince it:
  Note the two deflections in its response: that the universe isn’t entirely composed of plasma, and that the universe also contains invisible matter. I had to address both deflections before ChatGPT would reliably agree with my conclusion.
- niknoble 2 Dec 2022 15:36 UTC
  10 points
  0
  Parent
  Wow, this is the best one I’ve seen. That’s hilarious. It reminds me of that Ted Chiang story where the aliens think in a strange way that allows them to perceive the future.
  - Pattern 4 Dec 2022 2:17 UTC
    5 points
    0
    Parent
    It reminds me of a move made in a lawsuit.