>GPT-3 is very capable of saying “I don’t know” (or “yo be real”), but due to its training dataset it likely won’t say it on its own accord.
I’m not very convinced by the training data point. People write “I don’t know” on the Internet all the time (and “that makes no sense” occasionally). (Hofstadter’s article says both in his article, for example.) Also, RLHF presumably favors “I don’t know” over trying to BS, and still RLHFed models like those underlying ChatGPT and Bing still frequently make stuff up or output nonsense (though it apparently gets the examples from Hofstadter’s article right, see LawrenceC’s comment).
>GPT-3 is very capable of saying “I don’t know” (or “yo be real”), but due to its training dataset it likely won’t say it on its own accord.
I’m not very convinced by the training data point. People write “I don’t know” on the Internet all the time (and “that makes no sense” occasionally). (Hofstadter’s article says both in his article, for example.) Also, RLHF presumably favors “I don’t know” over trying to BS, and still RLHFed models like those underlying ChatGPT and Bing still frequently make stuff up or output nonsense (though it apparently gets the examples from Hofstadter’s article right, see LawrenceC’s comment).