I believe this is a non-scientific question, in the same vein as philosophical-zombie questions. Person A says “GPT did come up with a number by that point” and Person B says “GPT did not come up with a number by that point,” but as long as it still outputs the correct responses after that point, neither person can be proven correct. This is why real-world scientific assessments of these AI capabilities are far more informative than intuitive ideas about what they’re supposed to be able to do (even if they’re only programmed to predict the next word, it’s wrong to assume a priori that a next-word predictor is incapable of specific tasks, or to declare those achievements “faked intelligence” when it gets them right).