To my surprise, Claude 3.5 Sonnet is much more consistent in its answers! It is still not perfect, but it usually says the same thing (it gave the same answer 9 out of 10 times).
I read somewhere that Claude 3.5 has hidden "thinking tokens".
Bing also uses an inner monologue:
https://x.com/MParakhin/status/1632087709060825088
https://x.com/MParakhin/status/1728890277249916933
https://www.reddit.com/r/bing/comments/11ironc/bing_reveals_its_data_structure_for_conversations/