That seem reasonable, but it will probably change a number of correct answers (to tricky questions) as well if asked whether it’s certain. One should verify that the number of incorrect answers fixed is significantly larger than the number of errors introduced.
But it might be difficult to devise a set of equally difficult questions for which the first result is different. Maybe choose questions where different instances give different answers, and see if asking a double check changes the wrong answers but not the correct ones?
Right. I see this as a problem also, asking the model if it’s sure is injecting information if we only ask on wrong answers. If we ask always it may disturb more right answers than it fixes wrong ones.
Its also accuracy dependent—if the model is 99 percent accurate on a subtask then asking if it’s sure may degrade accuracy, while it may improve it on a subtask it’s 50 percent accurate on.
Or in other words, we could prompt it and it might do better on AP English but less good on the bar exam.
That seem reasonable, but it will probably change a number of correct answers (to tricky questions) as well if asked whether it’s certain. One should verify that the number of incorrect answers fixed is significantly larger than the number of errors introduced.
But it might be difficult to devise a set of equally difficult questions for which the first result is different. Maybe choose questions where different instances give different answers, and see if asking a double check changes the wrong answers but not the correct ones?
Right. I see this as a problem also, asking the model if it’s sure is injecting information if we only ask on wrong answers. If we ask always it may disturb more right answers than it fixes wrong ones.
Its also accuracy dependent—if the model is 99 percent accurate on a subtask then asking if it’s sure may degrade accuracy, while it may improve it on a subtask it’s 50 percent accurate on.
Or in other words, we could prompt it and it might do better on AP English but less good on the bar exam.