I looked on Google Scholar and found several proposed measures of calibration, but none of them are very good; they’re all worse than the ones proposed in this thread.