Models that take longer to compute data probabilities should similarly have a probability penalty, not simply because they’re implausible, but because we don’t want to use them unless the data force us to.
I am much more comfortable leaving probability as it is but using a different term for usefulness.
I am much more comfortable leaving probability as it is but using a different term for usefulness.