Yeah, I also don’t like Brier scores. My guess is they are better at allowing people to pretend that brier scores on different sets of forecasts are meaningfully comparable (producing meaningless sentences like “superforecasters generally have a brier score around 0.35”), whereas the log scoring rule only ever loses you points, so it’s more clear that it really isn’t comparable between different question sets.
Yeah, I also don’t like Brier scores. My guess is they are better at allowing people to pretend that brier scores on different sets of forecasts are meaningfully comparable (producing meaningless sentences like “superforecasters generally have a brier score around 0.35”), whereas the log scoring rule only ever loses you points, so it’s more clear that it really isn’t comparable between different question sets.