SOTA
Do they have an evaluation result in Brier score, by back testing on resolved questions, similar to what is done in the literature?
(They have a pic with “expected Brier score”, which seems to be based on some kind of simulation?)
Futuresearch bets on Manifold.
Thanks! This seems the best way to eval the bot anyway!
Do they have an evaluation result in Brier score, by back testing on resolved questions, similar to what is done in the literature?
(They have a pic with “expected Brier score”, which seems to be based on some kind of simulation?)
Futuresearch bets on Manifold.
Thanks! This seems the best way to eval the bot anyway!