Fred Zhang comments on AI forecasting bots incoming

Fred Zhang 10 Sep 2024 12:32 UTC
5 points
1

SOTA

Do they have an evaluation result in Brier score, by back testing on resolved questions, similar to what is done in the literature?

(They have a pic with “expected Brier score”, which seems to be based on some kind of simulation?)
- Garrett Baker 10 Sep 2024 17:06 UTC
  5 points
  0
  Parent
  Futuresearch bets on Manifold.
  - Fred Zhang 10 Sep 2024 19:31 UTC
    1 point
    0
    Parent
    Thanks! This seems the best way to eval the bot anyway!