O O comments on AI Forecasting: Two Years In

O O 21 Aug 2023 16:31 UTC
2 points
2
I think it’s better if calculators are counted for the ultimate purpose of the benchmark. We can’t ban AI models from using symbolic logic as an alignment strategy.
- Dan H 21 Aug 2023 16:53 UTC
  3 points
  0
  Parent
  The purpose of this is to test and forecast problem-solving ability, using examples that substantially lose informativeness in the presence of Python executable scripts. I think this restriction isn’t an ideological statement about what sort of alignment strategies we want.