Patrick Leask comments on ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks

Patrick Leask 1 Sep 2023 11:54 UTC
5 points
0
Here you go: https://chat.openai.com/share/c5df0119-13de-43f9-8d4e-1c437bafa8ec
- Lukas Finnveden 2 Sep 2023 1:32 UTC
  2 points
  0
  Parent
  Thanks!