I’m torn! I think that better LLM scaffolding accelerates capabilities as much as it accelerates alignment. On the other hand, a programmer (or a non-programmer with help from ChatGPT) could easily reproduce my current scaffolding code. Maybe open-sourcing the current state of the project is fine. What do you think?
I do think open-sourcing is better, because there has already been a lot of public attention and results on LLM capabilities that are messy and misleading, and open-sourcing one eval like this might improve our understanding a lot. Also, there are tons of LLM agent projects/startups trying to build hype, so if you drop a benchmark here you are unlikely to attract unwanted attention (I'm guessing). I largely agree with https://www.lesswrong.com/posts/fRSj2W4Fjje8rQWm9/thoughts-on-sharing-information-about-language-model
EDIT: The agent I built for this replication is now publicly available as part of the METR task workbench: https://drive.google.com/drive/folders/1-m1y0_Akunqq5AWcFoEH2_-BeKwsodPf
If it is twice as easy to reproduce, that halves both the positives and the negatives of open-sourcing; it doesn't change the direction.
Beware the Unilateralist’s Curse.
I believe you should err on the side of not releasing it.
At the very least, would you be happy to share the code with alignment researchers interested in using it for our experiments?
I neglected to update my comment here—the agent I built for this replication is now publicly available as part of the METR task workbench, here: https://drive.google.com/drive/folders/1-m1y0_Akunqq5AWcFoEH2_-BeKwsodPf
Which is not good enough: we need alignment to accelerate faster than capabilities in order to catch up.
I think open-sourcing the current state of the project would be very useful to researchers.