METR releases a report, Evaluating frontier AI R&D capabilities of language model agents against human experts: https://metr.org/blog/2024-11-22-evaluating-r-d-capabilities-of-llms/
Daniel Kokotajlo and Eli Lifland both feel that one should update towards shorter timelines remaining until the start of rapid acceleration via AIs doing AI research based on this report:
https://x.com/DKokotajlo67142/status/1860079440497377641
https://x.com/eli_lifland/status/1860087262849171797
Somewhat pedantic correction: they don’t say “one should update”. They say they update (plus something caveats).
METR releases a report, Evaluating frontier AI R&D capabilities of language model agents against human experts: https://metr.org/blog/2024-11-22-evaluating-r-d-capabilities-of-llms/
Daniel Kokotajlo and Eli Lifland both feel that one should update towards shorter timelines remaining until the start of rapid acceleration via AIs doing AI research based on this report:
https://x.com/DKokotajlo67142/status/1860079440497377641
https://x.com/eli_lifland/status/1860087262849171797
Somewhat pedantic correction: they don’t say “one should update”. They say they update (plus something caveats).