I can then share my attempts with someone at Anthropic.
Alternatively, collaborating or sharing with e.g. the METR or UK AISI auto ML evals teams might be interesting. Maybe even Palisade or similar orgs, from a ‘scary demo’ perspective? @jacquesthibs might also be interested. I might also get to work on this or something related, depending on how some applications go.
I also expect Sakana, Jeff Clune’s group, and some parts of the open-source ML community will try to push this, but I’m more uncertain, at least in some of these cases, about the various differential acceleration tradeoffs.