That seems… good? It seems to be a purely mundane-utility capability improvement. It doesn’t improve on the architecture of the base LLM, and a base LLM that would be omnicide-capable wouldn’t need the AGI labs to hold its hand in order to learn how to use the computer. It seems barely different from AutoGPT, or the integration of Gemini into Android, and is about as existentially dangerous as advances in robotics.
The only new dangers it presents are mundane ones, which are likely to be satisfactorily handled by mundane mechanisms.
It’s bad inasmuch as this increases the attention to AI, attracts money to this sector, and increases competitive dynamics. But by itself, it seems fine. If all AI progress from this point on consisted of the labs racing in this direction, increasing the integration and the reliability of LLMs, this would all be perfectly fine and good.
I say Anthropic did nothing wrong in this one instance.
Anthropic did note that this advance ‘brings with it safety challenges.’ They focused their attention on present-day potential harms, on the theory that computer use does not fundamentally alter the skills of the underlying model, which remains ASL-2 even with this capability. And they propose that by introducing it now, while the worst-case scenarios are not so bad, we can learn what we are in for later, and figure out what improvements would make computer use dangerous.
A safety take from a major AGI lab that actually makes sense? This is unprecedented. Must be a sign of the apocalypse.