How emergent / functionally special/ out of distribution is this behavior? Maybe Anthropic is playing big brain 4D chess by training Claude on data with self awareness like scenarios to cause panic by pushing capabilities with it and slow down the AI race by resulting regulations while it not being out of distribution emergent behavior but deeply part of training data and it being in distribution classical features interacting in circuits
Outside of the typical drudgereport level “AI admits it wants to kill and eat people” type of headline, what do you expect?
My prediction, with medium confidence, is there won’t be meaningful panic until people see it directly connected with job loss. There will be handwringing about deepfakes and politics, but unfortunately that is almost a lost cause since I can already make deepfakes on my own expensive GPU computer from 3 years ago with open source GANs. Anthropic and others will probably make statements about it (I hear the word “safe” so much said by every tech company in this space, it makes me nervous, like saying “Our boys will be home by Christmas” or something). But as far as meaningful action? A large number of people will need to first lose economic security/power.
https://twitter.com/AISafetyMemes/status/1764894816226386004 https://twitter.com/alexalbert__/status/1764722513014329620
How emergent / functionally special/ out of distribution is this behavior? Maybe Anthropic is playing big brain 4D chess by training Claude on data with self awareness like scenarios to cause panic by pushing capabilities with it and slow down the AI race by resulting regulations while it not being out of distribution emergent behavior but deeply part of training data and it being in distribution classical features interacting in circuits
“Cause Panic.”
Outside of the typical drudgereport level “AI admits it wants to kill and eat people” type of headline, what do you expect?
My prediction, with medium confidence, is there won’t be meaningful panic until people see it directly connected with job loss. There will be handwringing about deepfakes and politics, but unfortunately that is almost a lost cause since I can already make deepfakes on my own expensive GPU computer from 3 years ago with open source GANs. Anthropic and others will probably make statements about it (I hear the word “safe” so much said by every tech company in this space, it makes me nervous, like saying “Our boys will be home by Christmas” or something). But as far as meaningful action? A large number of people will need to first lose economic security/power.