A year later, I still mostly stand by this point. I think “the AI escapes the datacenter” is about as likely as “the AI takes control of the datacenter”. I sometimes refer to this distinction as “escaping out of the datacenter” vs “escaping into the datacenter”.
Jan Leike’s post on self-exfiltration is pretty relevant.