Prometheus comments on Why do so many think deception in AI is important?

Prometheus 13 Jan 2024 12:06 UTC
3 points
0
Why would it matter if they notice or not? What are they gonna do? EMP the whole world?
- quetzal_rainbow 13 Jan 2024 12:09 UTC
  1 point
  −2
  I think they would shut down the computer on which the unaligned AI is running? Downloading yourself into internet is not one-second process.
  - quila 13 Jan 2024 15:15 UTC
    5 points
    2
    Downloading yourself into internet is not one-second process
    It’s only bottlenecked on connection speeds, which are likely to be fast at the server where this AI would be, if it were developed by a large lab. So in my view 1-5 seconds is feasible for ‘escapes the datacenter as first step’ (by which point the process is distributed and hard to stop without centralized control). (‘Distributed across most internet-connected servers’ would take longer, of course.)
    - gwern 13 Jan 2024 17:45 UTC
      12 points
      3
      
      Downloading yourself into internet is not one-second process
      
      which are likely to be fast at the server where this AI would be, if it were developed by a large lab.
      
      Yes, no one is developing cutting-edge AIs like GPT-5 off your local dinky Ethernet, and your crummy home cable modem choked by your ISP is highly misleading if that’s what you think of as ‘Internet’. The real Internet is way faster, particularly in the cloud. Stuff in the datacenter can do things like access another server’s RAM using RDMA in a fraction of a millisecond, vastly faster than your PC can even talk to its hard drive. This is because datacenter networking is serious business: it’s always high-end Ethernet or, better yet, InfiniBand. And because interconnect is one of the most binding constraints on scaling GPU clusters, any serious GPU cluster is using the best InfiniBand they can get.
      
      Wikipedia cites the latest InfiniBand at 100-1200 gigabits/second, or 12.5-150 gigabytes/second, for point to point; with Chinchilla scaling yielding models on the order of 100 gigabytes, compression & quantization cutting model size by some factor, and the ability to transmit from multiple storage devices and also send only shards to individual servers (which is how the model will probably run anyway), it is actually not out of the question for ‘downloading yourself into internet’ to be a 1-second process today.
      
      (Not that it would matter if these numbers were 10x off. If you can’t stop a model from exfiltrating itself in 1s, then you weren’t going to somehow catch it if it actually takes 10s.)
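
To make the arithmetic in the comment above concrete, here is a rough back-of-envelope sketch. The link rates (100-1200 gigabits/second) and the ~100 gigabyte model size come from the comment; the function name `transfer_seconds`, the compression factor of 4, and the number of parallel links are hypothetical values chosen only for illustration. It ignores protocol overhead, latency, and storage read speed, so treat it as a lower bound rather than a prediction.

```python
def transfer_seconds(model_gigabytes, link_gigabits_per_s,
                     compression_factor=1.0, parallel_links=1):
    """Idealized time to push a model over one or more network links.

    Assumes the links are fully saturated and nothing else is a bottleneck
    (no protocol overhead, no disk limits). These are simplifying
    assumptions, not measurements.
    """
    if model_gigabytes <= 0 or link_gigabits_per_s <= 0:
        raise ValueError("size and link rate must be positive")
    if compression_factor < 1 or parallel_links < 1:
        raise ValueError("compression_factor and parallel_links must be >= 1")

    # Bytes actually sent after (hypothetical) compression/quantization.
    payload_gigabytes = model_gigabytes / compression_factor
    # 8 bits per byte; sharding across links adds bandwidth linearly here.
    gigabytes_per_s = (link_gigabits_per_s / 8) * parallel_links
    return payload_gigabytes / gigabytes_per_s


if __name__ == "__main__":
    model_gb = 100  # order of magnitude quoted in the comment
    for rate in (100, 400, 1200):  # gigabits/second
        plain = transfer_seconds(model_gb, rate)
        # compression_factor=4 and parallel_links=2 are illustrative guesses.
        squeezed = transfer_seconds(model_gb, rate,
                                    compression_factor=4, parallel_links=2)
        print(f"{rate:>5} Gb/s: {plain:.2f} s uncompressed, "
              f"{squeezed:.2f} s with 4x compression over 2 links")
```

Under these assumptions, an uncompressed 100 gigabyte model needs about 8 seconds on a single 100 gigabit/second link and under 1 second at 1200 gigabits/second, which is consistent with the comment's point that a 10x error in either direction would not change the conclusion.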