Rob Bensinger comments on AGI Ruin: A List of Lethalities

Rob Bensinger Jun 6, 2022, 10:45 PM
LW: 4 AF: 2
1
AF
But once you invent cheap tech that can control them you don’t need to kill them anymore.
A paperclipper mainly cares about humans because we might have some way to threaten the paperclipper (e.g., by pushing a button that deploys a rival superintelligence); and secondarily, we’re made of atoms that can be used to build paperclips.
It’s harder to monitor the actions of every single human on Earth, than it is to kill all humans; and there’s a risk that monitoring people visibly will cause someone to push the ‘deploy a rival superintelligence’ button, if such a button exists.
Also, every minute that passes without you killing all humans, in the time window between ‘I’m confident I can kill all humans’ and ‘I’m carefully surveilling every human on Earth and know that there’s no secret bunker where someone has a Deploy Superintelligence button’, is a minute where you’re risking somebody pushing the ‘deploy a rival superintelligence’ button. This makes me think that the value of delaying ‘killing all humans’ (once you’re confident you can do it) would need to be very high in order to offset that risk.
One reason I might be wrong is if the AGI is worried about something like a dead man’s switch that deploys a rival superintelligence iff some human isn’t alive and regularly performing some action. (Not necessarily a likely scenario on priors, but once you’re confident enough in your base plan, unlikely scenarios can end up dominating the remaining scenarios where you lose.) Then it’s at least possible that you’d want to delay long enough to confirm that no such switch exists.
the first thing I do if I am superintelligent and wanting to secure my position is not take over the earth, which isn’t in a particularly useful spot resource wise and instead launch my nanofactory beyond the reach of humans to mercury or something.
You should be able to do both in parallel. I don’t have a strong view on which is higher-priority. Given the dead-man’s-switch worry above, you might want to prioritize sending a probe off-planet first as a precaution; but then go ahead and kill humans ASAP.
- romeostevensit Jun 7, 2022, 1:39 AM
  LW: 3 AF: 1
  0
  AF Parent
  This is exactly what I was thinking about though, this idea of monitoring every human on earth seems like a failure of imagination on our part. I’m not safe from predators because I monitor the location of every predator on earth. I admit that many (overwhelming majority probably) of scenarios in this vein are probably pretty bad and involve things like putting only a few humans on ice while getting rid of the rest.
  - Rob Bensinger Jun 7, 2022, 4:07 AM
    LW: 4 AF: 3
    2
    AF Parent
    I mean, all of this feels very speculative and un-cruxy to me; I wouldn’t be surprised if the ASI indeed is able to conclude that humanity is no threat at all, in which case it kills us just to harvest the resources.
    I do think that normal predators are a little misleading in this context, though, because they haven’t crossed the generality (‘can do science and tech’) threshold. Tigers won’t invent new machines, so it’s easier to upper-bound their capabilities. General intelligences are at least somewhat qualitatively trickier, because your enemy is ‘the space of all reachable technologies’ (including tech that may be surprisingly reachable). Tigers can surprise you, but not in very many ways and not to a large degree.