Even if we’re already doomed, we might still negotiate with the AGI.
I borrow the idea from Astronomical Waste. The Virgo Supercluster has a luminosity of about 3×10¹² solar luminosities ≈ 10³⁹ W, losing mass at a rate of 10³⁹ W / c² ≈ 10²² kg/s.[1]
The Earth has mass ~6×10²⁴ kg.
At that rate, Earth's mass corresponds to roughly 600 seconds of loss. So if human help (or nonresistance) can allow the AGI to effectively start up (and begin space colonization) 600 seconds = 10 minutes earlier, then it would be mutually beneficial for humans to cooperate with the AGI (in the initial stages, when the AGI could benefit from human nonresistance), in return for the AGI sparing Earth[2] (and, at minimum, giving us fusion technology to stay alive when the Sun is dismantled).
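A minimal sketch of the arithmetic behind the 10-minute figure, using the rounded numbers above (the exact answer shifts slightly depending on how the mass-loss rate is rounded):

```python
# Back-of-the-envelope: how many seconds of earlier start-up are "worth"
# one Earth mass, measured against the Virgo Supercluster's radiative loss.

L_VIRGO = 1e39          # W, ~3e12 solar luminosities (rounded as in the text)
C = 3.0e8               # speed of light, m/s
M_EARTH = 6e24          # kg

mass_loss_rate = L_VIRGO / C ** 2        # ~1e22 kg/s radiated away as light
break_even = M_EARTH / mass_loss_rate    # head start worth one Earth mass
print(f"mass loss ~{mass_loss_rate:.1e} kg/s")
print(f"break-even head start ~{break_even:.0f} s (~{break_even / 60:.0f} min)")
```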
(While the AGI only needs to trust humanity for 10 minutes, humanity needs to trust the AGI eternally. We still need good enough decision-making to cooperate.)
We may choose to consider the reachable universe instead. Armstrong and Sandberg (2013) (section 4.4.2, Reaching into the universe) estimate that we could reach about 10⁹ galaxies, with a luminosity of 10⁴⁷ W and a mass loss of 10²⁹ kg/s. That is dwarfed by the 10⁵ stars that become unreachable per second (Siegel (2021), Kurzgesagt (2021)), a mass loss of 10³⁵ kg/s.
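A rough sanity check of that comparison (the ~1 solar mass per star is my assumption; the other figures are from the sources above):

```python
# Compare the reachable universe's radiative mass loss with the mass of
# stars slipping out of reach each second.

M_SUN = 2e30                  # kg, approximate mass of one star (assumption)
radiative_loss = 1e29         # kg/s, from the luminosity figure above
stars_lost_per_second = 1e5   # stars becoming unreachable each second

unreachable_loss = stars_lost_per_second * M_SUN   # ~2e35 kg/s
print(f"unreachable-star loss ~{unreachable_loss:.0e} kg/s, "
      f"about {unreachable_loss / radiative_loss:.0e}x the radiative loss")
```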
Starting earlier but sparing Earth means a space colonization progress curve that starts earlier but initially increases more slowly. The AGI requires that space colonization progress with human help be asymptotically 10 minutes earlier, that is:
For any sufficiently large time t, progress with human help at time t ≥ progress with human resistance at time (t + 10 minutes).
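Written out (with P_help and P_resist as my labels for the two progress curves):

```latex
\exists\, T \;\; \forall\, t \ge T:\qquad
P_{\text{help}}(t) \;\ge\; P_{\text{resist}}\!\left(t + 600\,\mathrm{s}\right)
```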
The opportunity cost of sparing Earth is far larger than the cost of sparing a random planet halfway across the universe. The AI starts on Earth. If it can't disassemble Earth for spaceship mass, it has to send a small probe from Earth to Mars and disassemble Mars instead, which introduces a fair bit of delay. Not touching Earth is a big restriction in the first few years and first few doublings; once the AI reaches a few other solar systems, it becomes a much less important restriction.
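To put rough numbers on how quickly that restriction shrinks (the milestone masses are standard approximate values I'm supplying):

```python
# How big a handicap is "don't touch Earth" as the AI's reach grows?
# Masses are standard approximate values, in kg.

M_EARTH = 6.0e24
milestones = {
    "Moon only":                  7.3e22,
    "Moon + Mars":                7.3e22 + 6.4e23,
    "solar system minus Earth":   2.0e30,   # dominated by the Sun
}
for name, accessible in milestones.items():
    fraction = M_EARTH / (accessible + M_EARTH)
    print(f"{name:<28} Earth is {fraction:.2g} of the accessible mass")
```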
Of course, you can’t TDT trade with the AI because you have no acausal correlation with it. We can’t predict the AI’s actions well enough.
Intuition pump / generalising from fictional evidence: in the games Pandemic / Plague Inc. (where the player “controls” a pathogen and attempts to infect the whole human population on Earth), a lucky, early cross-border infection can help you win the game faster — more than the difference between a starting infected population of 1 vs 100,000.
This informs my intuition behind when the bonus of earlier spaceflight (through human help) could outweigh the penalty of not dismantling Earth.
When might human help outweigh the penalty of not dismantling Earth? It requires these conditions:
1. The AGI can very quickly reach an alternative source of materials: AGI spaceflight is superhuman.
AGI spacecraft, once in space, can reach e.g. the Moon within hours and the Sun within a day (see the rough speed estimate after this list)
The AGI is willing to wait for additional computational power (it can wait until it has reached the Sun), but it really wants to leave Earth quickly
2. The AGI’s best alternative to a negotiated agreement is to lie in wait initially: AGI ground operations are initially weaker than human.
In the initial days, humans could reliably prevent the AGI from building or launching spacecraft
In the initial days, the AGI is vulnerable to human action; it would choose to lie low and wouldn’t effectively begin dismantling Earth
3. If there is a negotiated agreement, then human help (or nonresistance) can allow the AGI to launch its first spacecraft days earlier.
Relevant human decision makers recognize that the AGI will eventually win any conflict, and decide to instead start negotiating immediately
Relevant human decision makers can effectively coordinate multiple parts of the economy (to help the AGI), or (in the nonresistance case) can effectively prevent others from interfering with the initially weak AGI
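As a rough calibration of condition 1 (the travel times are the ones assumed above; the distances are standard values, and the chemical-rocket comparison is my own ballpark):

```python
# Implied average speeds for "the Moon within hours, the Sun within a day".
# For comparison, fast chemical-rocket probes manage on the order of 10 km/s.

MOON_DISTANCE = 3.84e8    # m
SUN_DISTANCE = 1.5e11     # m (1 AU)
C = 3.0e8                 # m/s

moon_speed = MOON_DISTANCE / (5 * 3600)   # "within hours": assume ~5 h
sun_speed = SUN_DISTANCE / (24 * 3600)    # "within a day"

print(f"Moon in ~5 h  -> ~{moon_speed / 1e3:.0f} km/s")
print(f"Sun in ~1 day -> ~{sun_speed / 1e3:.0f} km/s (~{sun_speed / C:.1%} of c)")
```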
I now think that the conjunction of all these conditions is unlikely, so I agree that this negotiation is unlikely to work.
I’m really intrigued by this idea! It seems very similar to past thoughts I’ve had about “blackmailing” the AI, but with a more positive spin.