Prediction Thread: Make Predictions About How Different Factors Affect AGI X-Risk.

MrThink 27 Feb 2023 19:15 UTC

15 points

In this post you can make several predictions for how different factors affect the probability that the creation of AGI leads to an extinction level catastrophe. This might be useful for planning.

Please let me know if you have other ideas for questions that could be valuable to ask.

If AGI is developed before 2100, what is the probability it will cause an extinction level catastrophe?

Predictions based on who develops AGI:

If AGI is developed on January 1st, 2030, what is the probability it will cause an extinction level catastrophe?

If AGI is developed on January 1st 2030 by either Google, Microsoft (including OpenAI) or Meta, what is the probability it will cause an extinction level catastrophe?

If AGI is developed on January 1st 2030 by the US Government/military, what is the probability it will cause an extinction level catastrophe?

If AGI is developed on January 1st 2030 by researchers at a University with no close ties to Big Tech or any military, what is the probability it will cause an extinction level catastrophe?

Predictions based on technology used for developing AGI:

If AGI is developed on January 1st 2030 using RLHF similar to today, and no other breakthrough innovation, what is the probability it will cause an extinction level catastrophe?

If AGI is developed on January 1st 2030 using a different ML paradigm from what we have today, what is the probability it will cause an extinction level catastrophe?

Prediction based on approach for creating AGI:

If an AGI is developed on January 1st 2030 with the sole task of being an oracle, and it acts like an oracle during training, what is the probability it will cause an extinction level catastrophe?

If an AGI is developed on January 1st 2030 with the purpose of being a general assistant/worker, what is the probability it will cause an extinction level catastrophe?

Predictions on how money affects probability of AGI X-risk:

If today (27th of February) someone donated $10 billion purely based on advice from leading AI alignment researchers, how much would the risk of an AGI-caused extinction level catastrophe decrease? (E.g., if the risk goes from 50% to 49%, the risk has decreased by 2% in relative terms.)

If today (27th of February) someone donated $1 trillion purely based on advice from leading AI alignment researchers, how much would the risk of an AGI-caused extinction level catastrophe decrease?
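
As a rough sketch of the arithmetic behind the 50% to 49% example: the two donation questions ask for the relative decrease, i.e. the drop in risk divided by the starting risk, not the drop in percentage points. The function name `relative_risk_decrease` is made up for illustration and is not part of the original post.

```python
def relative_risk_decrease(before: float, after: float) -> float:
    """Return the relative decrease in risk as a fraction of the starting risk.

    Both arguments are probabilities between 0 and 1.
    A negative result means the risk went up rather than down.
    """
    if before <= 0:
        raise ValueError("starting risk must be positive")
    return (before - after) / before


# The example from the question: risk goes from 50% to 49%.
print(f"{relative_risk_decrease(0.50, 0.49):.0%}")
```

On this reading, a one percentage point drop from 50% counts as a 2% decrease, while the same one point drop from 10% would count as a 10% decrease.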

What links here?

Results Prediction Thread About How Different Factors Affect AI X-Risk by MrThink (2 Mar 2023 22:13 UTC; 9 points)

8 comments · 1 min read · LW link

Forecasts (Specific Predictions), AI

JBlack 28 Feb 2023 2:16 UTC
2 points
0
I’m moderately confident that money would just make things worse, but there’s no option for increase in catastrophe risk from increased donations.
- MrThink 28 Feb 2023 8:44 UTC
  1 point
  0
  Sadly, for some reason I could only create questions with answers between 1% and 99%. I guess we should interpret 1% to mean 1% or less (including negative).
  
  What makes you think more money would be net negative?
  
  Do you think that it would also be negative if you had 100% control of how the money was spent, or would it only apply if other AI Alignment researchers were responsible for the strategy to donate?
  - JBlack 2 Mar 2023 0:04 UTC
    2 points
    0
    I think more money spent right now, even with the best of intentions, is likely to increase capabilities much faster than it reduces risk. I think OpenAI and consequent capability races are turning out to be an example of this.
    There are hypothetical worlds where spending an extra ten billion (or a trillion) dollars on AI research with good intentions doesn’t do this, but I don’t think they’re likely to be our world. I don’t think that directing who gets the money is likely to prevent it, without pretty major non-monetary controls in addition.
    - MrThink 2 Mar 2023 8:23 UTC
      1 point
      0
      I do agree that OpenAI is an example of good intentions going wrong. However, I think we could learn from that, and top researchers would be wary of such risks.
      
      Nevertheless, I do think your concerns are valid and are important not to dismiss.
Daniel Kokotajlo 27 Feb 2023 20:07 UTC
2 points
2
Disclaimer: my answers are wild guesses & poorly calibrated, I spent ~5s on each of them.
Quintin Pope 27 Feb 2023 19:33 UTC
2 points
0
The odds of any particular AGI destroying the world are << the odds of AGI as a whole destroying the world, so the questions in “Prediction based on approach for creating AGI:” may be miscalibrated.
- JBlack 28 Feb 2023 2:25 UTC
  3 points
  0
  I interpreted that question as a conditional probability, and so not required to be strictly less. In particular, P(X causes catastrophe | X was of the given type, and the first AGI developed, on the given date).
  Nothing requires that this conditional probability should be less than “some AGI destroys the world conditional on any AGI being developed”.
- MrThink 27 Feb 2023 20:00 UTC
  1 point
  0
  Excellent point.
  I do think that the first AGI developed will have a big effect on the probability of doom, so hopefully it will be possible to derive some value from the question. But it would be interesting to control for what other AIs do, in order to get better calibrated statistics.