[Question] List of concrete hypotheticals for AI takeover?

Yitz7 Apr 2022 16:54 UTC

7 points

I was wondering if anyone’s compiled a list of posts which give a concrete description (with scenario-specific details) of a hypothetical future where humanity suffers an X or S scale disaster due to AGI takeover. If such a list does not already exist (or does exist but needs to be updated), please put links to specific posts of this kind in the comments!

Yitz7 Apr 2022 16:54 UTC

7 points

5 comments1 min readLW link

Singularity Hypotheticals List of Links

Daniel Kokotajlo 7 Apr 2022 19:59 UTC
8 points
AI Impacts has been building a collection somewhat like this:
https://aiimpacts.org/partially-plausible-fictional-ai-futures/?fbclid=IwAR3n_Jl2IEVh9FlJQwyqdefp2ieZF4w0PGOfsluNN1cSX5khhsq1CigxSLw
Joseph Miller 7 Apr 2022 20:09 UTC
7 points
Gwern recently had a popular post that was exactly that kind of thing: https://www.gwern.net/Clippy
Dagon 7 Apr 2022 19:56 UTC
7 points
Amusingly, I was writing https://www.lesswrong.com/posts/BkHRpF2cafyaoWxaT/believable-near-term-ai-disaster at the same time as you were posting the question, based on an earlier brainstorming exploration at https://www.lesswrong.com/posts/KTbGuLTnycA6wKBza/ .
- Yitz 7 Apr 2022 20:18 UTC
  1 point
  Parent
  Funny, I think we’re both coming from similar sources of inspiration :)

Joseph Miller 7 Apr 2022 20:33 UTC
1 point
All the stories I’ve read, even Gwern’s recent one feel surprisingly abstract. To me the obvious, very concrete story for an intelligence explosion looks like this:
1. Run a program that does the following:
  while true:
  1. Run Codex on its own source with the prompt: “Improve the performance and efficiency of this coding model”
  2. Train a new version of Codex using the modified source code.
  3. Run tests and benchmarks to check it is actually better. If so, update your local version of Codex
2. Wait until it is amazing / you are dead
Obviously Codex isn’t nearly good enough to do this and you would need the benchmarks to include very difficult tasks, so that as it starts to take off it still has room for improvement. But I don’t see why it would require a different kind of model.