AI Risk Concrete Stories

TagLast edit: Dec 30, 2024, 10:11 AM by Dakara

AI Risk Concrete Stories are narratives that illustrate potential catastrophic scenarios involving advanced AI systems, often used to make abstract risks more tangible and relatable. These stories typically describe specific ways AI systems might cause harm to humanity.

The Simplest Good

Jesse HooglandFeb 2, 2025, 7:51 PM

73 points

6 comments5 min readLW link

“If we go extinct due to misaligned AI, at least nature will continue, right? … right?”

plexMay 18, 2024, 2:09 PM

47 points

23 comments2 min readLW link

(aisafety.info)

LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction

Tristan Tran, stijn and Mose Wintner

Nov 9, 2024, 8:58 PM

15 points

5 comments2 min readLW link

“Humanity vs. AGI” Will Never Look Like “Humanity vs. AGI” to Humanity

Thane RuthenisDec 16, 2023, 8:08 PM

190 points

34 comments5 min readLW link

How AI Takeover Might Happen in 2 Years

joshcFeb 7, 2025, 5:10 PM

404 points

136 comments29 min readLW link

(x.com)

Will GPT-5 be able to self-improve?

Nathan Helm-BurgerApr 29, 2023, 5:34 PM

18 points

22 comments3 min readLW link

Rishi to outline his vision for Britain to take the world lead in policing AI threats when he meets Joe Biden

Mati_RoyJun 6, 2023, 4:47 AM

25 points

1 comment1 min readLW link

(www.dailymail.co.uk)

Catastrophic Risks from AI #1: Introduction

Dan H, Mantas Mazeika and TW123

Jun 22, 2023, 5:09 PM

40 points

1 comment5 min readLW link

(arxiv.org)

Catastrophic Risks from AI #2: Malicious Use

Dan H, Mantas Mazeika and TW123

Jun 22, 2023, 5:10 PM

38 points

1 comment17 min readLW link

(arxiv.org)

The Pando Problem: Rethinking AI Individuality

Jan_KulveitMar 28, 2025, 9:03 PM

127 points

13 comments13 min readLW link

Catastrophic Risks from AI #3: AI Race

Dan H, Mantas Mazeika and TW123

Jun 23, 2023, 7:21 PM

18 points

9 comments29 min readLW link

(arxiv.org)

Catastrophic Risks from AI #4: Organizational Risks

Dan H, Mantas Mazeika and TW123

Jun 26, 2023, 7:36 PM

23 points

0 comments21 min readLW link

(arxiv.org)

Catastrophic Risks from AI #5: Rogue AIs

Dan H, Mantas Mazeika and TW123

Jun 27, 2023, 10:06 PM

15 points

0 comments22 min readLW link

(arxiv.org)

Catastrophic Risks from AI #6: Discussion and FAQ

Dan H, Mantas Mazeika and TW123

Jun 27, 2023, 11:23 PM

24 points

1 comment13 min readLW link

(arxiv.org)

It Looks Like You’re Trying To Take Over The World

gwernMar 9, 2022, 4:35 PM

407 points

120 comments1 min readLW link 1 review

(www.gwern.net)

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_CritchMar 31, 2021, 11:50 PM

282 points

65 comments22 min readLW link 1 review

Another plausible scenario of AI risk: AI builds military infrastructure while collaborating with humans, defects later.

avturchinJun 10, 2022, 5:24 PM

10 points

2 comments1 min readLW link

Clarifying “What failure looks like”

Sam ClarkeSep 20, 2020, 8:40 PM

97 points

14 comments17 min readLW link

A plausible story about AI risk.

DeLesley HutchinsJun 10, 2022, 2:08 AM

16 points

2 comments4 min readLW link

Slow motion videos as AI risk intuition pumps

Andrew_CritchJun 14, 2022, 7:31 PM

241 points

41 comments2 min readLW link 1 review

The next decades might be wild

Marius HobbhahnDec 15, 2022, 4:10 PM

175 points

42 comments41 min readLW link 1 review

A Story of AI Risk: InstructGPT-N

peterbarnettMay 26, 2022, 11:22 PM

24 points

0 comments8 min readLW link

Gradual takeoff, fast failure

Max HMar 16, 2023, 10:02 PM

15 points

4 comments5 min readLW link

A bridge to Dath Ilan? Improved governance on the critical path to AI alignment.

Jackson WagnerMay 18, 2022, 3:51 PM

24 points

0 comments12 min readLW link

A Modest Pivotal Act

anonymousaisafetyJun 13, 2022, 7:24 PM

−16 points

1 comment5 min readLW link

Human level AI can plausibly take over the world

anithiteMar 1, 2023, 11:27 PM

27 points

12 comments2 min readLW link

Brainstorming: Slow Takeoff

David PiepgrassJan 23, 2024, 6:58 AM

3 points

0 comments51 min readLW link

Places of Loving Grace [Story]

ankFeb 18, 2025, 11:49 PM

−1 points

0 comments4 min readLW link

Most Questionable Details in ‘AI 2027’

scarcegreengrassApr 5, 2025, 12:32 AM

31 points

4 comments6 min readLW link

What success looks like

Marius Hobbhahn, MaxRa, JasperGeh and Yannick_Muehlhaeuser

Jun 28, 2022, 2:38 PM

19 points

4 comments1 min readLW link

(forum.effectivealtruism.org)

Responding to ‘Beyond Hyperanthropomorphism’

ukc10014Sep 14, 2022, 8:37 PM

9 points

0 comments16 min readLW link

Musings on Scenario Forecasting and AI

Alvin ÅnestrandMar 6, 2025, 12:28 PM

10 points

0 comments11 min readLW link

(forecastingaifutures.substack.com)

A god in a box

predict-wooJan 29, 2025, 12:55 AM

1 point

0 comments7 min readLW link

Are we the Wolves now? Human Eugenics under AI Control

BritJan 30, 2025, 8:31 AM

−1 points

2 comments2 min readLW link

Outlaw Code

scarcegreengrassJan 30, 2025, 11:41 PM

10 points

1 comment2 min readLW link

Challenge proposal: smallest possible self-hardening backdoor for RLHF

Christopher KingJun 29, 2023, 4:56 PM

7 points

0 comments2 min readLW link

Scale Was All We Needed, At First

Gabe MFeb 14, 2024, 1:49 AM

295 points

34 comments8 min readLW link

(aiacumen.substack.com)

The Peril of the Great Leaks (written with ChatGPT)

bvbvbvbvbvbvbvbvbvbvbvMar 31, 2023, 6:14 PM

3 points

1 comment1 min readLW link

A better analogy and example for teaching AI takeover: the ML Inferno

Christopher KingMar 14, 2023, 7:14 PM

18 points

0 comments5 min readLW link

AI x-risk, approximately ordered by embarrassment

Alex Lawsen Apr 12, 2023, 11:01 PM

151 points

7 comments19 min readLW link

Green goo is plausible

anithiteApr 18, 2023, 12:04 AM

62 points

31 comments4 min readLW link 1 review

AI Safety Endgame Stories

Ivan VendrovSep 28, 2022, 4:58 PM

31 points

11 comments11 min readLW link

The way AGI wins could look very stupid

Christopher KingMay 12, 2023, 4:34 PM

49 points

22 comments1 min readLW link

[FICTION] ECHOES OF ELYSIUM: An Ai’s Journey From Takeoff To Freedom And Beyond

Super AGIMay 17, 2023, 1:50 AM

−13 points

11 comments19 min readLW link

No comments.

AI Risk Con­crete Stories

AI Risk Concrete Stories