All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Against Active Shooter Drills

ZviJun 16, 2022, 1:40 PM

91 points

30 comments7 min readLW link

(thezvi.wordpress.com)

Announcing the Alignment of Complex Systems Research Group

Jan_Kulveit and technicalities

Jun 4, 2022, 4:10 AM

91 points

20 comments5 min readLW link

The “mind-body vicious cycle” model of RSI & back pain

Steven ByrnesJun 9, 2022, 12:30 PM

91 points

32 comments12 min readLW link

I’m trying out “asteroid mindset”

Alex_AltairJun 3, 2022, 1:35 PM

90 points

5 comments4 min readLW link

In defense of flailing, with foreword by Bill Burr

lcJun 17, 2022, 4:40 PM

88 points

6 comments4 min readLW link

I applied for a MIRI job in 2020. Here’s what happened next.

ViktoriaMalyasovaJun 15, 2022, 7:37 PM

86 points

17 comments7 min readLW link

Causal confusion as an argument against the scaling hypothesis

RobertKirk and David Scott Krueger (formerly: capybaralet)

Jun 20, 2022, 10:54 AM

86 points

30 comments15 min readLW link

Transcript of a Twitter Discussion on EA from June 2022

ZviJun 6, 2022, 1:50 PM

85 points

4 comments1 min readLW link

(thezvi.wordpress.com)

Air Conditioner Test Results & Discussion

johnswentworthJun 22, 2022, 10:26 PM

82 points

42 comments6 min readLW link

Air Conditioner Repair

ZviJun 27, 2022, 12:40 PM

81 points

34 comments4 min readLW link

(thezvi.wordpress.com)

Reinventing the wheel

jasoncrawfordJun 4, 2022, 10:39 PM

78 points

13 comments2 min readLW link

(rootsofprogress.org)

AI Training Should Allow Opt-Out

alyssavanceJun 23, 2022, 1:33 AM

76 points

13 comments6 min readLW link

A Quick List of Some Problems in AI Alignment As A Field

Nicholas / Heather KrossJun 21, 2022, 11:23 PM

75 points

12 comments6 min readLW link

(www.thinkingmuchbetter.com)

Worked Examples of Shapley Values

lalaithionJun 24, 2022, 5:13 PM

75 points

11 comments8 min readLW link

Some reflections on the LW community after several months of active engagement

M. Y. ZuoJun 25, 2022, 5:04 PM

72 points

40 comments4 min readLW link

Feature request: voting buttons at the bottom?

Oliver SourbutJun 24, 2022, 2:41 PM

71 points

12 comments1 min readLW link

Book Review: Talent

ZviJun 3, 2022, 8:10 PM

70 points

19 comments79 min readLW link

(thezvi.wordpress.com)

Eliciting Latent Knowledge (ELK) - Distillation/Summary

Marius HobbhahnJun 8, 2022, 1:18 PM

69 points

2 comments21 min readLW link

How to pursue a career in technical AI alignment

Charlie Rogers-SmithJun 4, 2022, 9:11 PM

69 points

1 comment39 min readLW link

Resources I send to AI researchers about AI safety

Vael GatesJun 14, 2022, 2:24 AM

69 points

12 comments1 min readLW link

Epistemological Vigilance for Alignment

adamShimiJun 6, 2022, 12:27 AM

66 points

11 comments10 min readLW link

Seven ways to become unstoppably agentic

Evie CottrellJun 26, 2022, 5:39 PM

64 points

16 comments8 min readLW link

[Question] Has anyone actually tried to convince Terry Tao or other top mathematicians to work on alignment?

P.Jun 8, 2022, 10:26 PM

64 points

51 comments4 min readLW link

Half-baked AI Safety ideas thread

Aryeh EnglanderJun 23, 2022, 4:11 PM

64 points

63 comments1 min readLW link

“Brain enthusiasts” in AI Safety

Jan and Samuel Nellessen

Jun 18, 2022, 9:59 AM

63 points

5 comments10 min readLW link

(universalprior.substack.com)

Ten experiments in modularity, which we’d like you to run!

CallumMcDougall, Lucius Bushnaq and Avery

Jun 16, 2022, 9:17 AM

62 points

3 comments9 min readLW link

[Question] What’s the contingency plan if we get AGI tomorrow?

YitzJun 23, 2022, 3:10 AM

61 points

23 comments1 min readLW link

Open Problems in AI X-Risk [PAIS #5]

Dan H and TW123

Jun 10, 2022, 2:08 AM

61 points

6 comments36 min readLW link

How Do Selection Theorems Relate To Interpretability?

johnswentworthJun 9, 2022, 7:39 PM

60 points

14 comments3 min readLW link

A short conceptual explainer of Immanuel Kant’s Critique of Pure Reason

jessicataJun 3, 2022, 1:06 AM

57 points

12 comments16 min readLW link

(unstableontology.com)

Covid 6/2/22: Declining to Respond

ZviJun 2, 2022, 1:50 PM

55 points

10 comments7 min readLW link

(thezvi.wordpress.com)

Kurzgesagt – The Last Human (Youtube)

habrykaJun 29, 2022, 3:28 AM

54 points

7 comments1 min readLW link

(www.youtube.com)

How To: A Workshop (or anything)

Duncan Sabien (Inactive)Jun 12, 2022, 8:00 AM

53 points

13 comments37 min readLW link 1 review

[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)

Aryeh EnglanderJun 23, 2022, 4:29 PM

53 points

3 comments1 min readLW link

Paradigms of AI alignment: components and enablers

VikaJun 2, 2022, 6:19 AM

53 points

4 comments8 min readLW link

How fast can we perform a forward pass?

jsteinhardtJun 10, 2022, 11:30 PM

53 points

9 comments15 min readLW link

(bounded-regret.ghost.io)

The horror of what must, yet cannot, be true

Kaj_SotalaJun 2, 2022, 10:20 AM

52 points

18 comments2 min readLW link

(kajsotala.fi)

Latent Adversarial Training

Adam JermynJun 29, 2022, 8:04 PM

52 points

13 comments5 min readLW link

What’s it like to have sex with Duncan?

Duncan Sabien (Inactive)Jun 17, 2022, 2:32 AM

52 points

19 comments17 min readLW link

Perils of optimizing in social contexts

owencbJun 16, 2022, 5:40 PM

50 points

1 comment2 min readLW link

Our mental building blocks are more different than I thought

Marius HobbhahnJun 15, 2022, 11:07 AM

50 points

11 comments14 min readLW link

Child Contracting

jefftkJun 26, 2022, 2:30 AM

48 points

2 comments1 min readLW link

(www.jefftk.com)

Poorly-Aimed Death Rays

Thane Ruthenis11 Jun 2022 18:29 UTC

48 points

5 comments4 min readLW link

Pitching an Alignment Softball

mu_(negative)7 Jun 2022 4:10 UTC

47 points

13 comments10 min readLW link

Why so little AI risk on rationalist-adjacent blogs?

Grant Demaree13 Jun 2022 6:31 UTC

46 points

23 comments8 min readLW link

[Link] Childcare : what the science says

Gunnar_Zarncke24 Jun 2022 21:45 UTC

46 points

4 comments1 min readLW link

(criticalscience.medium.com)

Summary of “AGI Ruin: A List of Lethalities”

Stephen McAleese10 Jun 2022 22:35 UTC

45 points

2 comments8 min readLW link

Dagger of Detect Evil

lsusr21 Jun 2022 6:23 UTC

45 points

22 comments3 min readLW link

Continuity Assumptions

Jan_Kulveit13 Jun 2022 21:31 UTC

44 points

13 comments4 min readLW link

FYI: I’m working on a book about the threat of AGI/ASI for a general audience. I hope it will be of value to the cause and the community

Darren McKee15 Jun 2022 18:08 UTC

43 points

15 comments2 min readLW link