Archive, Page 2
The Falling Drill · Screwtape · Aug 5, 2022, 12:08 AM · 46 points · 3 comments · 2 min read · LW link
Convergence Towards World-Models: A Gears-Level Model · Thane Ruthenis · Aug 4, 2022, 11:31 PM · 38 points · 1 comment · 13 min read · LW link
Cambist Booking · Screwtape · Aug 4, 2022, 10:40 PM · 20 points · 3 comments · 4 min read · LW link
Calibration Trivia · Screwtape · Aug 4, 2022, 10:31 PM · 12 points · 9 comments · 4 min read · LW link
Monthly Shorts 7/22 · Celer · Aug 4, 2022, 10:30 PM · 5 points · 0 comments · 3 min read · LW link · (keller.substack.com)
The Pragmascope Idea · johnswentworth · Aug 4, 2022, 9:52 PM · 59 points · 20 comments · 3 min read · LW link
Running a Basic Meetup · Screwtape · Aug 4, 2022, 9:49 PM · 21 points · 1 comment · 2 min read · LW link
Fiber arts, mysterious dodecahedrons, and waiting on “Eureka!” · eukaryote · Aug 4, 2022, 8:37 PM · 124 points · 15 comments · 9 min read · LW link · 1 review · (eukaryotewritesblog.com)
[Question] Would “Manhattan Project” style be beneficial or deleterious for AI Alignment? · Valentin2026 · Aug 4, 2022, 7:12 PM · 5 points · 1 comment · 1 min read · LW link
[Question] AI alignment: Would a lazy self-preservation instinct be sufficient? · BrainFrog · Aug 4, 2022, 5:53 PM · −1 points · 4 comments · 1 min read · LW link
Socratic Ducking, OODA Loops, Frame-by-Frame Debugging · CFAR!Duncan · Aug 4, 2022, 5:44 PM · 26 points · 1 comment · 5 min read · LW link
What do ML researchers think about AI in 2022? · KatjaGrace · Aug 4, 2022, 3:40 PM · 221 points · 33 comments · 3 min read · LW link · (aiimpacts.org)
Interpretability isn’t Free · Joel Burget · Aug 4, 2022, 3:02 PM · 12 points · 1 comment · 2 min read · LW link
Covid 8/4/22: Rebound · Zvi · Aug 4, 2022, 11:20 AM · 36 points · 0 comments · 11 min read · LW link · (thezvi.wordpress.com)
High Reliability Orgs, and AI Companies · Raemon · Aug 4, 2022, 5:45 AM · 86 points · 7 comments · 12 min read · LW link · 1 review
Surprised by ELK report’s counterexample to Debate, IDA · Evan R. Murphy · Aug 4, 2022, 2:12 AM · 18 points · 0 comments · 5 min read · LW link
Clapping Lower · jefftk · Aug 4, 2022, 2:10 AM · 38 points · 7 comments · 1 min read · LW link · (www.jefftk.com)
[Question] How do I know if my first post should be a post, or a question? · Nathan1123 · Aug 4, 2022, 1:46 AM · 3 points · 4 comments · 1 min read · LW link
Three pillars for avoiding AGI catastrophe: Technical alignment, deployment decisions, and coordination · LintzA · Aug 3, 2022, 11:15 PM · 24 points · 0 comments · 11 min read · LW link
Precursor checking for deceptive alignment · evhub · Aug 3, 2022, 10:56 PM · 24 points · 0 comments · 14 min read · LW link
Transformer language models are doing something more general · Numendil · Aug 3, 2022, 9:13 PM · 53 points · 6 comments · 2 min read · LW link
[Question] Some doubts about Non Superintelligent AIs · aditya malik · Aug 3, 2022, 7:55 PM · 0 points · 4 comments · 1 min read · LW link
Announcing Squiggle: Early Access · ozziegooen · Aug 3, 2022, 7:48 PM · 51 points · 7 comments · 7 min read · LW link · (forum.effectivealtruism.org)
Survey: What (de)motivates you about AI risk? · Daniel_Friedrich · Aug 3, 2022, 7:17 PM · 1 point · 0 comments · 1 min read · LW link · (forms.gle)
Externalized reasoning oversight: a research direction for language model alignment · tamera · Aug 3, 2022, 12:03 PM · 136 points · 23 comments · 6 min read · LW link
Open & Welcome Thread—Aug/Sep 2022 · Thomas · Aug 3, 2022, 10:22 AM · 9 points · 32 comments · 1 min read · LW link
[Question] How does one recognize information and differentiate it from noise? · M. Y. Zuo · Aug 3, 2022, 3:57 AM · 4 points · 29 comments · 1 min read · LW link
Law-Following AI 4: Don’t Rely on Vicarious Liability · Cullen · Aug 2, 2022, 11:26 PM · 5 points · 2 comments · 3 min read · LW link
Two-year update on my personal AI timelines · Ajeya Cotra · Aug 2, 2022, 11:07 PM · 293 points · 60 comments · 16 min read · LW link
What are the Red Flags for Neural Network Suffering? - Seeds of Science call for reviewers · rogersbacon · Aug 2, 2022, 10:37 PM · 24 points · 6 comments · 1 min read · LW link
Againstness · CFAR!Duncan · Aug 2, 2022, 7:29 PM · 50 points · 8 comments · 9 min read · LW link
(Summary) Sequence Highlights—Thinking Better on Purpose · qazzquimby · Aug 2, 2022, 5:45 PM · 33 points · 3 comments · 11 min read · LW link
Progress links and tweets, 2022-08-02 · jasoncrawford · Aug 2, 2022, 5:03 PM · 9 points · 0 comments · 1 min read · LW link · (rootsofprogress.org)
[Question] I want to donate some money (not much, just what I can afford) to AGI Alignment research, to whatever organization has the best chance of making sure that AGI goes well and doesn’t kill us all. What are my best options, where can I make the most difference per dollar? · lumenwrites · Aug 2, 2022, 12:08 PM · 15 points · 9 comments · 1 min read · LW link
Thinking without priors? · Q Home · Aug 2, 2022, 9:17 AM · 7 points · 0 comments · 9 min read · LW link
[Question] Would quantum immortality mean subjective immortality? · n0ah · Aug 2, 2022, 4:54 AM · 2 points · 10 comments · 1 min read · LW link
Turbocharging · CFAR!Duncan · Aug 2, 2022, 12:01 AM UTC · 52 points · 5 comments · 9 min read · LW link
Letter from leading Soviet Academicians to party and government leaders of the Soviet Union regarding signs of decline and structural problems of the economic-political system (1970) · M. Y. Zuo · Aug 1, 2022, 10:35 PM UTC · 20 points · 10 comments · 16 min read · LW link
Technical AI Alignment Study Group · Eric K · Aug 1, 2022, 6:33 PM UTC · 5 points · 0 comments · 1 min read · LW link
[Question] Is there any writing about prompt engineering for humans? · Alex Hollow · Aug 1, 2022, 12:52 PM UTC · 18 points · 8 comments · 1 min read · LW link
Meditation course claims 65% enlightenment rate: my review · KatWoods · Aug 1, 2022, 11:25 AM UTC · 111 points · 35 comments · 14 min read · LW link
[Question] Which intro-to-AI-risk text would you recommend to... · Sherrinford · Aug 1, 2022, 9:36 AM UTC · 12 points · 1 comment · 1 min read · LW link
Polaris, Five-Second Versions, and Thought Lengths · CFAR!Duncan · Aug 1, 2022, 7:14 AM UTC · 50 points · 12 comments · 8 min read · LW link
A Word is Worth 1,000 Pictures · Kully · Aug 1, 2022, 4:08 AM UTC · 1 point · 0 comments · 2 min read · LW link
On akrasia: starting at the bottom · seecrow · Aug 1, 2022, 4:08 AM UTC · 37 points · 2 comments · 3 min read · LW link
[Question] How likely do you think worse-than-extinction type fates to be? · span1 · Aug 1, 2022, 4:08 AM UTC · 3 points · 3 comments · 1 min read · LW link
Abstraction sacrifices causal clarity · Marv K · Jul 31, 2022, 7:24 PM UTC · 2 points · 0 comments · 3 min read · LW link
Time-logging programs and/or spreadsheets (2022) · mikbp · Jul 31, 2022, 6:18 PM UTC · 3 points · 3 comments · 1 min read · LW link
Conservatism is a rational response to epistemic uncertainty · contrarianbrit · Jul 31, 2022, 6:04 PM UTC · 2 points · 11 comments · 9 min read · LW link · (thomasprosser.substack.com)
South Bay ACX/LW Meetup · IS · Jul 31, 2022, 3:30 PM UTC · 2 points · 0 comments · 1 min read · LW link