Archive: Page 2
Interpretability isn’t Free · Joel Burget · Aug 4, 2022, 3:02 PM · 12 points · 1 comment · 2 min read
Covid 8/4/22: Rebound · Zvi · Aug 4, 2022, 11:20 AM · 36 points · 0 comments · 11 min read · (thezvi.wordpress.com)
High Reliability Orgs, and AI Companies · Raemon · Aug 4, 2022, 5:45 AM · 86 points · 7 comments · 12 min read · 1 review
Surprised by ELK report’s counterexample to Debate, IDA · Evan R. Murphy · Aug 4, 2022, 2:12 AM · 18 points · 0 comments · 5 min read
Clapping Lower · jefftk · Aug 4, 2022, 2:10 AM · 38 points · 7 comments · 1 min read · (www.jefftk.com)
[Question] How do I know if my first post should be a post, or a question? · Nathan1123 · Aug 4, 2022, 1:46 AM · 3 points · 4 comments · 1 min read
Three pillars for avoiding AGI catastrophe: Technical alignment, deployment decisions, and coordination · LintzA · Aug 3, 2022, 11:15 PM · 24 points · 0 comments · 11 min read
Precursor checking for deceptive alignment · evhub · Aug 3, 2022, 10:56 PM · 24 points · 0 comments · 14 min read
Transformer language models are doing something more general · Numendil · Aug 3, 2022, 9:13 PM · 53 points · 6 comments · 2 min read
[Question] Some doubts about Non Superintelligent AIs · aditya malik · Aug 3, 2022, 7:55 PM · 0 points · 4 comments · 1 min read
Announcing Squiggle: Early Access · ozziegooen · Aug 3, 2022, 7:48 PM · 51 points · 7 comments · 7 min read · (forum.effectivealtruism.org)
Survey: What (de)motivates you about AI risk? · Daniel_Friedrich · Aug 3, 2022, 7:17 PM · 1 point · 0 comments · 1 min read · (forms.gle)
Externalized reasoning oversight: a research direction for language model alignment · tamera · Aug 3, 2022, 12:03 PM · 136 points · 23 comments · 6 min read
Open & Welcome Thread—Aug/Sep 2022 · Thomas · Aug 3, 2022, 10:22 AM · 9 points · 32 comments · 1 min read
[Question] How does one recognize information and differentiate it from noise? · M. Y. Zuo · Aug 3, 2022, 3:57 AM · 4 points · 29 comments · 1 min read
Law-Following AI 4: Don’t Rely on Vicarious Liability · Cullen · Aug 2, 2022, 11:26 PM · 5 points · 2 comments · 3 min read
Two-year update on my personal AI timelines · Ajeya Cotra · Aug 2, 2022, 11:07 PM · 293 points · 60 comments · 16 min read
What are the Red Flags for Neural Network Suffering? - Seeds of Science call for reviewers · rogersbacon · Aug 2, 2022, 10:37 PM · 24 points · 6 comments · 1 min read
Againstness · CFAR!Duncan · Aug 2, 2022, 7:29 PM · 50 points · 8 comments · 9 min read
(Summary) Sequence Highlights—Thinking Better on Purpose · qazzquimby · Aug 2, 2022, 5:45 PM · 33 points · 3 comments · 11 min read
Progress links and tweets, 2022-08-02 · jasoncrawford · Aug 2, 2022, 5:03 PM · 9 points · 0 comments · 1 min read · (rootsofprogress.org)
[Question] I want to donate some money (not much, just what I can afford) to AGI Alignment research, to whatever organization has the best chance of making sure that AGI goes well and doesn’t kill us all. What are my best options, where can I make the most difference per dollar? · lumenwrites · Aug 2, 2022, 12:08 PM · 15 points · 9 comments · 1 min read
Thinking without priors? · Q Home · Aug 2, 2022, 9:17 AM · 7 points · 0 comments · 9 min read
[Question] Would quantum immortality mean subjective immortality? · n0ah · Aug 2, 2022, 4:54 AM · 2 points · 10 comments · 1 min read
Turbocharging · CFAR!Duncan · Aug 2, 2022, 12:01 AM · 52 points · 5 comments · 9 min read
Letter from leading Soviet Academicians to party and government leaders of the Soviet Union regarding signs of decline and structural problems of the economic-political system (1970) · M. Y. Zuo · Aug 1, 2022, 10:35 PM · 20 points · 10 comments · 16 min read
Technical AI Alignment Study Group · Eric K · Aug 1, 2022, 6:33 PM · 5 points · 0 comments · 1 min read
[Question] Is there any writing about prompt engineering for humans? · Alex Hollow · Aug 1, 2022, 12:52 PM · 18 points · 8 comments · 1 min read
Meditation course claims 65% enlightenment rate: my review · KatWoods · Aug 1, 2022, 11:25 AM · 111 points · 35 comments · 14 min read
[Question] Which intro-to-AI-risk text would you recommend to... · Sherrinford · Aug 1, 2022, 9:36 AM · 12 points · 1 comment · 1 min read
Polaris, Five-Second Versions, and Thought Lengths · CFAR!Duncan · Aug 1, 2022, 7:14 AM · 50 points · 12 comments · 8 min read
A Word is Worth 1,000 Pictures · Kully · Aug 1, 2022, 4:08 AM · 1 point · 0 comments · 2 min read
On akrasia: starting at the bottom · seecrow · Aug 1, 2022, 4:08 AM · 37 points · 2 comments · 3 min read
[Question] How likely do you think worse-than-extinction type fates to be? · span1 · Aug 1, 2022, 4:08 AM · 3 points · 3 comments · 1 min read
Abstraction sacrifices causal clarity · Marv K · Jul 31, 2022, 7:24 PM · 2 points · 0 comments · 3 min read
Time-logging programs and/or spreadsheets (2022) · mikbp · Jul 31, 2022, 6:18 PM · 3 points · 3 comments · 1 min read
Conservatism is a rational response to epistemic uncertainty · contrarianbrit · Jul 31, 2022, 6:04 PM · 2 points · 11 comments · 9 min read · (thomasprosser.substack.com)
South Bay ACX/LW Meetup · IS · Jul 31, 2022, 3:30 PM · 2 points · 0 comments · 1 min read
Perverse Independence Incentives · jefftk · Jul 31, 2022, 2:40 PM · 61 points · 3 comments · 1 min read · (www.jefftk.com)
Wolfram Research v Cook · Kenny · Jul 31, 2022, 1:35 PM · 7 points · 3 comments · 8 min read
Wanted: Notation for credal resilience · PeterH · Jul 31, 2022, 7:35 AM · 21 points · 12 comments · 1 min read
Anatomy of a Dating Document · squidious · Jul 31, 2022, 2:40 AM · 29 points · 24 comments · 4 min read · (opalsandbonobos.blogspot.com)
chinchilla’s wild implications · nostalgebraist · Jul 31, 2022, 1:18 AM · 424 points · 128 comments · 10 min read · 1 review
AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical · Roman Leventov · Jul 30, 2022, 8:56 PM UTC · 24 points · 10 comments · 1 min read
[Question] What job should I do? · Tom Paine · Jul 30, 2022, 9:15 AM UTC · 2 points · 8 comments · 1 min read
How transparency changed over time · ViktoriaMalyasova · Jul 30, 2022, 4:36 AM UTC · 21 points · 0 comments · 6 min read
Translating between Latent Spaces · JamesH, Jeremy Gillen and NickyP · Jul 30, 2022, 3:25 AM UTC · 27 points · 2 comments · 8 min read
Drexler’s Nanotech Forecast · PeterMcCluskey · Jul 30, 2022, 12:45 AM UTC · 25 points · 28 comments · 3 min read · (www.bayesianinvestor.com)
Humans Reflecting on HRH · leogao · Jul 29, 2022, 9:56 PM UTC · 27 points · 4 comments · 2 min read
Comparing Four Approaches to Inner Alignment · Lucas Teixeira · Jul 29, 2022, 9:06 PM UTC · 38 points · 1 comment · 9 min read