Mysteries of mode collapse

janus

Nov 8, 2022, 10:37 AM

284 points

57 comments · 14 min read · LW link · 1 review

What it’s like to dissect a cadaver

Alok Singh

Nov 10, 2022, 6:40 AM

208 points

24 comments · 5 min read · LW link

(alok.github.io)

I Converted Book I of The Sequences Into A Zoomer-Readable Format

dkirmani

Nov 10, 2022, 2:59 AM

200 points

32 comments · 2 min read · LW link

The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

beren and Sid Black

Nov 28, 2022, 12:54 PM

199 points

33 comments · 31 min read · LW link

Tyranny of the Epistemic Majority

Scott Garrabrant

Nov 22, 2022, 5:19 PM

192 points

13 comments · 9 min read · LW link · 1 review

Conjecture: a retrospective after 8 months of work

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

Nov 23, 2022, 5:10 PM

180 points

9 comments · 8 min read · LW link

Geometric Rationality is Not VNM Rational

Scott Garrabrant

Nov 27, 2022, 7:36 PM

176 points

27 comments · 3 min read · LW link

Planes are still decades away from displacing most bird jobs

guzey

Nov 25, 2022, 4:49 PM

168 points

13 comments · 3 min read · LW link

The Geometric Expectation

Scott Garrabrant

Nov 23, 2022, 6:05 PM

159 points

22 comments · 4 min read · LW link

Mechanistic anomaly detection and ELK

paulfchristiano

Nov 25, 2022, 6:50 PM

138 points

22 comments · 21 min read · LW link

(ai-alignment.com)

The Alignment Community Is Culturally Broken

sudo

Nov 13, 2022, 6:53 PM

136 points

68 comments · 2 min read · LW link

AI will change the world, but won’t take it over by playing “3-dimensional chess”.

boazbarak and benedelman

Nov 22, 2022, 6:57 PM

134 points

97 comments · 24 min read · LW link

Sadly, FTX

Zvi

Nov 17, 2022, 2:30 PM

133 points

18 comments · 47 min read · LW link

(thezvi.wordpress.com)

Clarifying AI X-risk

zac_kenton, Rohin Shah, David Lindner, Vikrant Varma, Vika, Mary Phuong, Ramana Kumar and Elliot Catt

Nov 1, 2022, 11:03 AM

127 points

24 comments · 4 min read · LW link · 1 review

On the Diplomacy AI

Zvi

Nov 28, 2022, 1:20 PM

127 points

29 comments · 11 min read · LW link

(thezvi.wordpress.com)

Geometric Exploration, Arithmetic Exploitation

Scott Garrabrant

Nov 24, 2022, 3:36 PM

126 points

5 comments · 7 min read · LW link

Utilitarianism Meets Egalitarianism

Scott Garrabrant

Nov 21, 2022, 7:00 PM

121 points

16 comments · 6 min read · LW link · 1 review

Speculation on Current Opportunities for Unusually High Impact in Global Health

johnswentworth

Nov 11, 2022, 8:47 PM

114 points

31 comments · 4 min read · LW link

How could we know that an AGI system will have good consequences?

So8res

Nov 7, 2022, 10:42 PM

111 points

25 comments · 5 min read · LW link

Applying superintelligence without collusion

Eric Drexler

Nov 8, 2022, 6:08 PM

109 points

63 comments · 4 min read · LW link

What I Learned Running Refine

adamShimi

Nov 24, 2022, 2:49 PM

108 points

5 comments · 4 min read · LW link

Caution when interpreting Deepmind’s In-context RL paper

Sam Marks

Nov 1, 2022, 2:42 AM

105 points

8 comments · 4 min read · LW link

Here’s the exit.

Valentine

Nov 21, 2022, 6:07 PM

105 points

180 comments · 10 min read · LW link · 5 reviews

Instrumental convergence is what makes general intelligence possible

tailcalled

Nov 11, 2022, 4:38 PM

105 points

11 comments · 4 min read · LW link

LW Beta Feature: Side-Comments

jimrandomh

Nov 24, 2022, 1:55 AM

103 points

47 comments · 1 min read · LW link

LessWrong readers are invited to apply to the Lurkshop

Jonas V and GradientDissenter

Nov 22, 2022, 9:19 AM

101 points

41 comments · 3 min read · LW link

Instead of technical research, more people should focus on buying time

Orpheus16, OliviaJ and Thomas Larsen

Nov 5, 2022, 8:43 PM

100 points

45 comments · 14 min read · LW link

ARC paper: Formalizing the presumption of independence

Erik Jenner

Nov 20, 2022, 1:22 AM

97 points

2 comments · 2 min read · LW link

(arxiv.org)

Searching for Search

NicholasKees and janus

Nov 28, 2022, 3:31 PM

97 points

9 comments · 14 min read · LW link · 1 review

Trying to Make a Treacherous Mesa-Optimizer

MadHatter

Nov 9, 2022, 6:07 PM

95 points

14 comments · 4 min read · LW link

(attentionspan.blog)

Meta AI announces Cicero: Human-Level Diplomacy play (with dialogue)

Jacy Reese Anthis

Nov 22, 2022, 4:50 PM

93 points

64 comments · 1 min read · LW link

(www.science.org)

Conjecture Second Hiring Round

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

Nov 23, 2022, 5:11 PM

92 points

0 comments · 1 min read · LW link

Current themes in mechanistic interpretability research

Lee Sharkey, Sid Black and beren

Nov 16, 2022, 2:14 PM

89 points

2 comments · 12 min read · LW link

When AI solves a game, focus on the game’s mechanics, not its theme.

Cleo Nardo

Nov 23, 2022, 7:16 PM

89 points

7 comments · 2 min read · LW link

By Default, GPTs Think In Plain Sight

Fabien Roger

Nov 19, 2022, 7:15 PM

88 points

36 comments · 9 min read · LW link

Announcing the Progress Forum

jasoncrawford

Nov 17, 2022, 7:26 PM

83 points

9 comments · 1 min read · LW link

Always know where your abstractions break

lsusr

Nov 27, 2022, 6:32 AM

82 points

6 comments · 2 min read · LW link

Results from the interpretability hackathon

Esben Kran and Neel Nanda

Nov 17, 2022, 2:51 PM

81 points

0 comments · 6 min read · LW link

(alignmentjam.com)

Exams-Only Universities

Mati_Roy

Nov 6, 2022, 10:05 PM

80 points

40 comments · 2 min read · LW link

What is epigenetics?

Metacelsus

Nov 6, 2022, 1:24 AM

78 points

4 comments · 6 min read · LW link

(denovo.substack.com)

Threat Model Literature Review

zac_kenton, Rohin Shah, David Lindner, Vikrant Varma, Vika, Mary Phuong, Ramana Kumar and Elliot Catt

Nov 1, 2022, 11:03 AM

78 points

4 comments · 25 min read · LW link

Elastic Productivity Tools

Simon Berens

Nov 19, 2022, 9:59 PM

76 points

8 comments · 2 min read · LW link

(simonberens.me)

Follow up to medical miracle

Elizabeth

Nov 4, 2022, 6:00 PM

76 points

5 comments · 6 min read · LW link

(acesounderglass.com)

Engineering Monosemanticity in Toy Models

Adam Jermyn, evhub and Nicholas Schiefer

Nov 18, 2022, 1:43 AM

75 points

7 comments · 3 min read · LW link

(arxiv.org)

Disagreement with bio anchors that lead to shorter timelines

Marius Hobbhahn

Nov 16, 2022, 2:40 PM

75 points

17 comments · 7 min read · LW link · 1 review

Will we run out of ML data? Evidence from projecting dataset size trends

Pablo Villalobos

Nov 14, 2022, 4:42 PM

75 points

12 comments · 2 min read · LW link

(epochai.org)

K-types vs T-types — what priors do you have?

Cleo Nardo

Nov 3, 2022, 11:29 AM

74 points

25 comments · 7 min read · LW link

Respecting your Local Preferences

Scott Garrabrant

Nov 26, 2022, 7:04 PM

73 points

1 comment · 4 min read · LW link

Takeaways from a survey on AI alignment resources

DanielFilan

Nov 5, 2022, 11:40 PM

73 points

10 comments · 6 min read · LW link · 1 review

(danielfilan.com)

Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Orpheus16 and OliviaJ

Nov 22, 2022, 10:19 PM

73 points

20 comments · 4 min read · LW link
