The ants and the grasshopper
Richard_Ngo | Jun 4, 2023, 10:00 PM | 465 points | 44 comments | 5 min read | 4 reviews | (www.narrativeark.xyz)
Things I Learned by Spending Five Thousand Hours In Non-EA Charities
jenn | Jun 1, 2023, 8:48 PM | 430 points | 35 comments | 8 min read | 1 review | (jenn.site)
Guide to rationalist interior decorating
mingyuan | Jun 19, 2023, 6:47 AM | 324 points | 52 comments | 12 min read | 4 reviews
When do “brains beat brawn” in Chess? An experiment
titotal | Jun 28, 2023, 1:33 PM | 315 points | 106 comments | 7 min read | 2 reviews | (titotal.substack.com)
The Base Rate Times, news through prediction markets
vandemonian | Jun 6, 2023, 5:42 PM | 268 points | 41 comments | 4 min read | 1 review
UFO Betting: Put Up or Shut Up
RatsWrongAboutUAP | Jun 13, 2023, 4:05 AM | 260 points | 216 comments | 2 min read | 1 review
Lessons On How To Get Things Right On The First Try
johnswentworth and David Lorell | Jun 19, 2023, 11:58 PM | 252 points | 57 comments | 10 min read | 1 review
Munk AI debate: confusions and possible cruxes
Steven Byrnes | Jun 27, 2023, 2:18 PM | 244 points | 21 comments | 8 min read
Updates and Reflections on Optimal Exercise after Nearly a Decade
romeostevensit | Jun 8, 2023, 11:02 PM | 213 points | 57 comments | 2 min read | 1 review
Launching Lightspeed Grants (Apply by July 6th)
habryka | Jun 7, 2023, 2:53 AM | 211 points | 42 comments | 5 min read
Lightcone Infrastructure/LessWrong is looking for funding
habryka | Jun 14, 2023, 4:45 AM | 205 points | 39 comments | 1 min read
My tentative best guess on how EAs and Rationalists sometimes turn crazy
habryka | Jun 21, 2023, 4:11 AM | 199 points | 110 comments | 8 min read
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
likenneth | Jun 11, 2023, 5:38 AM | 195 points | 4 comments | 1 min read | (arxiv.org)
Another medical miracle
Dentin | Jun 25, 2023, 8:43 PM | 186 points | 48 comments | 3 min read
What will GPT-2030 look like?
jsteinhardt | Jun 7, 2023, 11:40 PM | 185 points | 43 comments | 23 min read | (bounded-regret.ghost.io)
I still think it’s very unlikely we’re observing alien aircraft
dynomight | Jun 15, 2023, 1:01 PM | 180 points | 70 comments | 5 min read | (dynomight.net)
LLMs Sometimes Generate Purely Negatively-Reinforced Text
Fabien Roger | Jun 16, 2023, 4:31 PM | 177 points | 11 comments | 7 min read
Will the growing deer prion epidemic spread to humans? Why not?
eukaryote | Jun 25, 2023, 4:31 AM | 170 points | 33 comments | 13 min read | (eukaryotewritesblog.com)
The Dial of Progress
Zvi | Jun 13, 2023, 1:40 PM | 161 points | 119 comments | 11 min read | (thezvi.wordpress.com)
Change my mind: Veganism entails trade-offs, and health is one of the axes
Elizabeth | Jun 1, 2023, 5:10 PM | 160 points | 85 comments | 19 min read | 2 reviews | (acesounderglass.com)
Algorithmic Improvement Is Probably Faster Than Scaling Now
johnswentworth | Jun 6, 2023, 2:57 AM | 146 points | 25 comments | 2 min read
My side of an argument with Jacob Cannell about chip interconnect losses
Steven Byrnes | Jun 21, 2023, 1:33 PM | 144 points | 11 comments | 11 min read
Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?
1a3orn | Jun 1, 2023, 7:36 PM | 137 points | 76 comments | 24 min read | 2 reviews
Think carefully before calling RL policies “agents”
TurnTrout | Jun 2, 2023, 3:46 AM | 134 points | 38 comments | 4 min read | 1 review
Why Not Subagents?
johnswentworth and David Lorell | Jun 22, 2023, 10:16 PM | 130 points | 52 comments | 14 min read | 1 review
A Disneyland Without Children
L Rudolf L | Jun 4, 2023, 1:06 PM | 126 points | 11 comments | 4 reviews | (nosetgauge.substack.com)
The Hubinger lectures on AGI safety: an introductory lecture series
evhub | Jun 22, 2023, 12:59 AM | 126 points | 0 comments | 1 min read | (www.youtube.com)
ARC is hiring theoretical researchers
paulfchristiano, Jacob_Hilton and Mark Xu | Jun 12, 2023, 6:50 PM | 126 points | 12 comments | 4 min read | (www.alignment.org)
AI #14: A Very Good Sentence
Zvi | Jun 1, 2023, 9:30 PM | 118 points | 30 comments | 65 min read | (thezvi.wordpress.com)
Model, Care, Execution
Ricki Heicklen and AvitalM | Jun 26, 2023, 4:05 AM | 111 points | 10 comments | 12 min read | 1 review | (bayesshammai.substack.com)
My guess for why I was wrong about US housing
romeostevensit | Jun 14, 2023, 12:37 AM | 110 points | 13 comments | 1 min read
Did Bengio and Tegmark lose a debate about AI x-risk against LeCun and Mitchell?
Karl von Wendt | Jun 25, 2023, 4:59 PM | 106 points | 53 comments | 7 min read
Short Remark on the (subjective) mathematical ‘naturalness’ of the Nanda—Lieberum addition modulo 113 algorithm
carboniferous_umbraculum | Jun 1, 2023, 11:31 AM | 104 points | 12 comments | 2 min read
Work dumber not smarter
lemonhope | Jun 1, 2023, 12:40 PM | 101 points | 17 comments | 3 min read
Public Transit is not Infinitely Safe
jefftk | Jun 20, 2023, 6:40 PM | 97 points | 34 comments | 1 min read | (www.jefftk.com)
AI #17: The Litany
Zvi | Jun 22, 2023, 2:30 PM | 95 points | 34 comments | 56 min read | (thezvi.wordpress.com)
Takeaways from the Mechanistic Interpretability Challenges
scasper | Jun 8, 2023, 6:56 PM | 94 points | 5 comments | 6 min read
60+ Possible Futures
Bart Bussmann | Jun 26, 2023, 9:16 AM | 93 points | 18 comments | 11 min read
A Playbook for AI Risk Reduction (focused on misaligned AI)
HoldenKarnofsky | Jun 6, 2023, 6:05 PM | 90 points | 42 comments | 14 min read | 1 review
When is correlation transitive?
Ege Erdil | Jun 23, 2023, 4:09 PM | 83 points | 7 comments | 6 min read
Ethodynamics of Omelas
dr_s | Jun 10, 2023, 4:24 PM | 83 points | 18 comments | 9 min read | 1 review
Automatic Rate Limiting on LessWrong
Raemon | Jun 23, 2023, 8:19 PM | 83 points | 34 comments | 8 min read
Outreach success: Intro to AI risk that has been successful
Michael Tontchev | Jun 1, 2023, 11:12 PM | 83 points | 8 comments | 74 min read | (medium.com)
DSLT 0. Distilling Singular Learning Theory
Liam Carroll | Jun 16, 2023, 9:50 AM | 80 points | 7 comments | 5 min read
Carl Shulman on The Lunar Society (7 hour, two-part podcast)
ESRogs | Jun 28, 2023, 1:23 AM | 79 points | 17 comments | 1 min read | (www.dwarkeshpatel.com)
A mind needn’t be curious to reap the benefits of curiosity
So8res | Jun 2, 2023, 6:00 PM | 78 points | 14 comments | 1 min read
Cultivate an obsession with the object level
Richard_Ngo | Jun 7, 2023, 1:39 AM | 77 points | 4 comments | 3 min read
My research agenda in agent foundations
Alex_Altair | Jun 28, 2023, 6:00 PM | 75 points | 9 comments | 11 min read
Rational Animations is looking for an AI Safety scriptwriter, a lead community manager, and other roles.
Writer | Jun 16, 2023, 9:41 AM | 74 points | 1 comment | 3 min read
A comparison of causal scrubbing, causal abstractions, and related methods
Erik Jenner, Adrià Garriga-alonso and Egor Zverev | 8 Jun 2023 23:40 UTC | 73 points | 3 comments | 22 min read