Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
You don’t know how bad most things are nor precisely how they’re bad.
Solenoid_Entity
Aug 4, 2024, 2:12 PM
327
points
49
comments
5
min read
LW
link
Would catching your AIs trying to escape convince AI developers to slow down or undeploy?
Buck
Aug 26, 2024, 4:46 PM
314
points
77
comments
4
min read
LW
link
Leaving MIRI, Seeking Funding
abramdemski
Aug 8, 2024, 6:32 PM
264
points
19
comments
2
min read
LW
link
Principles for the AGI Race
William_S
Aug 30, 2024, 2:29 PM
248
points
17
comments
18
min read
LW
link
The ‘strong’ feature hypothesis could be wrong
lewis smith
Aug 2, 2024, 2:33 PM
231
points
19
comments
17
min read
LW
link
AGI Safety and Alignment at Google DeepMind: A Summary of Recent Work
Rohin Shah
,
Seb Farquhar
and
Anca Dragan
Aug 20, 2024, 4:22 PM
222
points
33
comments
9
min read
LW
link
How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage
orthonormal
Aug 6, 2024, 2:32 AM
198
points
30
comments
3
min read
LW
link
WTH is Cerebrolysin, actually?
gsfitzgerald
and
delton137
Aug 6, 2024, 8:40 PM
175
points
23
comments
17
min read
LW
link
You can remove GPT2’s LayerNorm by fine-tuning for an hour
StefanHex
Aug 8, 2024, 6:33 PM
165
points
11
comments
8
min read
LW
link
[Question]
things that confuse me about the current AI market.
DMMF
Aug 28, 2024, 1:46 PM
156
points
27
comments
2
min read
LW
link
Liability regimes for AI
Ege Erdil
Aug 19, 2024, 1:25 AM
153
points
34
comments
5
min read
LW
link
The Information: OpenAI shows ‘Strawberry’ to feds, races to launch it
Martín Soto
Aug 27, 2024, 11:10 PM
145
points
15
comments
3
min read
LW
link
Fields that I reference when thinking about AI takeover prevention
Buck
Aug 13, 2024, 11:08 PM
144
points
16
comments
10
min read
LW
link
(redwoodresearch.substack.com)
Nursing doubts
dynomight
Aug 30, 2024, 2:25 AM
144
points
23
comments
9
min read
LW
link
(dynomight.net)
Limitations on Formal Verification for AI Safety
Andrew Dickson
Aug 19, 2024, 11:03 PM
134
points
60
comments
23
min read
LW
link
Parasites (not a metaphor)
lemonhope
Aug 8, 2024, 8:07 PM
133
points
19
comments
1
min read
LW
link
“Can AI Scaling Continue Through 2030?”, Epoch AI (yes)
gwern
Aug 24, 2024, 1:40 AM
130
points
4
comments
3
min read
LW
link
(epochai.org)
How I started believing religion might actually matter for rationality and moral philosophy
zhukeepa
Aug 23, 2024, 5:40 PM
129
points
41
comments
7
min read
LW
link
Near-mode thinking on AI
Olli Järviniemi
Aug 4, 2024, 8:47 PM
128
points
9
comments
5
min read
LW
link
Investigating the Chart of the Century: Why is food so expensive?
Maxwell Tabarrok
Aug 16, 2024, 1:21 PM
122
points
26
comments
3
min read
LW
link
(www.maximum-progress.com)
Please stop using mediocre AI art in your posts
Raemon
Aug 25, 2024, 12:13 AM
115
points
24
comments
2
min read
LW
link
Ten arguments that AI is an existential risk
KatjaGrace
and
Nathan Young
Aug 13, 2024, 5:00 PM
113
points
42
comments
7
min read
LW
link
(blog.aiimpacts.org)
Please support this blog (with money)
Elizabeth
Aug 17, 2024, 3:30 PM
112
points
3
comments
6
min read
LW
link
(acesounderglass.com)
A primer on the current state of longevity research
Abhishaike Mahajan
Aug 22, 2024, 5:14 PM
109
points
6
comments
14
min read
LW
link
(www.owlposting.com)
Danger, AI Scientist, Danger
Zvi
Aug 15, 2024, 10:40 PM
107
points
9
comments
7
min read
LW
link
(thezvi.wordpress.com)
Perplexity wins my AI race
Elizabeth
Aug 24, 2024, 7:20 PM
107
points
12
comments
10
min read
LW
link
(acesounderglass.com)
LLM Applications I Want To See
sarahconstantin
Aug 19, 2024, 9:10 PM
102
points
6
comments
8
min read
LW
link
(sarahconstantin.substack.com)
Why you should be using a retinoid
GeneSmith
Aug 19, 2024, 3:07 AM
98
points
60
comments
5
min read
LW
link
the Giga Press was a mistake
bhauth
Aug 21, 2024, 4:51 AM
98
points
26
comments
5
min read
LW
link
(bhauth.com)
It’s time for a self-reproducing machine
Carl Feynman
Aug 7, 2024, 9:52 PM
96
points
69
comments
9
min read
LW
link
[Question]
Am I confused about the “malign universal prior” argument?
nostalgebraist
Aug 27, 2024, 11:17 PM
95
points
35
comments
8
min read
LW
link
Dragon Agnosticism
jefftk
Aug 1, 2024, 5:00 PM
94
points
75
comments
2
min read
LW
link
(www.jefftk.com)
Defining alignment research
Richard_Ngo
Aug 19, 2024, 8:42 PM
92
points
23
comments
7
min read
LW
link
SB 1047: Final Takes and Also AB 3211
Zvi
Aug 27, 2024, 10:10 PM
92
points
11
comments
21
min read
LW
link
(thezvi.wordpress.com)
Circular Reasoning
abramdemski
Aug 5, 2024, 6:10 PM
91
points
37
comments
8
min read
LW
link
Singular learning theory: exercises
Zach Furman
Aug 30, 2024, 8:00 PM
90
points
5
comments
14
min read
LW
link
Solving adversarial attacks in computer vision as a baby version of general AI alignment
Stanislav Fort
Aug 29, 2024, 5:17 PM
88
points
8
comments
7
min read
LW
link
What Depression Is Like
Sable
Aug 27, 2024, 5:43 PM
87
points
24
comments
4
min read
LW
link
(affablyevil.substack.com)
Darwinian Traps and Existential Risks
KristianRonn
Aug 25, 2024, 10:37 PM
85
points
14
comments
10
min read
LW
link
If we solve alignment, do we die anyway?
Seth Herd
Aug 23, 2024, 1:13 PM
84
points
129
comments
4
min read
LW
link
Secular interpretations of core perennialist claims
zhukeepa
Aug 25, 2024, 11:41 PM
83
points
32
comments
14
min read
LW
link
Release: Optimal Weave (P1): A Prototype Cohabitive Game
mako yass
Aug 17, 2024, 2:08 PM
82
points
21
comments
7
min read
LW
link
In Defense of Open-Minded UDT
abramdemski
Aug 12, 2024, 6:27 PM
79
points
28
comments
11
min read
LW
link
Quick look: applications of chaos theory
Elizabeth
and
Alex_Altair
Aug 18, 2024, 3:00 PM
79
points
51
comments
8
min read
LW
link
(acesounderglass.com)
Value fragility and AI takeover
Joe Carlsmith
Aug 5, 2024, 9:28 PM
76
points
5
comments
30
min read
LW
link
Soft Nationalization: how the USG will control AI labs
Deric Cheng
and
Corin Katzke
27 Aug 2024 15:11 UTC
76
points
7
comments
21
min read
LW
link
(www.convergenceanalysis.org)
A Simple Toy Coherence Theorem
johnswentworth
and
David Lorell
2 Aug 2024 17:47 UTC
74
points
22
comments
7
min read
LW
link
AI for Bio: State Of The Field
sarahconstantin
30 Aug 2024 18:00 UTC
73
points
2
comments
15
min read
LW
link
(sarahconstantin.substack.com)
What is “True Love”?
johnswentworth
18 Aug 2024 16:05 UTC
72
points
11
comments
1
min read
LW
link
FarmKind’s Illusory Offer
jefftk
9 Aug 2024 11:30 UTC
71
points
5
comments
3
min read
LW
link
(www.jefftk.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel