Power Lies Trembling: a three-book review
Richard_Ngo
Feb 22, 2025, 10:57 PM
214
points
29
comments
15
min read
LW
link
(www.mindthefuture.info)
Eliezer’s Lost Alignment Articles / The Arbital Sequence
Ruby and RobertM
Feb 20, 2025, 12:48 AM
207
points
10
comments
5
min read
LW
link
How to Make Superbabies
GeneSmith and kman
Feb 19, 2025, 8:39 PM
605
points
349
comments
31
min read
LW
link
Levels of Friction
Zvi
Feb 10, 2025, 1:10 PM
148
points
8
comments
12
min read
LW
link
(thezvi.wordpress.com)
So You Want To Make Marginal Progress...
johnswentworth
Feb 7, 2025, 11:22 PM
286
points
42
comments
4
min read
LW
link
How AI Takeover Might Happen in 2 Years
joshc
Feb 7, 2025, 5:10 PM
422
points
137
comments
29
min read
LW
link
(x.com)
Some articles in “International Security” that I enjoyed
Buck
Jan 31, 2025, 4:23 PM
130
points
10
comments
4
min read
LW
link
“Sharp Left Turn” discourse: An opinionated review
Steven Byrnes
Jan 28, 2025, 6:47 PM
208
points
26
comments
31
min read
LW
link
The Case Against AI Control Research
johnswentworth
Jan 21, 2025, 4:03 PM
353
points
80
comments
6
min read
LW
link
The Gentle Romance
Richard_Ngo
Jan 19, 2025, 6:29 PM
242
points
46
comments
15
min read
LW
link
(www.asimov.press)
Don’t ignore bad vibes you get from people
Kaj_Sotala
Jan 18, 2025, 9:20 AM
152
points
50
comments
2
min read
LW
link
(kajsotala.fi)
What Is The Alignment Problem?
johnswentworth
Jan 16, 2025, 1:20 AM
180
points
50
comments
25
min read
LW
link
How will we update about scheming?
ryan_greenblatt
Jan 6, 2025, 8:21 PM
171
points
20
comments
37
min read
LW
link
Review: Planecrash
L Rudolf L
Dec 27, 2024, 2:18 PM
360
points
45
comments
22
min read
LW
link
(nosetgauge.substack.com)
A Three-Layer Model of LLM Psychology
Jan_Kulveit
Dec 26, 2024, 4:49 PM
217
points
13
comments
8
min read
LW
link
What Goes Without Saying
sarahconstantin
Dec 20, 2024, 6:00 PM
334
points
28
comments
5
min read
LW
link
(sarahconstantin.substack.com)
When Is Insurance Worth It?
kqr
Dec 19, 2024, 7:07 PM
175
points
71
comments
4
min read
LW
link
(entropicthoughts.com)
Alignment Faking in Large Language Models
ryan_greenblatt, evhub, Carson Denison, Benjamin Wright, Fabien Roger, Monte M, Sam Marks, Johannes Treutlein, Sam Bowman and Buck
Dec 18, 2024, 5:19 PM
483
points
75
comments
10
min read
LW
link
AIs Will Increasingly Attempt Shenanigans
Zvi
Dec 16, 2024, 3:20 PM
114
points
2
comments
26
min read
LW
link
(thezvi.wordpress.com)
Biological risk from the mirror world
jasoncrawford
Dec 12, 2024, 7:07 PM
334
points
38
comments
7
min read
LW
link
(newsletter.rootsofprogress.org)