Don’t align agents to evaluations of plans
TurnTrout | Nov 26, 2022, 9:16 PM | 48 points | 49 comments | 18 min read
What videos should Rational Animations make?
Writer | Nov 26, 2022, 8:28 PM | 30 points | 24 comments | 1 min read
The First Filter
adamShimi and Gabriel Alfour | Nov 26, 2022, 7:37 PM | 67 points | 5 comments | 1 min read
Respecting your Local Preferences
Scott Garrabrant | Nov 26, 2022, 7:04 PM | 73 points | 1 comment | 4 min read
[Question]
Opinions on the sleep synaptic homeostasis hypothesis?
Angela Pretorius | Nov 26, 2022, 7:01 PM | 3 points | 0 comments | 1 min read
Why square errors?
Aprillion | Nov 26, 2022, 1:40 PM | 41 points | 11 comments | 2 min read
[Question]
Assuming that at least one religion is true, what would you expect it to be?
risedive | Nov 26, 2022, 8:34 AM | −9 points | 9 comments | 1 min read
Three Alignment Schemas & Their Problems
Shoshannah Tekofsky | Nov 26, 2022, 4:25 AM | 19 points | 1 comment | 6 min read
The many types of blog posts
Adam Zerner | Nov 26, 2022, 3:57 AM | 10 points | 2 comments | 4 min read
New Frontiers in Mojibake
Adam Scherlis | Nov 26, 2022, 2:37 AM | 60 points | 7 comments | 6 min read | 1 review | (adam.scherlis.com)
Semi-conductor/AI Stock Discussion.
sapphire | Nov 25, 2022, 11:35 PM | 28 points | 25 comments | 1 min read
NEFFA Should Allow Small Children
jefftk | Nov 25, 2022, 11:00 PM | 10 points | 2 comments | 2 min read | (www.jefftk.com)
Podcast: Shoshannah Tekofsky on skilling up in AI safety, visiting Berkeley, and developing novel research ideas
Akash | Nov 25, 2022, 8:47 PM | 37 points | 2 comments | 9 min read
The man and the tool
pedroalvarado | Nov 25, 2022, 7:51 PM | −1 points | 0 comments | 4 min read
[Question]
What AI newsletters or substacks about AI do you recommend?
wunan | Nov 25, 2022, 7:29 PM | 6 points | 1 comment | 1 min read
Mechanistic anomaly detection and ELK
paulfchristiano | Nov 25, 2022, 6:50 PM | 135 points | 22 comments | 21 min read | (ai-alignment.com)
The Least Controversial Application of Geometric Rationality
Scott Garrabrant | Nov 25, 2022, 4:50 PM | 60 points | 22 comments | 4 min read
Planes are still decades away from displacing most bird jobs
guzey | Nov 25, 2022, 4:49 PM | 166 points | 13 comments | 3 min read
Take part in our giant study of cognitive abilities and get a customized report of your strengths and weaknesses!
spencerg | Nov 25, 2022, 4:28 PM | 8 points | 1 comment | 1 min read | (www.guidedtrack.com)
Guardian AI (Misaligned systems are all around us.)
Jessica Rumbelow | Nov 25, 2022, 3:55 PM | 15 points | 6 comments | 2 min read
Intuitions by ML researchers may get progressively worse concerning likely candidates for transformative AI
Viktor Rehnberg | Nov 25, 2022, 3:49 PM | 7 points | 0 comments | 2 min read
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika, Vikrant Varma, Ramana Kumar and Rohin Shah | Nov 25, 2022, 2:36 PM | 39 points | 9 comments | 6 min read | (vkrakovna.wordpress.com)
[Question]
Who holds all the USDT?
ChristianKl | Nov 25, 2022, 11:58 AM | 17 points | 6 comments | 1 min read
Fair Collective Efficient Altruism
Jobst Heitzig | Nov 25, 2022, 9:38 AM | 2 points | 1 comment | 5 min read
[Question]
If humanity one day discovers that it is a form of disease that threatens to destroy the universe, should it allow itself to be shut down?
Shmi | Nov 25, 2022, 8:27 AM | 4 points | 12 comments | 1 min read
Could a single alien message destroy us?
Writer and Matthew Barnett | Nov 25, 2022, 7:32 AM | 61 points | 23 comments | 6 min read | (youtu.be)
How do I start a programming career in the West?
Lao Mein | Nov 25, 2022, 6:37 AM | 38 points | 7 comments | 2 min read
The AI Safety community has four main work groups, Strategy, Governance, Technical and Movement Building
peterslattery | Nov 25, 2022, 3:45 AM | 1 point | 0 comments | 6 min read
Less Successful Cider Adventures
jefftk | Nov 25, 2022, 1:50 AM | 11 points | 1 comment | 1 min read | (www.jefftk.com)
Gliders in Language Models
Alexandre Variengien | Nov 25, 2022, 12:38 AM | 30 points | 11 comments | 10 min read
On Kelly and altruism
philh | Nov 24, 2022, 11:40 PM | 17 points | 6 comments | 12 min read | (reasonableapproximation.net)
Open technical problem: A Quinean proof of Löb’s theorem, for an easier cartoon guide
Andrew_Critch | Nov 24, 2022, 9:16 PM | 58 points | 35 comments | 3 min read | 1 review
[Question]
Historical examples of people gaining unusual cognitive abilities?
Nicholas / Heather Kross | Nov 24, 2022, 7:01 PM | 8 points | 2 comments | 1 min read
Corrigibility Via Thought-Process Deference
Thane Ruthenis | Nov 24, 2022, 5:06 PM | 17 points | 5 comments | 9 min read
Geometric Exploration, Arithmetic Exploitation
Scott Garrabrant | Nov 24, 2022, 3:36 PM | 126 points | 5 comments | 7 min read
What I Learned Running Refine
adamShimi | Nov 24, 2022, 2:49 PM | 108 points | 5 comments | 4 min read
Covid 11/24/22: Thanks for Good Health
Zvi | Nov 24, 2022, 1:00 PM | 26 points | 4 comments | 8 min read | (thezvi.wordpress.com)
[Question]
Dumb and ill-posed question: Is conceptual research like this MIRI paper on the shutdown problem/Corrigibility “real”
joraine | Nov 24, 2022, 5:08 AM | 25 points | 11 comments | 1 min read
Clarifying wireheading terminology
leogao | Nov 24, 2022, 4:53 AM | 66 points | 6 comments | 1 min read
LW Beta Feature: Side-Comments
jimrandomh | Nov 24, 2022, 1:55 AM | 103 points | 47 comments | 1 min read
Against “Classic Style”
Cleo Nardo | Nov 23, 2022, 10:10 PM | 67 points | 30 comments | 4 min read
South Bay ACX/LW Meetup
IS | Nov 23, 2022, 10:05 PM | 2 points | 0 comments | 1 min read
Meme Dialects
jefftk | Nov 23, 2022, 9:30 PM | 26 points | 1 comment | 2 min read | (www.jefftk.com)
[Question]
When do you visualize (or not) while doing math?
Alex_Altair | Nov 23, 2022, 8:15 PM | 20 points | 9 comments | 1 min read
When AI solves a game, focus on the game’s mechanics, not its theme.
Cleo Nardo | Nov 23, 2022, 7:16 PM | 89 points | 7 comments | 2 min read
The Geometric Expectation
Scott Garrabrant | Nov 23, 2022, 6:05 PM | 151 points | 21 comments | 4 min read
“Far Coordination”
DragonGod | Nov 23, 2022, 5:14 PM | 6 points | 17 comments | 9 min read
Conjecture Second Hiring Round
Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell | 23 Nov 2022 17:11 UTC | 92 points | 0 comments | 1 min read
Conjecture: a retrospective after 8 months of work
Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell | 23 Nov 2022 17:10 UTC | 180 points | 9 comments | 8 min read
Against a General Factor of Doom
Jeffrey Heninger | 23 Nov 2022 16:50 UTC | 61 points | 19 comments | 4 min read | 1 review | (aiimpacts.org)