Slides: Potential Risks From Advanced AI
Aryeh Englander
Apr 28, 2022, 2:15 AM
7
points
0
comments
1
min read
LW
link
Naive comments on AGIlignment
Ericf
Apr 28, 2022, 1:08 AM
−8
points
4
comments
1
min read
LW
link
AI Alternative Futures: Scenario Mapping Artificial Intelligence Risk—Request for Participation (*Closed*)
Kakili
Apr 27, 2022, 10:07 PM
10
points
2
comments
8
min read
LW
link
The Speed + Simplicity Prior is probably anti-deceptive
Yonadav Shavit
Apr 27, 2022, 7:30 PM
30
points
28
comments
12
min read
LW
link
If you’re very optimistic about ELK then you should be optimistic about outer alignment
Sam Marks
Apr 27, 2022, 7:30 PM
17
points
8
comments
3
min read
LW
link
The Game of Masks
Slimepriestess
Apr 27, 2022, 6:03 PM
50
points
18
comments
11
min read
LW
link
(hivewired.wordpress.com)
Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements
Cullen
Apr 27, 2022, 5:30 PM
2
points
2
comments
3
min read
LW
link
Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)
Cullen
Apr 27, 2022, 5:27 PM
5
points
2
comments
6
min read
LW
link
Law-Following AI 1: Sequence Introduction and Structure
Cullen
Apr 27, 2022, 5:26 PM
18
points
10
comments
9
min read
LW
link
[Intro to brain-like-AGI safety] 13. Symbol grounding & human social instincts
Steven Byrnes
Apr 27, 2022, 1:30 PM
73
points
15
comments
15
min read
LW
link
The case for turning glowfic into Sequences
Thomas Kwa
Apr 27, 2022, 6:58 AM
87
points
29
comments
5
min read
LW
link
[Link] Evidence of Fabricated Data in a Vitamin C trial by Paul E Marik et al in CHEST
Kenny
Apr 27, 2022, 6:48 AM
6
points
1
comment
1
min read
LW
link
SERI ML Alignment Theory Scholars Program 2022
Ryan Kidd, Victor Warlop and ozhang
Apr 27, 2022, 12:43 AM
67
points
6
comments
3
min read
LW
link
EU Maximizing in a Gloomy World
David Udell
Apr 27, 2022, 12:28 AM
6
points
2
comments
1
min read
LW
link
Why Copilot Accelerates Timelines
Michaël Trazzi
Apr 26, 2022, 10:06 PM
35
points
14
comments
7
min read
LW
link
Universals of Morality: Toward Human-Centric Communication Platforms
scafaria
Apr 26, 2022, 9:15 PM
−3
points
3
comments
5
min read
LW
link
(scafaria.com)
[$20K in Prizes] AI Safety Arguments Competition
Dan H, Kevin Liu, ozhang, TW123 and Sidney Hough
Apr 26, 2022, 4:13 PM
75
points
518
comments
3
min read
LW
link
Continental Philosophy as Undergraduate Mathematics
Jan
Apr 26, 2022, 8:05 AM
17
points
3
comments
9
min read
LW
link
(universalprior.substack.com)
dalle2 comments
nostalgebraist
Apr 26, 2022, 5:30 AM
183
points
14
comments
13
min read
LW
link
(nostalgebraist.tumblr.com)
Make a neural network in ~10 minutes
Arjun Yadav
Apr 26, 2022, 5:24 AM
8
points
0
comments
4
min read
LW
link
(arjunyadav.net)
Framings of Deceptive Alignment
peterbarnett
Apr 26, 2022, 4:25 AM
32
points
7
comments
5
min read
LW
link
Why pessimism sounds smart
jasoncrawford
Apr 25, 2022, 8:10 PM
76
points
15
comments
1
min read
LW
link
(rootsofprogress.org)
[Question]
What is being improved in recursive self improvement?
Lone Pine
Apr 25, 2022, 6:30 PM
7
points
6
comments
1
min read
LW
link
21 on 21
Amir Bolous
Apr 25, 2022, 6:22 PM
43
points
5
comments
4
min read
LW
link
[Question]
Rationalist Inspired Coming-of-age Rituals
iceplant
Apr 25, 2022, 5:22 PM
10
points
3
comments
1
min read
LW
link
[Request for Distillation] Coherence of Distributed Decisions With Different Inputs Implies Conditioning
johnswentworth
Apr 25, 2022, 5:01 PM
22
points
14
comments
2
min read
LW
link
[Question]
Quadratic voting with automatic collusion?
SarahNibs
Apr 25, 2022, 4:15 PM
10
points
5
comments
1
min read
LW
link
Intuitions about solving hard problems
Richard_Ngo
Apr 25, 2022, 3:29 PM
106
points
23
comments
6
min read
LW
link
Ukraine Post #11: Longer Term Predictions
Zvi
Apr 25, 2022, 2:10 PM
32
points
6
comments
11
min read
LW
link
(thezvi.wordpress.com)
Key questions about artificial sentience: an opinionated guide
Robbo
Apr 25, 2022, 12:09 PM
51
points
31
comments
18
min read
LW
link
On Tables and Happiness
Alexander
Apr 25, 2022, 9:51 AM
25
points
0
comments
2
min read
LW
link
Why I’m Not a Utilitarian in Modern America
DanB
Apr 24, 2022, 9:43 PM
5
points
5
comments
8
min read
LW
link
Examining Evolution as an Upper Bound for AGI Timelines
meanderingmoose
Apr 24, 2022, 7:08 PM
6
points
1
comment
9
min read
LW
link
Athens, Greece – ACX Spring Meetups 2022
Elias
Apr 24, 2022, 6:06 PM
1
point
1
comment
1
min read
LW
link
AI safety raising awareness resources bleg
iivonen
Apr 24, 2022, 5:13 PM
6
points
0
comments
1
min read
LW
link
[Question]
Anyone Familiar with Ground News?
jmh
Apr 24, 2022, 12:46 PM
2
points
2
comments
1
min read
LW
link
[Question]
Where can I publish an article containing a list of intellectuals who publicly admitted their mistakes once proven wrong?
Hashem ElAssad
Apr 24, 2022, 9:00 AM
0
points
1
comment
1
min read
LW
link
What Is a Major Chord?
jefftk
Apr 24, 2022, 7:20 AM
59
points
11
comments
3
min read
LW
link
(www.jefftk.com)
Slack gives you space to notice/reflect on subtle things
Raemon
Apr 24, 2022, 2:30 AM
158
points
18
comments
1
min read
LW
link
Calling for Student Submissions: AI Safety Distillation Contest
Aris
Apr 24, 2022, 1:53 AM
48
points
15
comments
4
min read
LW
link
Rationality Dojo
lsusr
Apr 24, 2022, 12:53 AM
14
points
5
comments
1
min read
LW
link
[Question]
Deletion
011eNigma235
Apr 23, 2022, 11:01 PM
1
point
1
comment
1
min read
LW
link
Cape Town ACX meetup
Jordan Pieters
Apr 23, 2022, 11:00 PM
1
point
0
comments
1
min read
LW
link
Re: So You Want to Be a Dharma Teacher
lsusr
23 Apr 2022 22:31 UTC
30
points
4
comments
2
min read
LW
link
(hardcorezen.info)
Ineffective Altruism
lsusr
23 Apr 2022 22:07 UTC
86
points
17
comments
1
min read
LW
link
[Question]
Has anyone written a reductionist theory of creativity?
Grant Demaree
23 Apr 2022 22:05 UTC
4
points
3
comments
1
min read
LW
link
Progress Report 5: tying it together
Nathan Helm-Burger
23 Apr 2022 21:07 UTC
10
points
0
comments
2
min read
LW
link
The New Right appears to be on the rise for better or worse
Chris_Leong
23 Apr 2022 19:36 UTC
6
points
18
comments
1
min read
LW
link
[ASoT] Consequentialist models as a superset of mesaoptimizers
leogao
23 Apr 2022 17:57 UTC
38
points
2
comments
4
min read
LW
link
Report likelihood ratios
Ege Erdil
23 Apr 2022 17:10 UTC
80
points
9
comments
7
min read
LW
link