Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
A Library and Tutorial for Factored Cognition with Language Models
stuhlmueller
,
justin_dan
and
goodgravy
Sep 28, 2022, 6:15 PM
47
points
0
comments
1
min read
LW
link
Reward IS the Optimization Target
Carn
Sep 28, 2022, 5:59 PM
−2
points
3
comments
5
min read
LW
link
AI Safety Endgame Stories
Ivan Vendrov
Sep 28, 2022, 4:58 PM
31
points
11
comments
11
min read
LW
link
Will Values and Competition Decouple?
interstice
Sep 28, 2022, 4:27 PM
15
points
11
comments
17
min read
LW
link
Georgism in Space
harsimony
Sep 28, 2022, 4:05 PM
42
points
12
comments
4
min read
LW
link
(harsimony.wordpress.com)
QAPR 3: interpretability-guided training of neural nets
Quintin Pope
Sep 28, 2022, 4:02 PM
58
points
2
comments
10
min read
LW
link
Strange Loops—Self-Reference from Number Theory to AI
ojorgensen
Sep 28, 2022, 2:10 PM
19
points
6
comments
18
min read
LW
link
Why I think strong general AI is coming soon
porby
Sep 28, 2022, 5:40 AM
337
points
141
comments
34
min read
LW
link
1
review
About Q Home
Q Home
Sep 28, 2022, 4:56 AM
11
points
4
comments
1
min read
LW
link
[Linkpost] “Intensity and frequency of extreme novel epidemics” by Mariani et al. (2021)
T431
Sep 28, 2022, 3:31 AM
10
points
0
comments
LW
link
Threat-Resistant Bargaining Megapost: Introducing the ROSE Value
Diffractor
Sep 28, 2022, 1:20 AM
162
points
19
comments
53
min read
LW
link
2
reviews
7 traps that (we think) new alignment researchers often fall into
Orpheus16
and
Thomas Larsen
Sep 27, 2022, 11:13 PM
176
points
10
comments
4
min read
LW
link
Failure modes in a shard theory alignment plan
Thomas Kwa
Sep 27, 2022, 10:34 PM
26
points
2
comments
7
min read
LW
link
[Question]
Is a PhD necessary to contribute meaningfully to a field?
TrudosKudos
Sep 27, 2022, 9:27 PM
4
points
7
comments
1
min read
LW
link
Why we’re not founding a human-data-for-alignment org
L Rudolf L
and
Matt Putz
Sep 27, 2022, 8:14 PM
88
points
6
comments
29
min read
LW
link
(forum.effectivealtruism.org)
A Poorly Planned Loft Bed
jefftk
Sep 27, 2022, 5:50 PM
9
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Wise Crowd & Democratic Spirit
Hristo Zaykov
Sep 27, 2022, 5:45 PM
1
point
0
comments
2
min read
LW
link
(www.hristo.blog)
Soft skills for meetups
mingyuan
Sep 27, 2022, 5:26 PM
49
points
3
comments
5
min read
LW
link
[Question]
Enriching Youtube content recommendations
Martín Soto
Sep 27, 2022, 4:54 PM
8
points
4
comments
1
min read
LW
link
The Onion Test for Personal and Institutional Honesty
chanamessinger
and
Andrew_Critch
Sep 27, 2022, 3:26 PM
163
points
31
comments
3
min read
LW
link
3
reviews
Book review: “The Heart of the Brain: The Hypothalamus and Its Hormones”
Steven Byrnes
Sep 27, 2022, 1:20 PM
65
points
3
comments
18
min read
LW
link
My Thoughts on the ML Safety Course
zeshen
Sep 27, 2022, 1:15 PM
50
points
3
comments
17
min read
LW
link
Summary of ML Safety Course
zeshen
Sep 27, 2022, 1:05 PM
7
points
0
comments
6
min read
LW
link
Probabilistic reasoning for description and experience
Q Home
Sep 27, 2022, 10:57 AM
0
points
0
comments
26
min read
LW
link
A Prince, a Pauper, Power, Panama
Alok Singh
Sep 27, 2022, 7:10 AM
10
points
0
comments
1
min read
LW
link
(alok.github.io)
Double Asteroid Redirection Test succeeds
sanxiyn
Sep 27, 2022, 6:37 AM
19
points
5
comments
1
min read
LW
link
(twitter.com)
[Question]
How would I know if a PhD is the right career path?
Bob Guran
Sep 27, 2022, 5:49 AM
4
points
4
comments
1
min read
LW
link
Review of Examine.com’s vitamin write-ups
Elizabeth
and
Martin Bernstorff
Sep 26, 2022, 11:40 PM
60
points
1
comment
5
min read
LW
link
(acesounderglass.com)
D&D.Sci September 2022 Evaluation and Ruleset
abstractapplic
Sep 26, 2022, 10:19 PM
30
points
5
comments
3
min read
LW
link
[MLSN #5]: Prize Compilation
Dan H
Sep 26, 2022, 9:55 PM
15
points
1
comment
2
min read
LW
link
Loss of Alignment is not the High-Order Bit for AI Risk
yieldthought
Sep 26, 2022, 9:16 PM
14
points
18
comments
2
min read
LW
link
Inverse Scaling Prize: Round 1 Winners
Ethan Perez
and
Ian McKenzie
Sep 26, 2022, 7:57 PM
93
points
16
comments
4
min read
LW
link
(irmckenzie.co.uk)
[Question]
Does the existence of shared human values imply alignment is “easy”?
Morpheus
Sep 26, 2022, 6:01 PM
7
points
15
comments
1
min read
LW
link
Meetup: Madison, WI (Oct 8)
svfritz
Sep 26, 2022, 5:55 PM
1
point
0
comments
1
min read
LW
link
Ambiguity in Prediction Market Resolution is Harmful
aphyer
Sep 26, 2022, 4:22 PM
69
points
17
comments
5
min read
LW
link
Framery Phone Booth CO2 Accumulation
jefftk
26 Sep 2022 16:10 UTC
25
points
0
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
How can I remove the launch button from my LW home page?
sudo
26 Sep 2022 15:15 UTC
8
points
4
comments
1
min read
LW
link
Brief Notes on Transformers
Adam Jermyn
26 Sep 2022 14:46 UTC
48
points
3
comments
2
min read
LW
link
You are Underestimating The Likelihood That Convergent Instrumental Subgoals Lead to Aligned AGI
Mark Neyer
26 Sep 2022 14:22 UTC
3
points
6
comments
3
min read
LW
link
Climate-contingent Finance, and A Generalized Mechanism for X-Risk Reduction Financing
John Nay
26 Sep 2022 13:23 UTC
0
points
2
comments
LW
link
Self-Control Secrets of the Puritan Masters
David Hugh-Jones
26 Sep 2022 9:04 UTC
67
points
3
comments
5
min read
LW
link
(wyclif.substack.com)
How I buy things when Lightcone wants them fast
Bird Concept
26 Sep 2022 5:02 UTC
224
points
21
comments
8
min read
LW
link
Oren’s Field Guide of Bad AGI Outcomes
Eris Discordia
26 Sep 2022 4:06 UTC
0
points
0
comments
1
min read
LW
link
On Generality
Eris Discordia
26 Sep 2022 4:06 UTC
2
points
0
comments
5
min read
LW
link
Planning a Loft Bed
jefftk
26 Sep 2022 0:10 UTC
15
points
15
comments
2
min read
LW
link
(www.jefftk.com)
Becoming Black Boxish
vitaliya
25 Sep 2022 23:35 UTC
16
points
0
comments
2
min read
LW
link
Announcing Balsa Research
Zvi
25 Sep 2022 22:50 UTC
235
points
64
comments
2
min read
LW
link
1
review
(thezvi.wordpress.com)
[Question]
How to learn: Struggle VS Lookup-Table?
Nicholas / Heather Kross
25 Sep 2022 21:58 UTC
16
points
2
comments
2
min read
LW
link
An Unexpected GPT-3 Decision in a Simple Gamble
casualphysicsenjoyer
25 Sep 2022 16:46 UTC
8
points
4
comments
1
min read
LW
link
“Agency” needs nuance
Evie Cottrell
25 Sep 2022 7:40 UTC
23
points
1
comment
14
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel