Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Bureaucracy of AIs
Logan Zoellner
Jun 9, 2022, 11:03 PM
17
points
6
comments
14
min read
LW
link
You Only Get One Shot: an Intuition Pump for Embedded Agency
Oliver Sourbut
Jun 9, 2022, 9:38 PM
24
points
4
comments
2
min read
LW
link
[Question]
Forestalling Atmospheric Ignition
Lone Pine
Jun 9, 2022, 8:49 PM
11
points
9
comments
1
min read
LW
link
How Do Selection Theorems Relate To Interpretability?
johnswentworth
Jun 9, 2022, 7:39 PM
60
points
14
comments
3
min read
LW
link
Progress links and tweets, 2022-06-08
jasoncrawford
Jun 9, 2022, 7:13 PM
11
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
If no near-term alignment strategy, research should aim for the long-term
harsimony
Jun 9, 2022, 7:10 PM
7
points
1
comment
1
min read
LW
link
Operationalizing two tasks in Gary Marcus’s AGI challenge
Bill Benzon
Jun 9, 2022, 6:31 PM
12
points
3
comments
8
min read
LW
link
Why it’s bad to kill Grandma
dynomight
Jun 9, 2022, 6:12 PM
29
points
14
comments
8
min read
LW
link
(dynomight.substack.com)
[Question]
Modeling humanity’s robustness to GCRs?
T431
Jun 9, 2022, 5:34 PM
2
points
2
comments
2
min read
LW
link
[Question]
If there was a millennium equivalent prize for AI alignment, what would the problems be?
Yair Halberstadt
Jun 9, 2022, 4:56 PM
17
points
4
comments
1
min read
LW
link
Book Review: How the World Became Rich
Davis Kedrosky
Jun 9, 2022, 4:55 PM
14
points
0
comments
10
min read
LW
link
(daviskedrosky.substack.com)
Covid 6/9/22: Nice
Zvi
Jun 9, 2022, 4:30 PM
26
points
2
comments
12
min read
LW
link
(thezvi.wordpress.com)
Website For Yoda Timers
Adam Zerner
Jun 9, 2022, 4:28 PM
16
points
1
comment
1
min read
LW
link
AI Could Defeat All Of Us Combined
HoldenKarnofsky
Jun 9, 2022, 3:50 PM
170
points
42
comments
17
min read
LW
link
(www.cold-takes.com)
The “mind-body vicious cycle” model of RSI & back pain
Steven Byrnes
Jun 9, 2022, 12:30 PM
91
points
32
comments
12
min read
LW
link
[Linkpost & Discussion] AI Trained on 4Chan Becomes ‘Hate Speech Machine’ [and outperforms GPT-3 on TruthfulQA Benchmark?!]
Yitz
Jun 9, 2022, 10:59 AM
16
points
5
comments
2
min read
LW
link
(www.vice.com)
Comment reply: my low-quality thoughts on why CFAR didn’t get farther with a “real/efficacious art of rationality”
AnnaSalamon
Jun 9, 2022, 2:12 AM
263
points
63
comments
17
min read
LW
link
1
review
Today in AI Risk History: The Terminator (1984 film) was released.
Impassionata
Jun 9, 2022, 1:32 AM
−3
points
6
comments
1
min read
LW
link
There’s probably a tradeoff between AI capability and safety, and we should act like it
David Johnston
Jun 9, 2022, 12:17 AM
3
points
3
comments
1
min read
LW
link
[Question]
Has anyone actually tried to convince Terry Tao or other top mathematicians to work on alignment?
P.
Jun 8, 2022, 10:26 PM
64
points
51
comments
4
min read
LW
link
Entitlement as a major amplifier of unhappiness
VipulNaik
Jun 8, 2022, 10:08 PM
29
points
6
comments
7
min read
LW
link
[Question]
Silly Online Rules
Gunnar_Zarncke
Jun 8, 2022, 8:40 PM
8
points
12
comments
1
min read
LW
link
Untypical SIA
avturchin
Jun 8, 2022, 2:23 PM
5
points
3
comments
2
min read
LW
link
Eliciting Latent Knowledge (ELK) - Distillation/Summary
Marius Hobbhahn
Jun 8, 2022, 1:18 PM
69
points
2
comments
21
min read
LW
link
Research Questions from Stained Glass Windows
StefanHex
Jun 8, 2022, 12:38 PM
4
points
0
comments
2
min read
LW
link
[Question]
Steelmanning Marxism/Communism
Suh_Prance_Alot
Jun 8, 2022, 10:05 AM
6
points
9
comments
1
min read
LW
link
Staying Split: Sabatini and Social Justice
Duncan Sabien (Inactive)
Jun 8, 2022, 8:32 AM
153
points
28
comments
21
min read
LW
link
Less Wrong / ACX Budapest June 11th Meetup
Richard Horvath
Jun 8, 2022, 5:16 AM
2
points
0
comments
1
min read
LW
link
Puddle Temperature Alarm
jefftk
Jun 8, 2022, 2:10 AM
13
points
1
comment
1
min read
LW
link
(www.jefftk.com)
Why I don’t believe in doom
mukashi
Jun 7, 2022, 11:49 PM
6
points
30
comments
4
min read
LW
link
“Pivotal Acts” means something specific
Raemon
Jun 7, 2022, 9:56 PM
127
points
23
comments
2
min read
LW
link
Embodiment is Indispensable for AGI
P. G. Keerthana Gopalakrishnan
Jun 7, 2022, 9:31 PM
6
points
1
comment
6
min read
LW
link
(keerthanapg.com)
Stephen Wolfram’s ideas are under-appreciated
Kenny
Jun 7, 2022, 8:09 PM
20
points
52
comments
1
min read
LW
link
Who models the models that model models? An exploration of GPT-3′s in-context model fitting ability
Lovre
Jun 7, 2022, 7:37 PM
112
points
16
comments
9
min read
LW
link
[Question]
How Does Cognitive Performance Translate to Real World Capability?
DragonGod
Jun 7, 2022, 5:39 PM
5
points
6
comments
1
min read
LW
link
On The Spectrum, On The Guest List: (iv) Silencio
party girl
Jun 7, 2022, 3:39 PM
5
points
2
comments
2
min read
LW
link
(onthespectrumontheguestlist.substack.com)
[Question]
Confused Thoughts on AI Afterlife (seriously)
Epirito
Jun 7, 2022, 2:37 PM
−4
points
6
comments
1
min read
LW
link
Thinking about Broad Classes of Utility-like Functions
J Bostock
Jun 7, 2022, 2:05 PM
7
points
0
comments
4
min read
LW
link
Thoughts on Formalizing Composition
Tom Lieberum
Jun 7, 2022, 7:51 AM
13
points
0
comments
7
min read
LW
link
AGI Safety FAQ / all-dumb-questions-allowed thread
Aryeh Englander
Jun 7, 2022, 5:47 AM
227
points
526
comments
4
min read
LW
link
Pitching an Alignment Softball
mu_(negative)
Jun 7, 2022, 4:10 AM
47
points
13
comments
10
min read
LW
link
We will be around in 30 years
mukashi
Jun 7, 2022, 3:47 AM
12
points
205
comments
2
min read
LW
link
Where to Live for Happiness
nomagicpill
Jun 7, 2022, 1:09 AM
19
points
6
comments
26
min read
LW
link
(210ethan.github.io)
A descriptive, not prescriptive, overview of current AI Alignment Research
Jan
,
Logan Riggs
,
jacquesthibs
and
janus
Jun 6, 2022, 9:59 PM
139
points
21
comments
7
min read
LW
link
Rationalist Meetup—Lyngby Denmark 2022-07-03
Carl Dybdahl
Jun 6, 2022, 8:35 PM
6
points
1
comment
1
min read
LW
link
Microphone on Electric Mandolin II
jefftk
Jun 6, 2022, 8:30 PM
9
points
0
comments
2
min read
LW
link
(www.jefftk.com)
We haven’t quit evolution [short]
the gears to ascension
Jun 6, 2022, 7:07 PM
5
points
3
comments
2
min read
LW
link
Grokking “Forecasting TAI with biological anchors”
anson.ho
Jun 6, 2022, 6:58 PM
38
points
0
comments
14
min read
LW
link
DALL-E 2 - Unofficial Natural Language Image Editing, Art Critique Survey
bakztfuture
Jun 6, 2022, 6:27 PM
0
points
0
comments
1
min read
LW
link
(bakztfuture.substack.com)
We Should Break Up Elite Colleges
sahajsharda
Jun 6, 2022, 6:15 PM
−1
points
0
comments
1
min read
LW
link
(sahajsharda.substack.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel