Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Robin Hanson asks “Why Not Wait On AI Risk?”
Gunnar_Zarncke
Jun 26, 2022, 11:32 PM
22
points
4
comments
1
min read
LW
link
(www.overcomingbias.com)
Sex Fairy Lore
pchvykov
Jun 26, 2022, 8:42 PM
−25
points
10
comments
6
min read
LW
link
King David’s %: Establishing a new symbol for Bayesian probability.
Paul Logan
Jun 26, 2022, 7:47 PM
−11
points
1
comment
5
min read
LW
link
(laulpogan.substack.com)
Training Trace Priors and Speed Priors
Adam Jermyn
Jun 26, 2022, 6:07 PM
17
points
0
comments
3
min read
LW
link
My current take on Internal Family Systems “parts”
Kaj_Sotala
Jun 26, 2022, 5:40 PM
97
points
11
comments
3
min read
LW
link
(kajsotala.fi)
A Quick Ontology of Agreement
ravedon
Jun 26, 2022, 5:39 PM
5
points
2
comments
2
min read
LW
link
Seven ways to become unstoppably agentic
Evie Cottrell
Jun 26, 2022, 5:39 PM
64
points
16
comments
8
min read
LW
link
Formalizing Deception
JamesH
Jun 26, 2022, 5:39 PM
14
points
2
comments
5
min read
LW
link
Dust Theory vs Ruliad
svemirski
Jun 26, 2022, 4:08 PM
3
points
0
comments
1
min read
LW
link
My cognitive inertia cycle
MSRayne
Jun 26, 2022, 3:49 PM
7
points
4
comments
4
min read
LW
link
How do poor countries get rich: some theories
NathanBarnard
Jun 26, 2022, 10:41 AM
8
points
2
comments
10
min read
LW
link
Child Contracting
jefftk
Jun 26, 2022, 2:30 AM
48
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Conditioning Generative Models
Adam Jermyn
Jun 25, 2022, 10:15 PM
24
points
18
comments
10
min read
LW
link
Unforgivable
Novalis
Jun 25, 2022, 8:57 PM
−9
points
12
comments
5
min read
LW
link
(novalis.blog)
SunPJ in Alenia
FlorianH
Jun 25, 2022, 7:39 PM
9
points
19
comments
8
min read
LW
link
(plausiblestuff.com)
[Question]
Should any human enslave an AGI system?
AlignmentMirror
Jun 25, 2022, 7:35 PM
−13
points
44
comments
1
min read
LW
link
Fundamental Uncertainty: Chapter 3 - Why don’t we agree on what’s right?
Gordon Seidoh Worley
Jun 25, 2022, 5:50 PM
27
points
22
comments
14
min read
LW
link
[Question]
How “should” counterfactual prediction markets work?
eapi
Jun 25, 2022, 5:44 PM
9
points
6
comments
1
min read
LW
link
Conversation with Eliezer: What do you want the system to do?
Orpheus16
Jun 25, 2022, 5:36 PM
114
points
38
comments
2
min read
LW
link
AI-Written Critiques Help Humans Notice Flaws
paulfchristiano
Jun 25, 2022, 5:22 PM
137
points
5
comments
3
min read
LW
link
(openai.com)
Some reflections on the LW community after several months of active engagement
M. Y. Zuo
Jun 25, 2022, 5:04 PM
72
points
40
comments
4
min read
LW
link
On The Spectrum, On The Guest List: (vii) The Marquee
party girl
Jun 25, 2022, 4:54 PM
5
points
0
comments
19
min read
LW
link
(onthespectrumontheguestlist.substack.com)
Identification of Natural Modularity
Stephen Fowler
Jun 25, 2022, 3:05 PM
15
points
3
comments
7
min read
LW
link
[LQ] Some Thoughts on Messaging Around AI Risk
DragonGod
Jun 25, 2022, 1:53 PM
5
points
3
comments
6
min read
LW
link
Quick Summaries of Two Papers on Kant and Game Theory
Erich_Grunewald
Jun 25, 2022, 10:25 AM
8
points
2
comments
4
min read
LW
link
(www.erichgrunewald.com)
[Question]
Do you consider your current, non-superhuman self aligned with “humanity” already?
Rana Dexsin
Jun 25, 2022, 4:15 AM
10
points
19
comments
1
min read
LW
link
LW/ACX/EA Seattle summer meetup
nsokolsky
Jun 24, 2022, 11:30 PM
4
points
2
comments
1
min read
LW
link
Dependencies for AGI pessimism
Yitz
Jun 24, 2022, 10:25 PM
7
points
4
comments
1
min read
LW
link
[Link] Childcare : what the science says
Gunnar_Zarncke
Jun 24, 2022, 9:45 PM
46
points
4
comments
1
min read
LW
link
(criticalscience.medium.com)
What if the best path for a person who wants to work on AGI alignment is to join Facebook or Google?
dbasch
Jun 24, 2022, 9:23 PM
2
points
3
comments
1
min read
LW
link
[Link] Adversarially trained neural representations may already be as robust as corresponding biological neural representations
Gunnar_Zarncke
Jun 24, 2022, 8:51 PM
35
points
9
comments
1
min read
LW
link
Updated Deference is not a strong argument against the utility uncertainty approach to alignment
Ivan Vendrov
Jun 24, 2022, 7:32 PM
26
points
8
comments
4
min read
LW
link
Cracks in the Wall, Part I: The Conscious
silo
Jun 24, 2022, 6:29 PM
−3
points
28
comments
12
min read
LW
link
(stephenfoster.substack.com)
[Question]
Do alignment concerns extend to powerful non-AI agents?
Ozyrus
Jun 24, 2022, 6:26 PM
21
points
13
comments
1
min read
LW
link
Raphaël Millière on Generalization and Scaling Maximalism
Michaël Trazzi
Jun 24, 2022, 6:18 PM
21
points
2
comments
4
min read
LW
link
(theinsideview.ai)
Worked Examples of Shapley Values
lalaithion
Jun 24, 2022, 5:13 PM
75
points
11
comments
8
min read
LW
link
Feature request: voting buttons at the bottom?
Oliver Sourbut
Jun 24, 2022, 2:41 PM
71
points
12
comments
1
min read
LW
link
Intelligence in Commitment Races
David Udell
Jun 24, 2022, 2:30 PM
28
points
8
comments
5
min read
LW
link
Linkpost: Robin Hanson—Why Not Wait On AI Risk?
Yair Halberstadt
Jun 24, 2022, 2:23 PM
41
points
14
comments
1
min read
LW
link
(www.overcomingbias.com)
[Question]
“Science Cathedrals”
Alex Vermillion
Jun 24, 2022, 3:30 AM
22
points
9
comments
1
min read
LW
link
LessWrong Has Agree/Disagree Voting On All New Comment Threads
Ben Pace
Jun 24, 2022, 12:43 AM
154
points
219
comments
2
min read
LW
link
1
review
Book review: The Passenger by Lisa Lutz
KatjaGrace
Jun 23, 2022, 11:10 PM
12
points
1
comment
1
min read
LW
link
(worldspiritsockpuppet.com)
20 Critiques of AI Safety That I Found on Twitter
dkirmani
Jun 23, 2022, 7:23 PM
21
points
16
comments
1
min read
LW
link
The Limits of Automation
milkandcigarettes
Jun 23, 2022, 6:03 PM
5
points
1
comment
5
min read
LW
link
(milkandcigarettes.com)
[Question]
Is CIRL a promising agenda?
Chris_Leong
Jun 23, 2022, 5:12 PM
28
points
16
comments
1
min read
LW
link
[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)
Aryeh Englander
Jun 23, 2022, 4:29 PM
53
points
3
comments
1
min read
LW
link
Half-baked AI Safety ideas thread
Aryeh Englander
Jun 23, 2022, 4:11 PM
64
points
63
comments
1
min read
LW
link
Nonprofit Boards are Weird
HoldenKarnofsky
Jun 23, 2022, 2:40 PM
156
points
26
comments
20
min read
LW
link
1
review
(www.cold-takes.com)
Covid 6/23/22: Under Five Alive
Zvi
Jun 23, 2022, 2:00 PM
26
points
9
comments
10
min read
LW
link
(thezvi.wordpress.com)
How do states respond to changes in nuclear risk
NathanBarnard
Jun 23, 2022, 12:42 PM
8
points
2
comments
5
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel