Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
LW/ACX/EA Seattle summer meetup
nsokolsky
Jun 24, 2022, 11:30 PM
4
points
2
comments
1
min read
LW
link
Dependencies for AGI pessimism
Yitz
Jun 24, 2022, 10:25 PM
7
points
4
comments
1
min read
LW
link
[Link] Childcare : what the science says
Gunnar_Zarncke
Jun 24, 2022, 9:45 PM
46
points
4
comments
1
min read
LW
link
(criticalscience.medium.com)
What if the best path for a person who wants to work on AGI alignment is to join Facebook or Google?
dbasch
Jun 24, 2022, 9:23 PM
2
points
3
comments
1
min read
LW
link
[Link] Adversarially trained neural representations may already be as robust as corresponding biological neural representations
Gunnar_Zarncke
Jun 24, 2022, 8:51 PM
35
points
9
comments
1
min read
LW
link
Updated Deference is not a strong argument against the utility uncertainty approach to alignment
Ivan Vendrov
Jun 24, 2022, 7:32 PM
26
points
8
comments
4
min read
LW
link
Cracks in the Wall, Part I: The Conscious
silo
Jun 24, 2022, 6:29 PM
−3
points
28
comments
12
min read
LW
link
(stephenfoster.substack.com)
[Question]
Do alignment concerns extend to powerful non-AI agents?
Ozyrus
Jun 24, 2022, 6:26 PM
21
points
13
comments
1
min read
LW
link
Raphaël Millière on Generalization and Scaling Maximalism
Michaël Trazzi
Jun 24, 2022, 6:18 PM
21
points
2
comments
4
min read
LW
link
(theinsideview.ai)
Worked Examples of Shapley Values
lalaithion
Jun 24, 2022, 5:13 PM
75
points
11
comments
8
min read
LW
link
Feature request: voting buttons at the bottom?
Oliver Sourbut
Jun 24, 2022, 2:41 PM
71
points
12
comments
1
min read
LW
link
Intelligence in Commitment Races
David Udell
Jun 24, 2022, 2:30 PM
28
points
8
comments
5
min read
LW
link
Linkpost: Robin Hanson—Why Not Wait On AI Risk?
Yair Halberstadt
Jun 24, 2022, 2:23 PM
41
points
14
comments
1
min read
LW
link
(www.overcomingbias.com)
[Question]
“Science Cathedrals”
Alex Vermillion
Jun 24, 2022, 3:30 AM
22
points
9
comments
1
min read
LW
link
LessWrong Has Agree/Disagree Voting On All New Comment Threads
Ben Pace
Jun 24, 2022, 12:43 AM
154
points
219
comments
2
min read
LW
link
1
review
Book review: The Passenger by Lisa Lutz
KatjaGrace
Jun 23, 2022, 11:10 PM
12
points
1
comment
1
min read
LW
link
(worldspiritsockpuppet.com)
20 Critiques of AI Safety That I Found on Twitter
dkirmani
Jun 23, 2022, 7:23 PM
21
points
16
comments
1
min read
LW
link
The Limits of Automation
milkandcigarettes
Jun 23, 2022, 6:03 PM
5
points
1
comment
5
min read
LW
link
(milkandcigarettes.com)
[Question]
Is CIRL a promising agenda?
Chris_Leong
Jun 23, 2022, 5:12 PM
28
points
16
comments
1
min read
LW
link
[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)
Aryeh Englander
Jun 23, 2022, 4:29 PM
53
points
3
comments
1
min read
LW
link
Half-baked AI Safety ideas thread
Aryeh Englander
Jun 23, 2022, 4:11 PM
64
points
63
comments
1
min read
LW
link
Nonprofit Boards are Weird
HoldenKarnofsky
Jun 23, 2022, 2:40 PM
156
points
26
comments
20
min read
LW
link
1
review
(www.cold-takes.com)
Covid 6/23/22: Under Five Alive
Zvi
Jun 23, 2022, 2:00 PM
26
points
9
comments
10
min read
LW
link
(thezvi.wordpress.com)
How do states respond to changes in nuclear risk
NathanBarnard
Jun 23, 2022, 12:42 PM
8
points
2
comments
5
min read
LW
link
[Question]
What’s the contingency plan if we get AGI tomorrow?
Yitz
Jun 23, 2022, 3:10 AM
61
points
23
comments
1
min read
LW
link
[Question]
What are the best “policy” approaches in worlds where alignment is difficult?
LHA
Jun 23, 2022, 1:53 AM
1
point
0
comments
1
min read
LW
link
AI Training Should Allow Opt-Out
alyssavance
Jun 23, 2022, 1:33 AM
76
points
13
comments
6
min read
LW
link
Loose thoughts on AGI risk
Yitz
Jun 23, 2022, 1:02 AM
7
points
3
comments
1
min read
LW
link
Air Conditioner Test Results & Discussion
johnswentworth
Jun 22, 2022, 10:26 PM
82
points
42
comments
6
min read
LW
link
Announcing the LessWrong Curated Podcast
Ben Pace
and
Solenoid_Entity
Jun 22, 2022, 10:16 PM
137
points
27
comments
1
min read
LW
link
Google’s new text-to-image model—Parti, a demonstration of scaling benefits
Kayden
Jun 22, 2022, 8:00 PM
32
points
4
comments
1
min read
LW
link
Building an Epistemic Status Tracker
rcu
Jun 22, 2022, 6:57 PM
7
points
8
comments
1
min read
LW
link
Confusion about neuroscience/cognitive science as a danger for AI Alignment
Samuel Nellessen
Jun 22, 2022, 5:59 PM
3
points
1
comment
3
min read
LW
link
(snellessen.com)
[Question]
How do I use caffeine optimally?
randomstring
Jun 22, 2022, 5:59 PM
18
points
31
comments
1
min read
LW
link
Make learning a reality
Dalton Mabery
Jun 22, 2022, 3:58 PM
13
points
2
comments
1
min read
LW
link
Reflection Mechanisms as an Alignment target: A survey
Marius Hobbhahn
,
elandgre
and
Beth Barnes
Jun 22, 2022, 3:05 PM
32
points
1
comment
14
min read
LW
link
House Phone
jefftk
Jun 22, 2022, 2:20 PM
15
points
2
comments
1
min read
LW
link
(www.jefftk.com)
How to Visualize Bayesianism
David Udell
Jun 22, 2022, 1:57 PM
9
points
2
comments
3
min read
LW
link
[Question]
Are there spaces for extremely short-form rationality content?
Aleksi Liimatainen
Jun 22, 2022, 10:39 AM
5
points
1
comment
1
min read
LW
link
Solstice Movie Review: Summer Wars
SebastianG
Jun 22, 2022, 1:09 AM
22
points
6
comments
1
min read
LW
link
Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment
elspood
Jun 21, 2022, 11:55 PM
362
points
42
comments
7
min read
LW
link
1
review
A Quick List of Some Problems in AI Alignment As A Field
Nicholas / Heather Kross
Jun 21, 2022, 11:23 PM
75
points
12
comments
6
min read
LW
link
(www.thinkingmuchbetter.com)
[Question]
What is the difference between AI misalignment and bad programming?
puzzleGuzzle
Jun 21, 2022, 9:52 PM
6
points
2
comments
1
min read
LW
link
What I mean by the phrase “getting intimate with reality”
Luise
21 Jun 2022 19:42 UTC
6
points
0
comments
2
min read
LW
link
(forum.effectivealtruism.org)
What I mean by the phrase “taking ideas seriously”
Luise
21 Jun 2022 19:42 UTC
5
points
2
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Hydrophobic Glasses Coating Review
jefftk
21 Jun 2022 18:00 UTC
16
points
6
comments
1
min read
LW
link
(www.jefftk.com)
Progress links and tweets, 2022-06-20
jasoncrawford
21 Jun 2022 17:12 UTC
12
points
2
comments
1
min read
LW
link
(rootsofprogress.org)
Debating Whether AI is Conscious Is A Distraction from Real Problems
sidhe_they
21 Jun 2022 16:56 UTC
2
points
10
comments
1
min read
LW
link
(techpolicy.press)
Mitigating the damage from unaligned ASI by cooperating with aliens that don’t exist yet
MSRayne
21 Jun 2022 16:12 UTC
−8
points
7
comments
6
min read
LW
link
The inordinately slow spread of good AGI conversations in ML
Rob Bensinger
21 Jun 2022 16:09 UTC
173
points
62
comments
8
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel