Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Everyone is an Imposter
Tharin
Jul 13, 2022, 8:46 AM
19
points
1
comment
9
min read
LW
link
(echoesandchimes.com)
[Question]
Which AI Safety research agendas are the most promising?
Chris_Leong
Jul 13, 2022, 7:54 AM
27
points
5
comments
1
min read
LW
link
Straw-Steelmanning
Chris van Merwijk
Jul 13, 2022, 5:48 AM
29
points
2
comments
1
min read
LW
link
Alien Message Contest: Solution
DaemonicSigil
Jul 13, 2022, 4:07 AM
29
points
2
comments
4
min read
LW
link
[Question]
What is wrong with this approach to corrigibility?
Rafael Cosman
Jul 12, 2022, 10:55 PM
7
points
8
comments
1
min read
LW
link
Acceptability Verification: A Research Agenda
David Udell
and
evhub
Jul 12, 2022, 8:11 PM
50
points
0
comments
1
min read
LW
link
(docs.google.com)
Progress links and tweets, 2022-07-12
jasoncrawford
Jul 12, 2022, 3:30 PM
12
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
Response to Blake Richards: AGI, generality, alignment, & loss functions
Steven Byrnes
Jul 12, 2022, 1:56 PM
62
points
9
comments
15
min read
LW
link
Three Minimum Pivotal Acts Possible by Narrow AI
Michael Soareverix
Jul 12, 2022, 9:51 AM
0
points
4
comments
2
min read
LW
link
Mosaic and Palimpsests: Two Shapes of Research
adamShimi
Jul 12, 2022, 9:05 AM
39
points
3
comments
9
min read
LW
link
[Question]
How do you concisely communicate & navigate the politics / culture at your job working at a large corporation or institution?
Willa
Jul 12, 2022, 3:22 AM
10
points
6
comments
1
min read
LW
link
On how various plans miss the hard bits of the alignment challenge
So8res
Jul 12, 2022, 2:49 AM
313
points
89
comments
29
min read
LW
link
3
reviews
Rainmaking
WalterL
Jul 12, 2022, 12:42 AM
26
points
5
comments
1
min read
LW
link
(www.youtube.com)
Book Review: Neal Stephenson’s “Termination Shock”
Tyler Simmons
Jul 12, 2022, 12:07 AM
13
points
0
comments
30
min read
LW
link
(www.words-and-dirt.com)
Announcing Future Forum—Apply Now
wANIEL
and
freemany
Jul 11, 2022, 10:57 PM
8
points
0
comments
4
min read
LW
link
(forum.effectivealtruism.org)
Defining Optimization in a Deeper Way Part 2
J Bostock
Jul 11, 2022, 8:29 PM
7
points
0
comments
4
min read
LW
link
Marriage, the Giving What We Can Pledge, and the damage caused by vague public commitments
Jeffrey Ladish
Jul 11, 2022, 7:38 PM
98
points
27
comments
6
min read
LW
link
1
review
Systemization
CFAR!Duncan
Jul 11, 2022, 6:39 PM
42
points
5
comments
12
min read
LW
link
[Question]
How do AI timelines affect how you live your life?
Quadratic Reciprocity
Jul 11, 2022, 1:54 PM
80
points
50
comments
1
min read
LW
link
Cambridge LW Meetup: Free Speech
Darmani
Jul 11, 2022, 4:36 AM
7
points
0
comments
1
min read
LW
link
Checksum Sensor Alignment
lsusr
Jul 11, 2022, 3:31 AM
12
points
2
comments
1
min read
LW
link
The Alignment Problem
lsusr
Jul 11, 2022, 3:03 AM
47
points
18
comments
3
min read
LW
link
Immanuel Kant and the Decision Theory App Store
Daniel Kokotajlo
Jul 10, 2022, 4:04 PM
92
points
12
comments
5
min read
LW
link
Metaculus is seeking experienced leaders, researchers & operators for high-impact roles
ChristianWilliams
Jul 10, 2022, 2:27 PM
9
points
0
comments
1
min read
LW
link
(apply.workable.com)
Avoid the abbreviation “FLOPs” – use “FLOP” or “FLOP/s” instead
Daniel_Eth
Jul 10, 2022, 10:44 AM
70
points
13
comments
1
min read
LW
link
My Opportunity Costs
abstractapplic
Jul 10, 2022, 10:14 AM
22
points
3
comments
3
min read
LW
link
Why Portland
Adam Zerner
Jul 10, 2022, 7:20 AM
25
points
18
comments
9
min read
LW
link
Hessian and Basin volume
Vivek Hebbar
Jul 10, 2022, 6:59 AM
35
points
10
comments
4
min read
LW
link
Taste & Shaping
CFAR!Duncan
Jul 10, 2022, 5:50 AM
67
points
1
comment
16
min read
LW
link
Comment on “Propositions Concerning Digital Minds and Society”
Zack_M_Davis
Jul 10, 2022, 5:48 AM
99
points
12
comments
8
min read
LW
link
Heaven: The last part of dystopia
Existism
Jul 9, 2022, 10:36 PM
−1
points
1
comment
6
min read
LW
link
Hope Can = Heaven
Existism
Jul 9, 2022, 10:35 PM
−2
points
0
comments
3
min read
LW
link
Report from a civilizational observer on Earth
owencb
Jul 9, 2022, 5:26 PM
49
points
12
comments
6
min read
LW
link
Grouped Loss may disfavor discontinuous capabilities
Adam Jermyn
Jul 9, 2022, 5:22 PM
14
points
2
comments
4
min read
LW
link
Train first VS prune first in neural networks.
Donald Hobson
Jul 9, 2022, 3:53 PM
18
points
5
comments
2
min read
LW
link
Visualizing Neural networks, how to blame the bias
Donald Hobson
Jul 9, 2022, 3:52 PM
7
points
1
comment
6
min read
LW
link
Using Ngram to estimate depression prevalence over time
David Gross
Jul 9, 2022, 2:57 PM
10
points
3
comments
2
min read
LW
link
(www.pnas.org)
Making it harder for an AGI to “trick” us, with STVs
Tor Økland Barstad
Jul 9, 2022, 2:42 PM
15
points
5
comments
22
min read
LW
link
Ars D&D.sci: Mysteries of Mana
aphyer
Jul 9, 2022, 12:19 PM
38
points
13
comments
3
min read
LW
link
[Question]
I’ve become a medical mystery and I don’t know how to effectively get help
CraigMichael
Jul 9, 2022, 6:58 AM
30
points
53
comments
2
min read
LW
link
Some thoughts on Animals
nitinkhanna
Jul 9, 2022, 2:11 AM
2
points
6
comments
2
min read
LW
link
Changes in Community Dynamics: A Follow-Up to ‘The Berkeley Community & the Rest of Us’
Evan_Gaensbauer
Jul 9, 2022, 1:44 AM
21
points
6
comments
4
min read
LW
link
MATS Models
johnswentworth
Jul 9, 2022, 12:14 AM
94
points
5
comments
16
min read
LW
link
Research Notes: What are we aligning for?
Shoshannah Tekofsky
Jul 8, 2022, 10:13 PM
19
points
8
comments
2
min read
LW
link
[Question]
What New Desktop Should I Buy?
Zvi
8 Jul 2022 15:04 UTC
15
points
19
comments
1
min read
LW
link
Being a donor for Fecal Microbiota Transplants (FMT): Do good & earn easy money (up to 180k/y)
EternallyBlissful
8 Jul 2022 6:17 UTC
36
points
26
comments
8
min read
LW
link
(forum.effectivealtruism.org)
User research as a barometer of software design
Adam Zerner
8 Jul 2022 6:02 UTC
31
points
13
comments
3
min read
LW
link
Reinforcement Learner Wireheading
Nate Showell
8 Jul 2022 5:32 UTC
8
points
2
comments
3
min read
LW
link
Exposition as science: some ideas for how to make progress
riceissa
8 Jul 2022 1:29 UTC
21
points
1
comment
8
min read
LW
link
In Search of Strategic Clarity
james.lucassen
8 Jul 2022 0:52 UTC
11
points
1
comment
5
min read
LW
link
(jlucassen.com)
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel