Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
A misconception about immigration
limerott
Aug 19, 2019, 10:37 PM
1
point
9
comments
4
min read
LW
link
(limerott.com)
[Question]
Do We Change Our Minds Less Often Than We Think?
Raemon
Aug 19, 2019, 9:37 PM
20
points
5
comments
1
min read
LW
link
Classifying specification problems as variants of Goodhart’s Law
Vika
Aug 19, 2019, 8:40 PM
72
points
5
comments
5
min read
LW
link
1
review
Unstriving
Jacob Falkovich
Aug 19, 2019, 2:31 PM
38
points
7
comments
6
min read
LW
link
Goodhart’s Curse and Limitations on AI Alignment
Gordon Seidoh Worley
Aug 19, 2019, 7:57 AM
25
points
18
comments
10
min read
LW
link
Raph Koster on Virtual Worlds vs Games (notes)
Raemon
Aug 18, 2019, 7:01 PM
26
points
8
comments
2
min read
LW
link
“Can We Survive Technology” by von Neumann
Ben Pace
Aug 18, 2019, 6:58 PM
32
points
2
comments
1
min read
LW
link
(geosci.uchicago.edu)
Prokaryote Multiverse. An argument that potential simulators do not have significantly more complex physics than ours
mako yass
Aug 18, 2019, 4:22 AM
0
points
5
comments
2
min read
LW
link
Neural Nets in Python 1
lifelonglearner
Aug 18, 2019, 2:48 AM
10
points
3
comments
8
min read
LW
link
Inspection Paradox as a Driver of Group Separation
Shmi
Aug 17, 2019, 9:47 PM
29
points
0
comments
1
min read
LW
link
South Bay Meetup
David Friedman
Aug 17, 2019, 7:56 PM
1
point
0
comments
1
min read
LW
link
Problems in AI Alignment that philosophers could potentially contribute to
Wei Dai
Aug 17, 2019, 5:38 PM
79
points
14
comments
2
min read
LW
link
[Question]
How can you use music to boost learning?
Matthew Barnett
Aug 17, 2019, 6:59 AM
11
points
1
comment
1
min read
LW
link
A Primer on Matrix Calculus, Part 3: The Chain Rule
Matthew Barnett
Aug 17, 2019, 1:50 AM
12
points
4
comments
6
min read
LW
link
Nashville SSC September Meetup
friedelcraftiness
Aug 16, 2019, 3:16 PM
1
point
0
comments
1
min read
LW
link
Beliefs Are For True Things
Davis_Kingsley
Aug 15, 2019, 11:23 PM
8
points
5
comments
3
min read
LW
link
[Question]
What experiments would demonstrate “upper limits of augmented working memory?”
Raemon
Aug 15, 2019, 10:09 PM
33
points
6
comments
2
min read
LW
link
Clarifying some key hypotheses in AI alignment
Ben Cottier
and
Rohin Shah
Aug 15, 2019, 9:29 PM
79
points
12
comments
9
min read
LW
link
Tessercube — OpenPGP Made Mobile
Suji Yan
Aug 15, 2019, 9:34 AM
4
points
0
comments
1
min read
LW
link
A Primer on Matrix Calculus, Part 2: Jacobians and other fun
Matthew Barnett
Aug 15, 2019, 1:13 AM
22
points
7
comments
7
min read
LW
link
Partial summary of debate with Benquo and Jessicata [pt 1]
Raemon
Aug 14, 2019, 8:02 PM
89
points
63
comments
22
min read
LW
link
3
reviews
“Designing agent incentives to avoid reward tampering”, DeepMind
gwern
Aug 14, 2019, 4:57 PM
28
points
15
comments
1
min read
LW
link
(medium.com)
Subagents, trauma and rationality
Kaj_Sotala
Aug 14, 2019, 1:14 PM
111
points
4
comments
19
min read
LW
link
Predicted AI alignment event/meeting calendar
rmoehn
Aug 14, 2019, 7:14 AM
29
points
14
comments
1
min read
LW
link
Natural laws should be explicit constraints on strategy space
ryan_b
Aug 13, 2019, 8:22 PM
8
points
6
comments
1
min read
LW
link
Distance Functions are Hard
Grue_Slinky
Aug 13, 2019, 5:33 PM
31
points
19
comments
6
min read
LW
link
Book Review: Secular Cycles
Scott Alexander
Aug 13, 2019, 4:10 AM
62
points
10
comments
16
min read
LW
link
1
review
(slatestarcodex.com)
A Primer on Matrix Calculus, Part 1: Basic review
Matthew Barnett
Aug 12, 2019, 11:44 PM
25
points
4
comments
7
min read
LW
link
[Question]
What explanatory power does Kahneman’s System 2 possess?
Richard_Ngo
Aug 12, 2019, 3:23 PM
31
points
2
comments
1
min read
LW
link
Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4)
Davidmanheim
Aug 12, 2019, 8:07 AM
15
points
3
comments
4
min read
LW
link
Adjectives from the Future: The Dangers of Result-based Descriptions
Pradeep_Kumar
Aug 11, 2019, 7:19 PM
19
points
8
comments
11
min read
LW
link
[Question]
Could we solve this email mess if we all moved to paid emails?
jacobjacob
Aug 11, 2019, 4:31 PM
29
points
50
comments
4
min read
LW
link
AI Safety Reading Group
Søren Elverlin
Aug 11, 2019, 9:01 AM
16
points
8
comments
1
min read
LW
link
[Question]
Does human choice have to be transitive in order to be rational/consistent?
jmh
Aug 11, 2019, 1:49 AM
9
points
6
comments
1
min read
LW
link
Diana Fleischman and Geoffrey Miller—Audience Q&A
Jacob Falkovich
Aug 10, 2019, 10:37 PM
38
points
6
comments
9
min read
LW
link
Intransitive Preferences You Can’t Pump
zulupineapple
Aug 9, 2019, 11:10 PM
0
points
2
comments
1
min read
LW
link
Categorial preferences and utility functions
DavidHolmes
Aug 9, 2019, 9:36 PM
10
points
6
comments
5
min read
LW
link
[Question]
What is the state of the ego depletion field?
Eli Tyre
Aug 9, 2019, 8:30 PM
27
points
10
comments
1
min read
LW
link
Why Gradients Vanish and Explode
Matthew Barnett
Aug 9, 2019, 2:54 AM
25
points
9
comments
3
min read
LW
link
AI Forecasting Dictionary (Forecasting infrastructure, part 1)
jacobjacob
and
bgold
Aug 8, 2019, 4:10 PM
50
points
0
comments
5
min read
LW
link
[Question]
Why do humans not have built-in neural i/o channels?
Richard_Ngo
Aug 8, 2019, 1:09 PM
25
points
23
comments
1
min read
LW
link
Which of these five AI alignment research projects ideas are no good?
rmoehn
Aug 8, 2019, 7:17 AM
25
points
13
comments
1
min read
LW
link
Calibrating With Cards
lifelonglearner
Aug 8, 2019, 6:44 AM
32
points
3
comments
3
min read
LW
link
[Question]
Is there a source/market for LW-related t-shirts?
jooyous
8 Aug 2019 4:30 UTC
8
points
3
comments
1
min read
LW
link
Verification and Transparency
DanielFilan
8 Aug 2019 1:50 UTC
35
points
6
comments
2
min read
LW
link
(danielfilan.com)
Toy model piece #2: Combining short and long range partial preferences
Stuart_Armstrong
8 Aug 2019 0:11 UTC
14
points
0
comments
4
min read
LW
link
Four Ways An Impact Measure Could Help Alignment
Matthew Barnett
8 Aug 2019 0:10 UTC
21
points
1
comment
9
min read
LW
link
Nashville August SSC Meetup
friedelcraftiness
7 Aug 2019 20:11 UTC
1
point
0
comments
1
min read
LW
link
In defense of Oracle (“Tool”) AI research
Steven Byrnes
7 Aug 2019 19:14 UTC
22
points
11
comments
4
min read
LW
link
Help forecast study replication in this social science prediction market
rosiecam
7 Aug 2019 18:18 UTC
29
points
3
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel