Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
EA Funds: Long-Term Future fund is open to applications until November 24th (this Saturday)
habryka
Nov 21, 2018, 3:39 AM
37
points
0
comments
1
min read
LW
link
Rationality Is Not Systematized Winning
namespace
Nov 11, 2018, 10:05 PM
36
points
20
comments
1
min read
LW
link
(www.thelastrationalist.com)
Values Weren’t Complex, Once.
Davidmanheim
Nov 25, 2018, 9:17 AM
36
points
13
comments
2
min read
LW
link
Combat vs Nurture: Cultural Genesis
Ruby
Nov 12, 2018, 2:11 AM
35
points
12
comments
6
min read
LW
link
Reflective oracles as a solution to the converse Lawvere problem
SamEisenstat
Nov 29, 2018, 3:23 AM
35
points
2
comments
7
min read
LW
link
Letting Others Be Vulnerable
lifelonglearner
Nov 19, 2018, 2:59 AM
34
points
6
comments
7
min read
LW
link
New safety research agenda: scalable agent alignment via reward modeling
Vika
Nov 20, 2018, 5:29 PM
34
points
12
comments
1
min read
LW
link
(medium.com)
Acknowledging Human Preference Types to Support Value Learning
Nandi
Nov 13, 2018, 6:57 PM
34
points
4
comments
9
min read
LW
link
Model Mis-specification and Inverse Reinforcement Learning
Owain_Evans
and
jsteinhardt
Nov 9, 2018, 3:33 PM
34
points
3
comments
16
min read
LW
link
Iteration Fixed Point Exercises
Scott Garrabrant
and
SamEisenstat
Nov 22, 2018, 12:35 AM
33
points
12
comments
3
min read
LW
link
How rapidly are GPUs improving in price performance?
gallabytes
Nov 25, 2018, 7:54 PM
31
points
9
comments
LW
link
(mediangroup.org)
An unaligned benchmark
paulfchristiano
Nov 17, 2018, 3:51 PM
31
points
0
comments
9
min read
LW
link
Hyperreal Brouwer
Scott Garrabrant
Nov 29, 2018, 3:15 AM
31
points
2
comments
6
min read
LW
link
Approval-directed agents
paulfchristiano
Nov 22, 2018, 9:15 PM
31
points
10
comments
15
min read
LW
link
Stabilize-Reflect-Execute
ozziegooen
Nov 28, 2018, 5:26 PM
30
points
1
comment
2
min read
LW
link
October gwern.net links
gwern
Nov 1, 2018, 1:11 AM
29
points
8
comments
LW
link
(www.gwern.net)
Summary: Surreal Decisions
Chris_Leong
Nov 27, 2018, 2:15 PM
29
points
20
comments
3
min read
LW
link
Genetically Modified Humans Born (Allegedly)
ryan_b
Nov 28, 2018, 4:14 PM
29
points
12
comments
1
min read
LW
link
The new Effective Altruism forum just launched
habryka
Nov 8, 2018, 1:59 AM
27
points
6
comments
1
min read
LW
link
Discussion on the machine learning approach to AI safety
Vika
Nov 1, 2018, 8:54 PM
27
points
3
comments
4
min read
LW
link
Status model
Bucky
Nov 26, 2018, 3:05 PM
26
points
7
comments
3
min read
LW
link
Bounded Oracle Induction
Diffractor
Nov 28, 2018, 8:11 AM
25
points
0
comments
9
min read
LW
link
Beliefs at different timescales
Nisan
Nov 4, 2018, 8:10 PM
25
points
12
comments
2
min read
LW
link
Clickbait might not be destroying our general Intelligence
Donald Hobson
Nov 19, 2018, 12:13 AM
25
points
13
comments
2
min read
LW
link
The Inspection Paradox is Everywhere
Chris_Leong
Nov 15, 2018, 10:55 AM
24
points
3
comments
1
min read
LW
link
(allendowney.blogspot.com)
Latent Variables and Model Mis-Specification
jsteinhardt
Nov 7, 2018, 2:48 PM
24
points
8
comments
9
min read
LW
link
Specification gaming examples in AI
Samuel Rødal
Nov 10, 2018, 12:00 PM
24
points
6
comments
1
min read
LW
link
(docs.google.com)
Approval-directed bootstrapping
paulfchristiano
Nov 25, 2018, 11:18 PM
24
points
0
comments
1
min read
LW
link
Rationality of demonstrating & voting
bfinn
Nov 7, 2018, 12:09 AM
24
points
21
comments
8
min read
LW
link
Kelly bettors
DanielFilan
Nov 13, 2018, 12:40 AM
24
points
3
comments
10
min read
LW
link
(danielfilan.com)
deluks917 on Online Weirdos
Jacob Falkovich
Nov 24, 2018, 5:03 PM
24
points
3
comments
10
min read
LW
link
Alignment Newsletter #34
Rohin Shah
Nov 26, 2018, 11:10 PM
24
points
0
comments
10
min read
LW
link
(mailchi.mp)
Suggestion: New material shouldn’t be released too fast
Chris_Leong
Nov 21, 2018, 4:39 PM
23
points
7
comments
1
min read
LW
link
On first looking into Russell’s History
Richard_Ngo
Nov 8, 2018, 11:20 AM
23
points
6
comments
5
min read
LW
link
(thinkingcomplete.blogspot.com)
The Ubiquitous Converse Lawvere Problem
Scott Garrabrant
Nov 29, 2018, 3:16 AM
23
points
0
comments
2
min read
LW
link
Alignment Newsletter #33
Rohin Shah
Nov 19, 2018, 5:20 PM
23
points
0
comments
9
min read
LW
link
(mailchi.mp)
Speculations on improving debating
Richard_Ngo
Nov 5, 2018, 4:10 PM
22
points
4
comments
4
min read
LW
link
(thinkingcomplete.blogspot.com)
AI development incentive gradients are not uniformly terrible
rk
Nov 12, 2018, 4:27 PM
21
points
12
comments
6
min read
LW
link
Bayes Questions
Bucky
Nov 7, 2018, 4:54 PM
21
points
13
comments
2
min read
LW
link
Review: Artifact
Zvi
Nov 22, 2018, 3:00 PM
21
points
3
comments
13
min read
LW
link
(thezvi.wordpress.com)
Implementations of immortality
Richard_Ngo
Nov 1, 2018, 2:20 PM
20
points
11
comments
5
min read
LW
link
(thinkingcomplete.blogspot.com)
Meta-execution
paulfchristiano
Nov 1, 2018, 10:18 PM
20
points
1
comment
5
min read
LW
link
The Semantic Man
namespace
Nov 22, 2018, 8:38 AM
19
points
4
comments
1
min read
LW
link
(www.generalsemantics.org)
Alignment Newsletter #32
Rohin Shah
Nov 12, 2018, 5:20 PM
18
points
0
comments
12
min read
LW
link
(mailchi.mp)
[Insert clever intro here]
Bae's Theorem
20 Nov 2018 3:26 UTC
18
points
13
comments
1
min read
LW
link
Real-time hiring with prediction markets
ryan_b
9 Nov 2018 22:10 UTC
17
points
9
comments
1
min read
LW
link
Alignment Newsletter #31
Rohin Shah
5 Nov 2018 23:50 UTC
17
points
0
comments
12
min read
LW
link
(mailchi.mp)
What if people simply forecasted your future choices?
ozziegooen
23 Nov 2018 10:52 UTC
16
points
6
comments
6
min read
LW
link
Madison Solstice Gathering
mingyuan
28 Nov 2018 21:36 UTC
16
points
0
comments
1
min read
LW
link
Goodhart’s Law and Genies
thomascolthurst
1 Nov 2018 1:38 UTC
15
points
5
comments
9
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel