Bet or update: fixing the will-to-wager assumption
cousin_it, 7 Jun 2017 15:03 UTC, 62 points, 61 comments, 1 min read

New circumstances, new values?
Stuart_Armstrong, 6 Jun 2017 8:20 UTC, 11 points, 14 comments, 1 min read

New circumstances, new values?
Stuart_Armstrong, 6 Jun 2017 8:18 UTC, 0 points, 0 comments, 1 min read

Becoming a Better Community
Sable, 6 Jun 2017 7:11 UTC, 11 points, 16 comments, 5 min read

Argument From Infinity
DragonGod, 5 Jun 2017 21:33 UTC, 0 points, 19 comments, 3 min read

Mode Collapse and the Norm One Principle
tristanm, 5 Jun 2017 21:30 UTC, 28 points, 13 comments, 11 min read

The Simple World Hypothesis
DragonGod, 5 Jun 2017 19:34 UTC, 4 points, 15 comments, 8 min read

Cognitive Science/Psychology As a Neglected Approach to AI Safety (effective-altruism.com)
Kaj_Sotala, 5 Jun 2017 13:55 UTC, 8 points, 5 comments, 1 min read

Open thread, June 5 - June 11, 2017
Elo, 5 Jun 2017 4:23 UTC, 2 points, 97 comments, 1 min read

Birth of a Stereotype
DragonGod, 5 Jun 2017 3:29 UTC, 0 points, 13 comments, 6 min read

A Comment on Expected Utility Theory
DragonGod, 5 Jun 2017 3:26 UTC, 0 points, 5 comments, 4 min read

Rationality as A Value Decider
DragonGod, 5 Jun 2017 3:21 UTC, 1 point, 0 comments, 8 min read

Book Review: Weapons of Math Destruction
Zvi, 4 Jun 2017 21:20 UTC, 1 point, 0 comments, 16 min read

Rationalist Seder: Dayenu, Lo Dayenu
Raemon, 4 Jun 2017 20:55 UTC, 7 points, 2 comments, 3 min read

The Personal Growth Cycle (mapandterritory.org)
Gordon Seidoh Worley, 4 Jun 2017 17:20 UTC, 8 points, 4 comments, 5 min read

A new, better way to read the Sequences
Said Achmiz, 4 Jun 2017 5:10 UTC, 19 points, 13 comments, 1 min read

Rationalist Seder: A Story of War
Raemon, 3 Jun 2017 20:17 UTC, 12 points, 14 comments, 2 min read

Cooperative Oracles: Nonexploited Bargaining
Scott Garrabrant, 3 Jun 2017 0:39 UTC, 6 points, 6 comments, 3 min read

Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima
Scott Garrabrant, 3 Jun 2017 0:38 UTC, 5 points, 8 comments, 4 min read

Cooperative Oracles: Introduction
Scott Garrabrant, 3 Jun 2017 0:36 UTC, 12 points, 3 comments, 2 min read

Entangled Equilibria and the Twin Prisoners’ Dilemma
Scott Garrabrant, 2 Jun 2017 22:09 UTC, 5 points, 2 comments, 3 min read

An algorithm with preferences: from zero to one variable
Stuart_Armstrong, 2 Jun 2017 16:35 UTC, 4 points, 0 comments, 1 min read

Reward/value learning for reinforcement learning
Stuart_Armstrong, 2 Jun 2017 16:34 UTC, 0 points, 2 comments, 2 min read

The best value indifference method (so far)
Stuart_Armstrong, 2 Jun 2017 16:33 UTC, 0 points, 9 comments, 5 min read

How to judge moral learning failure
Stuart_Armstrong, 2 Jun 2017 16:32 UTC, 0 points, 2 comments, 2 min read

Counterfactuals on POMDP
Stuart_Armstrong, 2 Jun 2017 16:30 UTC, 2 points, 0 comments, 2 min read

Uninfluenceable learning agents
Stuart_Armstrong, 2 Jun 2017 16:30 UTC, 3 points, 7 comments, 1 min read

Ontology, lost purposes, and instrumental goals
Stuart_Armstrong, 2 Jun 2017 16:28 UTC, 0 points, 1 comment, 1 min read

Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong, 2 Jun 2017 16:27 UTC, 2 points, 0 comments, 3 min read

Corrigibility thoughts II: the robot operator
Stuart_Armstrong, 2 Jun 2017 16:27 UTC, 0 points, 12 comments, 2 min read

Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong, 2 Jun 2017 16:27 UTC, 0 points, 0 comments, 1 min read

The radioactive burrito and learning from positive examples
Stuart_Armstrong, 2 Jun 2017 16:25 UTC, 0 points, 2 comments, 1 min read

Thoughts on Quantilizers
Stuart_Armstrong, 2 Jun 2017 16:24 UTC, 2 points, 0 comments, 2 min read

Emergency learning
Stuart_Armstrong, 2 Jun 2017 16:23 UTC, 1 point, 0 comments, 4 min read

Humans as a truth channel
Stuart_Armstrong, 2 Jun 2017 16:22 UTC, 1 point, 0 comments, 2 min read

All the indifference designs
Stuart_Armstrong, 2 Jun 2017 16:20 UTC, 2 points, 1 comment, 4 min read

Indifference and compensatory rewards
Stuart_Armstrong, 2 Jun 2017 16:19 UTC, 0 points, 0 comments, 1 min read

Counterfactually uninfluenceable agents
Stuart_Armstrong, 2 Jun 2017 16:17 UTC, 11 points, 0 comments, 2 min read

Translation “counterfactual”
Stuart_Armstrong, 2 Jun 2017 16:16 UTC, 0 points, 0 comments, 2 min read

Understanding the important facts
Stuart_Armstrong, 2 Jun 2017 16:15 UTC, 0 points, 0 comments, 1 min read

Low impact versus low side effects
Stuart_Armstrong, 2 Jun 2017 16:14 UTC, 1 point, 0 comments, 2 min read

Agents that don’t become maximisers
Stuart_Armstrong, 2 Jun 2017 16:13 UTC, 0 points, 0 comments, 3 min read

AI safety: three human problems and one AI issue
Stuart_Armstrong, 2 Jun 2017 16:12 UTC, 2 points, 4 comments, 3 min read

Optimisation in manipulating humans: engineered fanatics vs yes-men
Stuart_Armstrong, 2 Jun 2017 15:51 UTC, 0 points, 0 comments, 2 min read

Divergent preferences and meta-preferences
Stuart_Armstrong, 2 Jun 2017 15:51 UTC, 9 points, 0 comments, 3 min read

Acausal trade: double decrease
Stuart_Armstrong, 2 Jun 2017 15:33 UTC, 10 points, 3 comments, 2 min read

Acausal trade: different utilities, different trades
Stuart_Armstrong, 2 Jun 2017 15:33 UTC, 1 point, 1 comment, 3 min read

Acausal trade: universal utility, or selling non-existence insurance too late
Stuart_Armstrong, 2 Jun 2017 15:33 UTC, 1 point, 1 comment, 3 min read

Acausal trade: trade barriers
Stuart_Armstrong, 2 Jun 2017 15:32 UTC, 0 points, 1 comment, 2 min read

Futarchy, Xrisks, and near misses
Stuart_Armstrong, 2 Jun 2017 8:02 UTC, 1 point, 0 comments, 1 min read