Book Review: Weapons of Math Destruction
Zvi
4 Jun 2017 21:20 UTC
1
point
0
comments
16
min read
LW
link
Rationalist Seder: Dayenu, Lo Dayenu
Raemon
4 Jun 2017 20:55 UTC
7
points
2
comments
3
min read
LW
link
The Personal Growth Cycle
Gordon Seidoh Worley
4 Jun 2017 17:20 UTC
8
points
4
comments
5
min read
LW
link
(mapandterritory.org)
A new, better way to read the Sequences
Said Achmiz
4 Jun 2017 5:10 UTC
19
points
13
comments
1
min read
LW
link
Rationalist Seder: A Story of War
Raemon
3 Jun 2017 20:17 UTC
12
points
14
comments
2
min read
LW
link
Cooperative Oracles: Nonexploited Bargaining
Scott Garrabrant
3 Jun 2017 0:39 UTC
6
points
6
comments
3
min read
LW
link
Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima
Scott Garrabrant
3 Jun 2017 0:38 UTC
5
points
8
comments
4
min read
LW
link
Cooperative Oracles: Introduction
Scott Garrabrant
3 Jun 2017 0:36 UTC
12
points
3
comments
2
min read
LW
link
Entangled Equilibria and the Twin Prisoners’ Dilemma
Scott Garrabrant
2 Jun 2017 22:09 UTC
5
points
2
comments
3
min read
LW
link
An algorithm with preferences: from zero to one variable
Stuart_Armstrong
2 Jun 2017 16:35 UTC
4
points
0
comments
1
min read
LW
link
Reward/value learning for reinforcement learning
Stuart_Armstrong
2 Jun 2017 16:34 UTC
0
points
2
comments
2
min read
LW
link
The best value indifference method (so far)
Stuart_Armstrong
2 Jun 2017 16:33 UTC
0
points
9
comments
5
min read
LW
link
How to judge moral learning failure
Stuart_Armstrong
2 Jun 2017 16:32 UTC
0
points
2
comments
2
min read
LW
link
Counterfactuals on POMDP
Stuart_Armstrong
2 Jun 2017 16:30 UTC
2
points
0
comments
2
min read
LW
link
Uninfluenceable learning agents
Stuart_Armstrong
2 Jun 2017 16:30 UTC
3
points
7
comments
1
min read
LW
link
Ontology, lost purposes, and instrumental goals
Stuart_Armstrong
2 Jun 2017 16:28 UTC
0
points
1
comment
1
min read
LW
link
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
2 Jun 2017 16:27 UTC
2
points
0
comments
3
min read
LW
link
Corrigibility thoughts II: the robot operator
Stuart_Armstrong
2 Jun 2017 16:27 UTC
0
points
12
comments
2
min read
LW
link
Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong
2 Jun 2017 16:27 UTC
0
points
0
comments
1
min read
LW
link
The radioactive burrito and learning from positive examples
Stuart_Armstrong
2 Jun 2017 16:25 UTC
0
points
2
comments
1
min read
LW
link
Thoughts on Quantilizers
Stuart_Armstrong
2 Jun 2017 16:24 UTC
2
points
0
comments
2
min read
LW
link
Emergency learning
Stuart_Armstrong
2 Jun 2017 16:23 UTC
1
point
0
comments
4
min read
LW
link
Humans as a truth channel
Stuart_Armstrong
2 Jun 2017 16:22 UTC
1
point
0
comments
2
min read
LW
link
All the indifference designs
Stuart_Armstrong
2 Jun 2017 16:20 UTC
2
points
1
comment
4
min read
LW
link
Indifference and compensatory rewards
Stuart_Armstrong
2 Jun 2017 16:19 UTC
0
points
0
comments
1
min read
LW
link
Counterfactually uninfluenceable agents
Stuart_Armstrong
2 Jun 2017 16:17 UTC
11
points
0
comments
2
min read
LW
link
Translation “counterfactual”
Stuart_Armstrong
2 Jun 2017 16:16 UTC
0
points
0
comments
2
min read
LW
link
Understanding the important facts
Stuart_Armstrong
2 Jun 2017 16:15 UTC
0
points
0
comments
1
min read
LW
link
Low impact versus low side effects
Stuart_Armstrong
2 Jun 2017 16:14 UTC
1
point
0
comments
2
min read
LW
link
Agents that don’t become maximisers
Stuart_Armstrong
2 Jun 2017 16:13 UTC
0
points
0
comments
3
min read
LW
link
AI safety: three human problems and one AI issue
Stuart_Armstrong
2 Jun 2017 16:12 UTC
2
points
4
comments
3
min read
LW
link
Optimisation in manipulating humans: engineered fanatics vs yes-men
Stuart_Armstrong
2 Jun 2017 15:51 UTC
0
points
0
comments
2
min read
LW
link
Divergent preferences and meta-preferences
Stuart_Armstrong
2 Jun 2017 15:51 UTC
9
points
0
comments
3
min read
LW
link
Acausal trade: double decrease
Stuart_Armstrong
2 Jun 2017 15:33 UTC
10
points
3
comments
2
min read
LW
link
Acausal trade: different utilities, different trades
Stuart_Armstrong
2 Jun 2017 15:33 UTC
1
point
1
comment
3
min read
LW
link
Acausal trade: universal utility, or selling non-existence insurance too late
Stuart_Armstrong
2 Jun 2017 15:33 UTC
1
point
1
comment
3
min read
LW
link
Acausal trade: trade barriers
Stuart_Armstrong
2 Jun 2017 15:32 UTC
0
points
1
comment
2
min read
LW
link
Futarchy, Xrisks, and near misses
Stuart_Armstrong
2 Jun 2017 8:02 UTC
1
point
0
comments
1
min read
LW
link
Futarchy, Xrisks, and near misses
Stuart_Armstrong
2 Jun 2017 8:02 UTC
10
points
10
comments
1
min read
LW
link
Book recommendation requests
ChristianKl
1 Jun 2017 22:33 UTC
16
points
49
comments
1
min read
LW
link
Why I am not currently working on the AAMLS agenda
jessicata
1 Jun 2017 17:57 UTC
28
points
3
comments
5
min read
LW
link
June 2017 Media Thread
ArisKatsaris
1 Jun 2017 6:17 UTC
2
points
35
comments
1
min read
LW
link
Strong men are socialist—how to use a study’s own data to disprove it
Jacob Falkovich
31 May 2017 4:18 UTC
9
points
5
comments
1
min read
LW
link
(putanumonit.com)
Regulatory lags for New Technology [2013 notes]
gwern
31 May 2017 1:27 UTC
9
points
5
comments
69
min read
LW
link
Philosophical Parenthood
SquirrelInHell
30 May 2017 14:09 UTC
1
point
25
comments
1
min read
LW
link
(squirrelinhell.blogspot.com)
Divergent preferences and meta-preferences
Stuart_Armstrong
30 May 2017 7:33 UTC
6
points
1
comment
2
min read
LW
link
Futarchy Fix
abramdemski
30 May 2017 5:46 UTC
7
points
9
comments
9
min read
LW
link
“AIXIjs: A Software Demo for General Reinforcement Learning”, Aslanides 2017
gwern
29 May 2017 21:09 UTC
7
points
1
comment
1
min read
LW
link
(arxiv.org)
Invitation to comment on a draft on multiverse-wide cooperation via alternatives to causal decision theory (FDT/UDT/EDT/...)
Caspar Oesterheld
29 May 2017 8:34 UTC
6
points
7
comments
1
min read
LW
link
Meetup : Sydney Rationality—Pub meetup June
Elo
29 May 2017 6:52 UTC
2
points
0
comments
1
min read
LW
link