Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Can corrigibility be learned safely?
Wei Dai
1 Apr 2018 23:07 UTC
35
points
115
comments
4
min read
LW
link
Global insect declines: Why aren’t we all dead yet?
eukaryote
1 Apr 2018 20:38 UTC
28
points
26
comments
1
min read
LW
link
Announcing Rational Newsletter
Alexey Lapitsky
1 Apr 2018 14:37 UTC
10
points
9
comments
1
min read
LW
link
April Fools: Announcing: Karma 2.0
habryka
1 Apr 2018 10:33 UTC
63
points
56
comments
1
min read
LW
link
Life hacks
Jan_Kulveit
1 Apr 2018 10:29 UTC
4
points
0
comments
1
min read
LW
link
One-Year Anniversary Retrospective—Los Angeles
RobertM
1 Apr 2018 6:34 UTC
12
points
4
comments
3
min read
LW
link
My take on agent foundations: formalizing metaphilosophical competence
zhukeepa
1 Apr 2018 6:33 UTC
21
points
6
comments
1
min read
LW
link
Corrigible but misaligned: a superintelligent messiah
zhukeepa
1 Apr 2018 6:20 UTC
28
points
26
comments
5
min read
LW
link
LW Update 3/31 - Post Highlights and Bug Fixes
Raemon
1 Apr 2018 4:01 UTC
10
points
2
comments
1
min read
LW
link
Schelling Shifts During AI Self-Modification
MikailKhan
1 Apr 2018 1:58 UTC
6
points
3
comments
6
min read
LW
link
Reframing misaligned AGI’s: well-intentioned non-neurotypical assistants
zhukeepa
1 Apr 2018 1:22 UTC
46
points
14
comments
2
min read
LW
link
The Regularizing-Reducing Model
RyenKrusinga
1 Apr 2018 1:16 UTC
3
points
6
comments
1
min read
LW
link
(drive.google.com)
Metaphilosophical competence can’t be disentangled from alignment
zhukeepa
1 Apr 2018 0:38 UTC
46
points
39
comments
3
min read
LW
link
Belief alignment
hnowak
1 Apr 2018 0:13 UTC
1
point
2
comments
6
min read
LW
link
A Sketch of Good Communication
Ben Pace
31 Mar 2018 22:48 UTC
201
points
35
comments
3
min read
LW
link
1
review
Harry Potter and the Method of Entropy 1 [LessWrong version]
habryka
31 Mar 2018 20:38 UTC
6
points
0
comments
3
min read
LW
link
Harry Potter and the Method of Entropy
alkjash
31 Mar 2018 20:10 UTC
11
points
12
comments
1
min read
LW
link
(radimentary.wordpress.com)
Salience
Tueskes
31 Mar 2018 19:52 UTC
6
points
1
comment
4
min read
LW
link
Opportunities for individual donors in AI safety
Alex Flint
31 Mar 2018 18:37 UTC
30
points
3
comments
11
min read
LW
link
Time in Machine Metaethics
Razmęk Massaräinen
31 Mar 2018 15:02 UTC
2
points
1
comment
6
min read
LW
link
Nice Things
Zvi
31 Mar 2018 12:30 UTC
14
points
0
comments
2
min read
LW
link
(thezvi.wordpress.com)
Reducing Agents: When abstractions break
Hazard
31 Mar 2018 0:03 UTC
13
points
10
comments
8
min read
LW
link
Sydney Rationality Dojo—April
luminosity
30 Mar 2018 14:18 UTC
1
point
0
comments
1
min read
LW
link
The Eternal Grind
Zvi
30 Mar 2018 11:40 UTC
10
points
1
comment
17
min read
LW
link
(thezvi.wordpress.com)
Reward hacking and Goodhart’s law by evolutionary algorithms
Jan_Kulveit
30 Mar 2018 7:57 UTC
18
points
5
comments
1
min read
LW
link
(arxiv.org)
Rationalist Lent is over
Qiaochu_Yuan
30 Mar 2018 5:57 UTC
20
points
16
comments
1
min read
LW
link
Resolving human values, completely and adequately
Stuart_Armstrong
30 Mar 2018 3:35 UTC
32
points
30
comments
12
min read
LW
link
Charting Deaths: Reality vs Reported
lifelonglearner
30 Mar 2018 0:50 UTC
13
points
1
comment
1
min read
LW
link
(owenshen24.github.io)
Site search will be down for a few hours
habryka
30 Mar 2018 0:43 UTC
4
points
0
comments
1
min read
LW
link
Hufflepuff Cynicism on Hypocrisy
abramdemski
29 Mar 2018 21:01 UTC
21
points
78
comments
5
min read
LW
link
2018 Prediction Contest—Propositions Needed
jbeshir
29 Mar 2018 15:02 UTC
7
points
6
comments
4
min read
LW
link
A framework for thinking about AI timescales
Tobias_Baumann
29 Mar 2018 9:29 UTC
7
points
0
comments
1
min read
LW
link
(s-risks.org)
Every Implementation of You is You: An Intuition Ladder
lolbifrons
29 Mar 2018 5:14 UTC
3
points
47
comments
3
min read
LW
link
Washington, D.C.: Meta-Meta Meetup
RobinZ
28 Mar 2018 18:54 UTC
2
points
0
comments
1
min read
LW
link
Open-Category Classification
TurnTrout
28 Mar 2018 14:49 UTC
14
points
6
comments
10
min read
LW
link
*Deleted*
Martin Bernstorff
28 Mar 2018 10:22 UTC
−5
points
21
comments
1
min read
LW
link
‘Trivial Inconvenience Day’ Retrospective
namespace
28 Mar 2018 5:14 UTC
32
points
3
comments
6
min read
LW
link
Karnofsky on forecasting and what science does
Rob Bensinger
28 Mar 2018 1:55 UTC
14
points
0
comments
8
min read
LW
link
(80000hours.org)
The fundamental complementarity of consciousness and work
KatjaGrace
28 Mar 2018 1:20 UTC
16
points
5
comments
2
min read
LW
link
(meteuphoric.wordpress.com)
Defining the ways human values are messy
Stuart_Armstrong
27 Mar 2018 22:42 UTC
9
points
6
comments
2
min read
LW
link
Optimal level of hierarchy for effective altruism?
Jan_Kulveit
27 Mar 2018 22:38 UTC
3
points
0
comments
2
min read
LW
link
(effective-altruism.com)
Learn Bayes Nets!
abramdemski
27 Mar 2018 22:00 UTC
52
points
8
comments
2
min read
LW
link
Evaluating Existing Approaches to AGI Alignment
Gordon Seidoh Worley
27 Mar 2018 19:57 UTC
12
points
0
comments
4
min read
LW
link
(mapandterritory.org)
The master skill of matching map and territory
Rafael Harth
27 Mar 2018 12:06 UTC
14
points
13
comments
1
min read
LW
link
[Preprint for commenting] Digital Immortality: Theory and Protocol for Indirect Mind Uploading
avturchin
27 Mar 2018 11:49 UTC
8
points
5
comments
1
min read
LW
link
Problems with Amplification/Distillation
Stuart_Armstrong
27 Mar 2018 11:12 UTC
29
points
7
comments
10
min read
LW
link
GreaterWrong—several new features & enhancements
Said Achmiz
27 Mar 2018 2:36 UTC
15
points
3
comments
1
min read
LW
link
Non-Adversarial Goodhart and AI Risks
Davidmanheim
27 Mar 2018 1:39 UTC
22
points
11
comments
6
min read
LW
link
A Difficulty With Density-Zero Exploration
Diffractor
27 Mar 2018 1:03 UTC
0
points
1
comment
2
min read
LW
link
My Thoughts on Takeoff Speeds
tristanm
27 Mar 2018 0:05 UTC
11
points
2
comments
7
min read
LW
link
Back to top
Next