“Other people are wrong” vs “I am right”
Buck
22 Feb 2019 20:01 UTC
263
points
20
comments
9
min read
LW
link
2
reviews
Rule Thinkers In, Not Out
Scott Alexander
27 Feb 2019 2:40 UTC
227
points
67
comments
4
min read
LW
link
4
reviews
(slatestarcodex.com)
Humans Who Are Not Concentrating Are Not General Intelligences
sarahconstantin
25 Feb 2019 20:40 UTC
187
points
35
comments
6
min read
LW
link
1
review
(srconstantin.wordpress.com)
Unconscious Economics
jacobjacob
27 Feb 2019 12:58 UTC
138
points
30
comments
4
min read
LW
link
3
reviews
Blackmail
Zvi
19 Feb 2019 3:50 UTC
133
points
55
comments
16
min read
LW
link
2
reviews
(thezvi.wordpress.com)
Thoughts on Human Models
Ramana Kumar
and
Scott Garrabrant
21 Feb 2019 9:10 UTC
126
points
32
comments
10
min read
LW
link
1
review
The Tale of Alice Almost: Strategies for Dealing With Pretty Good People
sarahconstantin
27 Feb 2019 19:34 UTC
116
points
6
comments
6
min read
LW
link
2
reviews
(srconstantin.wordpress.com)
Epistemic Tenure
Scott Garrabrant
18 Feb 2019 22:56 UTC
89
points
27
comments
3
min read
LW
link
Probability space has 2 metrics
Donald Hobson
10 Feb 2019 0:28 UTC
88
points
11
comments
1
min read
LW
link
Some Thoughts on Metaphilosophy
Wei Dai
10 Feb 2019 0:28 UTC
76
points
30
comments
4
min read
LW
link
Test Cases for Impact Regularisation Methods
DanielFilan
6 Feb 2019 21:50 UTC
72
points
5
comments
13
min read
LW
link
(danielfilan.com)
[Question]
How does Gradient Descent Interact with Goodhart?
Scott Garrabrant
2 Feb 2019 0:14 UTC
68
points
19
comments
4
min read
LW
link
The Case for a Bigger Audience
John_Maxwell
9 Feb 2019 7:22 UTC
68
points
58
comments
2
min read
LW
link
RAISE is launching their MVP
null
26 Feb 2019 11:45 UTC
67
points
1
comment
1
min read
LW
link
Pavlov Generalizes
abramdemski
20 Feb 2019 9:03 UTC
67
points
4
comments
7
min read
LW
link
How the MtG Color Wheel Explains AI Safety
Scott Garrabrant
15 Feb 2019 23:42 UTC
65
points
4
comments
6
min read
LW
link
When to use quantilization
RyanCarey
5 Feb 2019 17:17 UTC
65
points
5
comments
4
min read
LW
link
The Argument from Philosophical Difficulty
Wei Dai
10 Feb 2019 0:28 UTC
59
points
31
comments
1
min read
LW
link
The Hamming Question
Raemon
8 Feb 2019 19:34 UTC
59
points
38
comments
3
min read
LW
link
[Question]
If a “Kickstarter for Inadequate Equlibria” was built, do you have a concrete inadequate equilibrium to fix?
Raemon
21 Feb 2019 21:32 UTC
56
points
40
comments
1
min read
LW
link
Two Small Experiments on GPT-2
jimrandomh
21 Feb 2019 2:59 UTC
54
points
28
comments
1
min read
LW
link
[Question]
Why didn’t Agoric Computing become popular?
Wei Dai
16 Feb 2019 6:19 UTC
52
points
22
comments
2
min read
LW
link
Conclusion to the sequence on value learning
Rohin Shah
3 Feb 2019 21:05 UTC
51
points
20
comments
5
min read
LW
link
Coherent behaviour in the real world is an incoherent concept
Richard_Ngo
11 Feb 2019 17:00 UTC
51
points
17
comments
9
min read
LW
link
Urgent & important: How (not) to do your to-do list
bfinn
1 Feb 2019 17:44 UTC
51
points
20
comments
13
min read
LW
link
[Question]
How does OpenAI’s language model affect our AI timeline estimates?
jimrandomh
15 Feb 2019 3:11 UTC
50
points
7
comments
1
min read
LW
link
[Question]
How good is a human’s gut judgement at guessing someone’s IQ?
habryka
25 Feb 2019 21:23 UTC
50
points
21
comments
1
min read
LW
link
Arguments for moral indefinability
Richard_Ngo
12 Feb 2019 10:40 UTC
50
points
10
comments
7
min read
LW
link
(thinkingcomplete.blogspot.com)
Avoiding Jargon Confusion
Raemon
17 Feb 2019 23:37 UTC
46
points
35
comments
4
min read
LW
link
(notes on) Policy Desiderata for Superintelligent AI: A Vector Field Approach
Ben Pace
4 Feb 2019 22:08 UTC
43
points
5
comments
7
min read
LW
link
The Prediction Pyramid: Why Fundamental Work is Needed for Prediction Work
ozziegooen
14 Feb 2019 16:21 UTC
43
points
15
comments
3
min read
LW
link
Learning preferences by looking at the world
Rohin Shah
12 Feb 2019 22:25 UTC
43
points
10
comments
7
min read
LW
link
(bair.berkeley.edu)
Is voting theory important? An attempt to check my bias.
Jameson Quinn
17 Feb 2019 23:45 UTC
42
points
14
comments
6
min read
LW
link
HCH is not just Mechanical Turk
William_S
9 Feb 2019 0:46 UTC
42
points
6
comments
3
min read
LW
link
Major Donation: Long Term Future Fund Application Extended 1 Week
habryka
16 Feb 2019 23:30 UTC
42
points
3
comments
1
min read
LW
link
Philosophy as low-energy approximation
Charlie Steiner
5 Feb 2019 19:34 UTC
40
points
20
comments
3
min read
LW
link
The RAIN Framework for Informational Effectiveness
ozziegooen
13 Feb 2019 12:54 UTC
37
points
16
comments
6
min read
LW
link
Knowing I’m Being Tricked is Barely Enough
Elizabeth
26 Feb 2019 17:50 UTC
37
points
10
comments
2
min read
LW
link
(acesounderglass.com)
How to get value learning and reference wrong
Charlie Steiner
26 Feb 2019 20:22 UTC
37
points
2
comments
6
min read
LW
link
Implications of GPT-2
Gurkenglas
18 Feb 2019 10:57 UTC
36
points
28
comments
1
min read
LW
link
Some disjunctive reasons for urgency on AI risk
Wei Dai
15 Feb 2019 20:43 UTC
36
points
24
comments
1
min read
LW
link
New versions of posts in “Map and Territory” and “How To Actually Change Your Mind” are up (also, new revision system)
habryka
26 Feb 2019 3:17 UTC
36
points
3
comments
1
min read
LW
link
[Question]
When should we expect the education bubble to pop? How can we short it?
jacobjacob
9 Feb 2019 21:39 UTC
35
points
12
comments
1
min read
LW
link
Is the World Getting Better? A brief summary of recent debate
ErickBall
6 Feb 2019 17:38 UTC
35
points
8
comments
2
min read
LW
link
(capx.co)
Drexler on AI Risk
PeterMcCluskey
1 Feb 2019 5:11 UTC
35
points
10
comments
9
min read
LW
link
(www.bayesianinvestor.com)
EA grants available (to individuals)
Jameson Quinn
7 Feb 2019 15:17 UTC
34
points
8
comments
3
min read
LW
link
Can HCH epistemically dominate Ramanujan?
zhukeepa
23 Feb 2019 22:00 UTC
33
points
6
comments
2
min read
LW
link
[Question]
Why do you reject negative utilitarianism?
Teo Ajantaival
11 Feb 2019 15:38 UTC
32
points
27
comments
1
min read
LW
link
Complexity Penalties in Statistical Learning
michael_h
6 Feb 2019 4:13 UTC
31
points
3
comments
6
min read
LW
link
[Question]
Is LessWrong a “classic style intellectual world”?
Gordon Seidoh Worley
26 Feb 2019 21:33 UTC
29
points
6
comments
1
min read
LW
link