UML XI: Nearest Neighbor Schemes | Rafael Harth | 16 Feb 2020 20:30 UTC | 15 points | 3 comments | 9 min read
[Link and commentary] The Offense-Defense Balance of Scientific Knowledge: Does Publishing AI Research Reduce Misuse? | MichaelA | 16 Feb 2020 19:56 UTC | 24 points | 4 comments | 3 min read
Training Regime Day 2: Searching for bugs | Mark Xu | 16 Feb 2020 17:16 UTC | 31 points | 2 comments | 3 min read
Taking the Outgroup Seriously | Davis_Kingsley | 16 Feb 2020 13:23 UTC | 21 points | 8 comments | 2 min read
On characterizing heavy-tailedness | Jsevillamol | 16 Feb 2020 0:14 UTC | 38 points | 6 comments | 4 min read
Training Regime Day 1: What is applied rationality? | Mark Xu | 15 Feb 2020 21:03 UTC | 33 points | 7 comments | 4 min read
[Question] It “wanted” … | jmh | 15 Feb 2020 20:52 UTC | 4 points | 7 comments | 1 min read
Why Science is slowing down, Universities and Maslow’s hierarchy of needs | George3d6 | 15 Feb 2020 20:39 UTC | 16 points | 25 comments | 10 min read
Exercises in Comprehensive Information Gathering | johnswentworth | 15 Feb 2020 17:27 UTC | 141 points | 18 comments | 3 min read | 1 review
Reference Post: Trivial Decision Theory Problem | Chris_Leong | 15 Feb 2020 17:13 UTC | 16 points | 4 comments | 2 min read
[Question] What is the difference between robustness and inner alignment? | JanB | 15 Feb 2020 13:28 UTC | 9 points | 2 comments | 1 min read
[Question] Does iterated amplification tackle the inner alignment problem? | JanB | 15 Feb 2020 12:58 UTC | 7 points | 4 comments | 1 min read
Bayesian Evolving-to-Extinction | abramdemski | 14 Feb 2020 23:55 UTC | 40 points | 13 comments | 5 min read
[Question] A ‘Practice of Rationality’ Sequence? | abramdemski | 14 Feb 2020 22:56 UTC | 78 points | 25 comments | 3 min read
The Catastrophic Convergence Conjecture | TurnTrout | 14 Feb 2020 21:16 UTC | 45 points | 16 comments | 8 min read
The Reasonable Effectiveness of Mathematics or: AI vs sandwiches | Vanessa Kosoy | 14 Feb 2020 18:46 UTC | 34 points | 8 comments | 9 min read | 1 review
Perceptrons Explained (owenshen24.github.io) | lifelonglearner | 14 Feb 2020 17:34 UTC | 13 points | 2 comments | 1 min read
Please Help Metaculus Forecast COVID-19 (www.metaculus.com) | AABoyles | 14 Feb 2020 17:31 UTC | 34 points | 0 comments | 1 min read
Training Regime Day 0: Introduction | Mark Xu | 14 Feb 2020 8:22 UTC | 40 points | 4 comments | 2 min read
Distinguishing definitions of takeoff | Matthew Barnett | 14 Feb 2020 0:16 UTC | 79 points | 6 comments | 6 min read
Effective Altruism 80,000 hours workshop materials & outline (and Feb 10 ’19 KC meetup notes) | samstowers | 13 Feb 2020 21:48 UTC | 5 points | 0 comments | 2 min read
[Question] How do you use face masks? | ChristianKl | 13 Feb 2020 14:18 UTC | 12 points | 1 comment | 1 min read
In theory: does building the subagent have an “impact”? | Stuart_Armstrong | 13 Feb 2020 14:17 UTC | 17 points | 4 comments | 4 min read
[Question] What fraction of work time in the world is done at a computer? | Mati_Roy | 13 Feb 2020 9:53 UTC | 9 points | 0 comments | 1 min read
A Variance Indifferent Maximizer Alternative | Nevan Wichers | 13 Feb 2020 9:06 UTC | 7 points | 1 comment | 4 min read
Confirmation Bias As Misfire Of Normal Bayesian Reasoning (slatestarcodex.com) | Scott Alexander | 13 Feb 2020 7:20 UTC | 43 points | 9 comments | 2 min read
Building and using the subagent | Stuart_Armstrong | 12 Feb 2020 19:28 UTC | 17 points | 3 comments | 2 min read
[AN #86]: Improving debate and factored cognition through human experiments (mailchi.mp) | Rohin Shah | 12 Feb 2020 18:10 UTC | 15 points | 0 comments | 9 min read
Suspiciously balanced evidence | gjm | 12 Feb 2020 17:04 UTC | 50 points | 24 comments | 4 min read
[Question] What are the risks of having your genome publicly available? | Mati_Roy | 11 Feb 2020 21:54 UTC | 16 points | 13 comments | 1 min read
Demons in Imperfect Search | johnswentworth | 11 Feb 2020 20:25 UTC | 107 points | 21 comments | 3 min read
[Question] Will COVID-19 survivors suffer lasting disability at a high rate? | jimrandomh | 11 Feb 2020 20:23 UTC | 134 points | 11 comments | 1 min read
The Relational Stance | Raemon | 11 Feb 2020 5:16 UTC | 48 points | 11 comments | 8 min read
Intelligence without causality | Donald Hobson | 11 Feb 2020 0:34 UTC | 9 points | 0 comments | 2 min read
South Bay Meetup | DavidFriedman | 10 Feb 2020 22:36 UTC | 4 points | 0 comments | 1 min read
Simulation of technological progress (work in progress) | Daniel Kokotajlo | 10 Feb 2020 20:39 UTC | 21 points | 9 comments | 5 min read
[Question] Why do we refuse to take action claiming our impact would be too small? | hookdump | 10 Feb 2020 19:33 UTC | 5 points | 31 comments | 1 min read
Gricean communication and meta-preferences | Charlie Steiner | 10 Feb 2020 5:05 UTC | 24 points | 0 comments | 3 min read
Attainable Utility Landscape: How The World Is Changed | TurnTrout | 10 Feb 2020 0:58 UTC | 52 points | 7 comments | 6 min read
A Simple Introduction to Neural Networks | Rafael Harth | 9 Feb 2020 22:02 UTC | 34 points | 13 comments | 18 min read
[Question] Did AI pioneers not worry much about AI risks? | lisperati | 9 Feb 2020 19:58 UTC | 42 points | 9 comments | 1 min read
[Question] Source of Karma | jmh | 9 Feb 2020 14:13 UTC | 4 points | 14 comments | 1 min read
State Space of X-Risk Trajectories | David_Kristoffersson | 9 Feb 2020 13:56 UTC | 11 points | 0 comments | 9 min read
[Question] Does there exist an AGI-level parameter setting for modern DRL architectures? | TurnTrout | 9 Feb 2020 5:09 UTC | 15 points | 3 comments | 1 min read
[Question] Who… (or what) designed this site and where did they come from? | thedayismine | 9 Feb 2020 4:04 UTC | 12 points | 3 comments | 1 min read
How to Frame Negative Feedback as Forward-Facing Guidance | Liron | 9 Feb 2020 2:47 UTC | 46 points | 7 comments | 3 min read
Relationship Outcomes Are Not Particularly Sensitive to Small Variations in Verbal Ability (zackmdavis.net) | Zack_M_Davis | 9 Feb 2020 0:34 UTC | 14 points | 2 comments | 1 min read
What can the principal-agent literature tell us about AI risk? | apc | 8 Feb 2020 21:28 UTC | 104 points | 29 comments | 16 min read
A Cautionary Note on Unlocking the Emotional Brain | eapache | 8 Feb 2020 17:21 UTC | 54 points | 20 comments | 2 min read
[Question] What is this review feature? | Long try | 8 Feb 2020 15:30 UTC | 1 point | 1 comment | 1 min read