Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
Effective Altruism 80,000 hours workshop materials & outline (and Feb 10 ’19 KC meetup notes)
samstowers
13 Feb 2020 21:48 UTC
5
points
0
comments
2
min read
LW
link
[Question]
How do you use face masks?
ChristianKl
13 Feb 2020 14:18 UTC
12
points
1
comment
1
min read
LW
link
In theory: does building the subagent have an “impact”?
Stuart_Armstrong
13 Feb 2020 14:17 UTC
17
points
4
comments
4
min read
LW
link
[Question]
What fraction of work time in the world is done at a computer?
Mati_Roy
13 Feb 2020 9:53 UTC
9
points
0
comments
1
min read
LW
link
A Variance Indifferent Maximizer Alternative
Nevan Wichers
13 Feb 2020 9:06 UTC
7
points
1
comment
4
min read
LW
link
Confirmation Bias As Misfire Of Normal Bayesian Reasoning
Scott Alexander
13 Feb 2020 7:20 UTC
43
points
9
comments
2
min read
LW
link
(slatestarcodex.com)
Building and using the subagent
Stuart_Armstrong
12 Feb 2020 19:28 UTC
17
points
3
comments
2
min read
LW
link
[AN #86]: Improving debate and factored cognition through human experiments
Rohin Shah
12 Feb 2020 18:10 UTC
15
points
0
comments
9
min read
LW
link
(mailchi.mp)
Suspiciously balanced evidence
gjm
12 Feb 2020 17:04 UTC
50
points
24
comments
4
min read
LW
link
[Question]
What are the risks of having your genome publicly available?
Mati_Roy
11 Feb 2020 21:54 UTC
16
points
13
comments
1
min read
LW
link
Demons in Imperfect Search
johnswentworth
11 Feb 2020 20:25 UTC
107
points
21
comments
3
min read
LW
link
[Question]
Will COVID-19 survivors suffer lasting disability at a high rate?
jimrandomh
11 Feb 2020 20:23 UTC
134
points
11
comments
1
min read
LW
link
The Relational Stance
Raemon
11 Feb 2020 5:16 UTC
47
points
11
comments
8
min read
LW
link
Intelligence without causality
Donald Hobson
11 Feb 2020 0:34 UTC
9
points
0
comments
2
min read
LW
link
South Bay Meetup
DavidFriedman
10 Feb 2020 22:36 UTC
4
points
0
comments
1
min read
LW
link
Simulation of technological progress (work in progress)
Daniel Kokotajlo
10 Feb 2020 20:39 UTC
21
points
9
comments
5
min read
LW
link
[Question]
Why do we refuse to take action claiming our impact would be too small?
hookdump
10 Feb 2020 19:33 UTC
5
points
31
comments
1
min read
LW
link
Gricean communication and meta-preferences
Charlie Steiner
10 Feb 2020 5:05 UTC
24
points
0
comments
3
min read
LW
link
Attainable Utility Landscape: How The World Is Changed
TurnTrout
10 Feb 2020 0:58 UTC
52
points
7
comments
6
min read
LW
link
A Simple Introduction to Neural Networks
Rafael Harth
9 Feb 2020 22:02 UTC
34
points
13
comments
18
min read
LW
link
[Question]
Did AI pioneers not worry much about AI risks?
lisperati
9 Feb 2020 19:58 UTC
42
points
9
comments
1
min read
LW
link
[Question]
Source of Karma
jmh
9 Feb 2020 14:13 UTC
4
points
14
comments
1
min read
LW
link
State Space of X-Risk Trajectories
David_Kristoffersson
9 Feb 2020 13:56 UTC
11
points
0
comments
9
min read
LW
link
[Question]
Does there exist an AGI-level parameter setting for modern DRL architectures?
TurnTrout
9 Feb 2020 5:09 UTC
15
points
3
comments
1
min read
LW
link
[Question]
Who… (or what) designed this site and where did they come from?
thedayismine
9 Feb 2020 4:04 UTC
12
points
3
comments
1
min read
LW
link
How to Frame Negative Feedback as Forward-Facing Guidance
Liron
9 Feb 2020 2:47 UTC
46
points
7
comments
3
min read
LW
link
Relationship Outcomes Are Not Particularly Sensitive to Small Variations in Verbal Ability
Zack_M_Davis
9 Feb 2020 0:34 UTC
14
points
2
comments
1
min read
LW
link
(zackmdavis.net)
What can the principal-agent literature tell us about AI risk?
apc
8 Feb 2020 21:28 UTC
104
points
29
comments
16
min read
LW
link
A Cautionary Note on Unlocking the Emotional Brain
eapache
8 Feb 2020 17:21 UTC
54
points
20
comments
2
min read
LW
link
[Question]
What is this review feature?
Long try
8 Feb 2020 15:30 UTC
1
point
1
comment
1
min read
LW
link
Halifax SSC Meetup—FEB 8
interstice
8 Feb 2020 0:45 UTC
4
points
0
comments
1
min read
LW
link
On the falsifiability of hypercomputation
jessicata
7 Feb 2020 8:16 UTC
24
points
4
comments
4
min read
LW
link
(unstableontology.com)
More writeups!
jefftk
7 Feb 2020 3:10 UTC
40
points
5
comments
1
min read
LW
link
(www.jefftk.com)
Book Review: Decisive by Chip and Dan Heath
Ian David Moss
6 Feb 2020 20:15 UTC
4
points
0
comments
2
min read
LW
link
(medium.com)
Bayes-Up: An App for Sharing Bayesian-MCQ
Louis Faucon
6 Feb 2020 19:01 UTC
53
points
9
comments
1
min read
LW
link
Mazes Sequence Roundup: Final Thoughts and Paths Forward
Zvi
6 Feb 2020 16:10 UTC
88
points
28
comments
14
min read
LW
link
1
review
(thezvi.wordpress.com)
Plausibly, almost every powerful algorithm would be manipulative
Stuart_Armstrong
6 Feb 2020 11:50 UTC
38
points
25
comments
3
min read
LW
link
Some quick notes on hand hygiene
willbradshaw
6 Feb 2020 2:47 UTC
68
points
52
comments
3
min read
LW
link
Potential Research Topic: Vingean Reflection, Value Alignment and Aspiration
Vaughn Papenhausen
6 Feb 2020 1:09 UTC
15
points
4
comments
4
min read
LW
link
Synthesizing amplification and debate
evhub
5 Feb 2020 22:53 UTC
33
points
10
comments
4
min read
LW
link
Writeup: Progress on AI Safety via Debate
Beth Barnes
and
paulfchristiano
5 Feb 2020 21:04 UTC
102
points
18
comments
33
min read
LW
link
[AN #85]: The normative questions we should be asking for AI alignment, and a surprisingly good chatbot
Rohin Shah
5 Feb 2020 18:20 UTC
14
points
2
comments
7
min read
LW
link
(mailchi.mp)
The Adventure: a new Utopia story
Stuart_Armstrong
5 Feb 2020 16:50 UTC
100
points
37
comments
51
min read
LW
link
“But that’s your job”: why organisations can work
Stuart_Armstrong
5 Feb 2020 12:25 UTC
77
points
12
comments
4
min read
LW
link
Training a tiny SupAmp model on easy tasks. The influence of failure rate on learning curves
rmoehn
5 Feb 2020 7:22 UTC
5
points
0
comments
1
min read
LW
link
Physical alignment—do you have it? Take a minute & check.
leggi
5 Feb 2020 4:02 UTC
4
points
4
comments
1
min read
LW
link
Open & Welcome Thread—February 2020
ryan_b
4 Feb 2020 20:49 UTC
17
points
114
comments
1
min read
LW
link
Meta-Preference Utilitarianism
B Jacobs
4 Feb 2020 20:24 UTC
10
points
30
comments
1
min read
LW
link
Philosophical self-ratification
jessicata
3 Feb 2020 22:48 UTC
23
points
13
comments
5
min read
LW
link
(unstableontology.com)
Twenty-three AI alignment research project definitions
rmoehn
3 Feb 2020 22:21 UTC
23
points
0
comments
6
min read
LW
link
Back to top
Next