Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
2
Reference Post: Trivial Decision Theory Problem
Chris_Leong
Feb 15, 2020, 5:13 PM
16
points
4
comments
2
min read
LW
link
[Question]
What is the difference between robustness and inner alignment?
JanB
Feb 15, 2020, 1:28 PM
9
points
2
comments
1
min read
LW
link
[Question]
Does iterated amplification tackle the inner alignment problem?
JanB
Feb 15, 2020, 12:58 PM
7
points
4
comments
1
min read
LW
link
Bayesian Evolving-to-Extinction
abramdemski
Feb 14, 2020, 11:55 PM
40
points
13
comments
5
min read
LW
link
[Question]
A ‘Practice of Rationality’ Sequence?
abramdemski
Feb 14, 2020, 10:56 PM
79
points
25
comments
3
min read
LW
link
The Catastrophic Convergence Conjecture
TurnTrout
Feb 14, 2020, 9:16 PM
45
points
16
comments
8
min read
LW
link
The Reasonable Effectiveness of Mathematics or: AI vs sandwiches
Vanessa Kosoy
Feb 14, 2020, 6:46 PM
34
points
8
comments
9
min read
LW
link
1
review
Perceptrons Explained
lifelonglearner
Feb 14, 2020, 5:34 PM
13
points
2
comments
1
min read
LW
link
(owenshen24.github.io)
Please Help Metaculus Forecast COVID-19
AABoyles
Feb 14, 2020, 5:31 PM
34
points
0
comments
1
min read
LW
link
(www.metaculus.com)
Training Regime Day 0: Introduction
Mark Xu
Feb 14, 2020, 8:22 AM
41
points
4
comments
2
min read
LW
link
Distinguishing definitions of takeoff
Matthew Barnett
Feb 14, 2020, 12:16 AM
79
points
6
comments
6
min read
LW
link
Effective Altruism 80,000 hours workshop materials & outline (and Feb 10 ’19 KC meetup notes)
samstowers
Feb 13, 2020, 9:48 PM
5
points
0
comments
2
min read
LW
link
[Question]
How do you use face masks?
ChristianKl
Feb 13, 2020, 2:18 PM
12
points
1
comment
1
min read
LW
link
In theory: does building the subagent have an “impact”?
Stuart_Armstrong
Feb 13, 2020, 2:17 PM
17
points
4
comments
4
min read
LW
link
[Question]
What fraction of work time in the world is done at a computer?
Mati_Roy
Feb 13, 2020, 9:53 AM
9
points
0
comments
1
min read
LW
link
A Variance Indifferent Maximizer Alternative
Nevan Wichers
Feb 13, 2020, 9:06 AM
7
points
1
comment
4
min read
LW
link
Confirmation Bias As Misfire Of Normal Bayesian Reasoning
Scott Alexander
Feb 13, 2020, 7:20 AM
43
points
9
comments
2
min read
LW
link
(slatestarcodex.com)
Building and using the subagent
Stuart_Armstrong
Feb 12, 2020, 7:28 PM
17
points
3
comments
2
min read
LW
link
[AN #86]: Improving debate and factored cognition through human experiments
Rohin Shah
Feb 12, 2020, 6:10 PM
15
points
0
comments
9
min read
LW
link
(mailchi.mp)
Suspiciously balanced evidence
gjm
Feb 12, 2020, 5:04 PM
50
points
24
comments
4
min read
LW
link
[Question]
What are the risks of having your genome publicly available?
Mati_Roy
Feb 11, 2020, 9:54 PM
16
points
13
comments
LW
link
Demons in Imperfect Search
johnswentworth
Feb 11, 2020, 8:25 PM
110
points
21
comments
3
min read
LW
link
[Question]
Will COVID-19 survivors suffer lasting disability at a high rate?
jimrandomh
Feb 11, 2020, 8:23 PM
134
points
11
comments
1
min read
LW
link
The Relational Stance
Raemon
Feb 11, 2020, 5:16 AM
48
points
11
comments
8
min read
LW
link
Intelligence without causality
Donald Hobson
Feb 11, 2020, 12:34 AM
9
points
0
comments
2
min read
LW
link
South Bay Meetup
DavidFriedman
Feb 10, 2020, 10:36 PM
4
points
0
comments
LW
link
Simulation of technological progress (work in progress)
Daniel Kokotajlo
Feb 10, 2020, 8:39 PM
21
points
9
comments
5
min read
LW
link
[Question]
Why do we refuse to take action claiming our impact would be too small?
hookdump
Feb 10, 2020, 7:33 PM
5
points
31
comments
1
min read
LW
link
Gricean communication and meta-preferences
Charlie Steiner
Feb 10, 2020, 5:05 AM
24
points
0
comments
3
min read
LW
link
Attainable Utility Landscape: How The World Is Changed
TurnTrout
Feb 10, 2020, 12:58 AM
52
points
7
comments
6
min read
LW
link
A Simple Introduction to Neural Networks
Rafael Harth
Feb 9, 2020, 10:02 PM
34
points
13
comments
18
min read
LW
link
[Question]
Did AI pioneers not worry much about AI risks?
lisperati
Feb 9, 2020, 7:58 PM
42
points
9
comments
1
min read
LW
link
[Question]
Source of Karma
jmh
Feb 9, 2020, 2:13 PM
4
points
14
comments
1
min read
LW
link
State Space of X-Risk Trajectories
David_Kristoffersson
Feb 9, 2020, 1:56 PM
11
points
0
comments
9
min read
LW
link
[Question]
Does there exist an AGI-level parameter setting for modern DRL architectures?
TurnTrout
Feb 9, 2020, 5:09 AM
15
points
3
comments
1
min read
LW
link
[Question]
Who… (or what) designed this site and where did they come from?
thedayismine
Feb 9, 2020, 4:04 AM
12
points
3
comments
1
min read
LW
link
How to Frame Negative Feedback as Forward-Facing Guidance
Liron
Feb 9, 2020, 2:47 AM
46
points
7
comments
3
min read
LW
link
Relationship Outcomes Are Not Particularly Sensitive to Small Variations in Verbal Ability
Zack_M_Davis
Feb 9, 2020, 12:34 AM
14
points
2
comments
1
min read
LW
link
(zackmdavis.net)
What can the principal-agent literature tell us about AI risk?
apc
Feb 8, 2020, 9:28 PM
104
points
29
comments
16
min read
LW
link
A Cautionary Note on Unlocking the Emotional Brain
eapache
Feb 8, 2020, 5:21 PM
55
points
20
comments
2
min read
LW
link
[Question]
What is this review feature?
Long try
Feb 8, 2020, 3:30 PM
1
point
1
comment
1
min read
LW
link
Halifax SSC Meetup—FEB 8
interstice
Feb 8, 2020, 12:45 AM
4
points
0
comments
1
min read
LW
link
On the falsifiability of hypercomputation
jessicata
Feb 7, 2020, 8:16 AM
24
points
4
comments
4
min read
LW
link
(unstableontology.com)
More writeups!
jefftk
Feb 7, 2020, 3:10 AM
40
points
5
comments
1
min read
LW
link
(www.jefftk.com)
Book Review: Decisive by Chip and Dan Heath
Ian David Moss
Feb 6, 2020, 8:15 PM
4
points
0
comments
2
min read
LW
link
(medium.com)
Bayes-Up: An App for Sharing Bayesian-MCQ
Louis Faucon
Feb 6, 2020, 7:01 PM
53
points
9
comments
1
min read
LW
link
Mazes Sequence Roundup: Final Thoughts and Paths Forward
Zvi
Feb 6, 2020, 4:10 PM
88
points
28
comments
14
min read
LW
link
1
review
(thezvi.wordpress.com)
Plausibly, almost every powerful algorithm would be manipulative
Stuart_Armstrong
Feb 6, 2020, 11:50 AM
38
points
25
comments
3
min read
LW
link
Some quick notes on hand hygiene
willbradshaw
Feb 6, 2020, 2:47 AM
68
points
52
comments
3
min read
LW
link
Potential Research Topic: Vingean Reflection, Value Alignment and Aspiration
Vaughn Papenhausen
Feb 6, 2020, 1:09 AM
15
points
4
comments
4
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel