Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Research agenda for AI safety and a better civilization
agilecaveman
Jul 22, 2020, 6:35 AM
12
points
2
comments
16
min read
LW
link
More Right
Adam Zerner
Jul 22, 2020, 3:36 AM
22
points
29
comments
4
min read
LW
link
[not ongoing] Thoughts on Proportional voting methods
Jameson Quinn
Jul 22, 2020, 2:46 AM
32
points
53
comments
46
min read
LW
link
[Preprint] The Computational Limits of Deep Learning
Gordon Seidoh Worley
Jul 21, 2020, 9:25 PM
9
points
4
comments
1
min read
LW
link
(arxiv.org)
Fresh Bread
Zvi
Jul 21, 2020, 8:40 PM
22
points
1
comment
2
min read
LW
link
(thezvi.wordpress.com)
Competition: Amplify Rohin’s Prediction on AGI researchers & Safety Concerns
stuhlmueller
Jul 21, 2020, 8:06 PM
83
points
41
comments
3
min read
LW
link
Alignment As A Bottleneck To Usefulness Of GPT-3
johnswentworth
Jul 21, 2020, 8:02 PM
111
points
57
comments
3
min read
LW
link
How good is humanity at coordination?
Buck
Jul 21, 2020, 8:01 PM
82
points
44
comments
3
min read
LW
link
$1000 bounty for OpenAI to show whether GPT3 was “deliberately” pretending to be stupider than it is
Bird Concept
Jul 21, 2020, 6:42 PM
56
points
39
comments
2
min read
LW
link
(twitter.com)
[Question]
What are the limits of self-education?
nitropie
Jul 21, 2020, 6:01 PM
3
points
2
comments
1
min read
LW
link
[Meta] anonymous merit or public status
[anonymous]
Jul 21, 2020, 6:01 PM
6
points
4
comments
1
min read
LW
link
AI Benefits Post 5: Outstanding Questions on Governing Benefits
Cullen
Jul 21, 2020, 4:46 PM
4
points
0
comments
4
min read
LW
link
The “AI Dungeons” Dragon Model is heavily path dependent (testing GPT-3 on ethics)
Rafael Harth
Jul 21, 2020, 12:14 PM
44
points
9
comments
6
min read
LW
link
Uncalibrated quantum experiments act clasically
justinpombrio
Jul 21, 2020, 5:31 AM
18
points
12
comments
8
min read
LW
link
The Rediscovery of Interiority in Machine Learning
DanB
Jul 21, 2020, 5:02 AM
5
points
4
comments
LW
link
(danburfoot.net)
Chains, Bottlenecks and Optimization
curi
Jul 21, 2020, 2:07 AM
14
points
12
comments
4
min read
LW
link
“Can you keep this confidential? How do you know?”
Raemon
Jul 21, 2020, 12:33 AM
164
points
43
comments
3
min read
LW
link
2
reviews
Parallels Between AI Safety by Debate and Evidence Law
Cullen
Jul 20, 2020, 10:52 PM
10
points
1
comment
2
min read
LW
link
(cullenokeefe.com)
Thiel on Progress and Stagnation
Richard_Ngo
Jul 20, 2020, 8:27 PM
173
points
32
comments
11
min read
LW
link
(docs.google.com)
Learning Values in Practice
Stuart_Armstrong
Jul 20, 2020, 6:38 PM
24
points
0
comments
5
min read
LW
link
Inefficient doesn’t mean indifferent, but it might mean wimpy.
DirectedEvolution
Jul 20, 2020, 6:27 PM
14
points
3
comments
5
min read
LW
link
[Question]
To what extent is GPT-3 capable of reasoning?
TurnTrout
Jul 20, 2020, 5:10 PM
70
points
73
comments
16
min read
LW
link
Selling real estate: should you overprice or underprice?
Steven Byrnes
Jul 20, 2020, 3:54 PM
19
points
5
comments
10
min read
LW
link
[Question]
“Do Nothing” utility function, 3½ years later?
niplav
Jul 20, 2020, 11:09 AM
5
points
3
comments
1
min read
LW
link
Operationalizing Interpretability
lifelonglearner
Jul 20, 2020, 5:22 AM
20
points
0
comments
4
min read
LW
link
Use resilience, instead of imprecision, to communicate uncertainty
habryka
20 Jul 2020 5:08 UTC
3
points
1
comment
1
min read
LW
link
(forum.effectivealtruism.org)
What Would I Do? Self-prediction in Simple Algorithms
Scott Garrabrant
20 Jul 2020 4:27 UTC
65
points
12
comments
5
min read
LW
link
“Should Blackmail Be Legal” Hanson/Zvi Debate (Sun July 26th, 3pm PDT)
Ben Pace
20 Jul 2020 4:06 UTC
36
points
13
comments
1
min read
LW
link
The 8 Techniques to Tolerify the Dark World
adamShimi
20 Jul 2020 0:58 UTC
2
points
5
comments
2
min read
LW
link
Praise of some popular LW articles
DirectedEvolution
20 Jul 2020 0:32 UTC
40
points
1
comment
7
min read
LW
link
Types Of Online Meetups
Dan B
19 Jul 2020 23:51 UTC
4
points
2
comments
2
min read
LW
link
Musical Outgroups
eapache
19 Jul 2020 22:55 UTC
9
points
1
comment
4
min read
LW
link
Forum Assisted Discussion
Dan B
19 Jul 2020 22:38 UTC
9
points
0
comments
3
min read
LW
link
Pulse and Glide Cycling
jefftk
19 Jul 2020 19:02 UTC
11
points
5
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
Math. proof of the superiority of independent guesses?
Milton
19 Jul 2020 2:38 UTC
−3
points
7
comments
1
min read
LW
link
Criticism of some popular LW articles
DirectedEvolution
19 Jul 2020 1:16 UTC
71
points
19
comments
6
min read
LW
link
Swiss Political System: More than You ever Wanted to Know (I.)
Martin Sustrik
19 Jul 2020 1:11 UTC
173
points
39
comments
24
min read
LW
link
2
reviews
[Question]
Why is pseudo-alignment “worse” than other ways ML can fail to generalize?
nostalgebraist
18 Jul 2020 22:54 UTC
45
points
9
comments
2
min read
LW
link
Against Reopening Ottawa
eapache
18 Jul 2020 20:08 UTC
6
points
2
comments
5
min read
LW
link
Collection of GPT-3 results
Kaj_Sotala
18 Jul 2020 20:04 UTC
89
points
24
comments
1
min read
LW
link
(twitter.com)
[Question]
Is there an easy way to turn a LW sequence into an epub?
ChristianKl
18 Jul 2020 18:20 UTC
17
points
9
comments
1
min read
LW
link
Calibrate words, not just probabilities
MikkW
18 Jul 2020 5:56 UTC
11
points
3
comments
2
min read
LW
link
[Question]
Erving Goffman’s ‘paper’
Saffron
18 Jul 2020 1:12 UTC
5
points
2
comments
1
min read
LW
link
Lessons on AI Takeover from the conquistadors
Daniel Kokotajlo
and
Bird Concept
17 Jul 2020 22:35 UTC
61
points
31
comments
6
min read
LW
link
[Question]
Can an agent use interactive proofs to check the alignment of succesors?
PabloAMC
17 Jul 2020 19:07 UTC
7
points
2
comments
1
min read
LW
link
Anthropomorphizing Humans
johnswentworth
17 Jul 2020 17:49 UTC
46
points
6
comments
2
min read
LW
link
Telling more rational stories
DirectedEvolution
17 Jul 2020 17:47 UTC
26
points
21
comments
3
min read
LW
link
Solving Math Problems by Relay
Ben Goldhaber
and
Owain_Evans
17 Jul 2020 15:32 UTC
103
points
26
comments
7
min read
LW
link
[Question]
What are the best tools you have seen to keep track of knowledge around testable statements?
migueltorrescosta
17 Jul 2020 15:02 UTC
2
points
1
comment
1
min read
LW
link
Environments as a bottleneck in AGI development
Richard_Ngo
17 Jul 2020 5:02 UTC
41
points
19
comments
6
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel