Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Raph Koster on Virtual Worlds vs Games (notes)
Raemon
18 Aug 2019 19:01 UTC
26
points
8
comments
2
min read
LW
link
“Can We Survive Technology” by von Neumann
Ben Pace
18 Aug 2019 18:58 UTC
32
points
2
comments
1
min read
LW
link
(geosci.uchicago.edu)
Prokaryote Multiverse. An argument that potential simulators do not have significantly more complex physics than ours
mako yass
18 Aug 2019 4:22 UTC
0
points
5
comments
2
min read
LW
link
Neural Nets in Python 1
lifelonglearner
18 Aug 2019 2:48 UTC
10
points
3
comments
8
min read
LW
link
Inspection Paradox as a Driver of Group Separation
Shmi
17 Aug 2019 21:47 UTC
29
points
0
comments
1
min read
LW
link
South Bay Meetup
David Friedman
17 Aug 2019 19:56 UTC
1
point
0
comments
1
min read
LW
link
Problems in AI Alignment that philosophers could potentially contribute to
Wei Dai
17 Aug 2019 17:38 UTC
78
points
14
comments
2
min read
LW
link
[Question]
How can you use music to boost learning?
Matthew Barnett
17 Aug 2019 6:59 UTC
11
points
1
comment
1
min read
LW
link
A Primer on Matrix Calculus, Part 3: The Chain Rule
Matthew Barnett
17 Aug 2019 1:50 UTC
12
points
4
comments
6
min read
LW
link
Nashville SSC September Meetup
friedelcraftiness
16 Aug 2019 15:16 UTC
1
point
0
comments
1
min read
LW
link
Beliefs Are For True Things
Davis_Kingsley
15 Aug 2019 23:23 UTC
8
points
5
comments
3
min read
LW
link
[Question]
What experiments would demonstrate “upper limits of augmented working memory?”
Raemon
15 Aug 2019 22:09 UTC
33
points
6
comments
2
min read
LW
link
Clarifying some key hypotheses in AI alignment
Ben Cottier
and
Rohin Shah
15 Aug 2019 21:29 UTC
79
points
12
comments
9
min read
LW
link
Tessercube — OpenPGP Made Mobile
Suji Yan
15 Aug 2019 9:34 UTC
4
points
0
comments
1
min read
LW
link
A Primer on Matrix Calculus, Part 2: Jacobians and other fun
Matthew Barnett
15 Aug 2019 1:13 UTC
22
points
7
comments
7
min read
LW
link
Partial summary of debate with Benquo and Jessicata [pt 1]
Raemon
14 Aug 2019 20:02 UTC
89
points
63
comments
22
min read
LW
link
3
reviews
“Designing agent incentives to avoid reward tampering”, DeepMind
gwern
14 Aug 2019 16:57 UTC
28
points
15
comments
1
min read
LW
link
(medium.com)
Subagents, trauma and rationality
Kaj_Sotala
14 Aug 2019 13:14 UTC
111
points
4
comments
19
min read
LW
link
Predicted AI alignment event/meeting calendar
rmoehn
14 Aug 2019 7:14 UTC
29
points
14
comments
1
min read
LW
link
Natural laws should be explicit constraints on strategy space
ryan_b
13 Aug 2019 20:22 UTC
8
points
6
comments
1
min read
LW
link
Distance Functions are Hard
Grue_Slinky
13 Aug 2019 17:33 UTC
31
points
19
comments
6
min read
LW
link
Book Review: Secular Cycles
Scott Alexander
13 Aug 2019 4:10 UTC
62
points
10
comments
16
min read
LW
link
1
review
(slatestarcodex.com)
A Primer on Matrix Calculus, Part 1: Basic review
Matthew Barnett
12 Aug 2019 23:44 UTC
25
points
4
comments
7
min read
LW
link
[Question]
What explanatory power does Kahneman’s System 2 possess?
Richard_Ngo
12 Aug 2019 15:23 UTC
31
points
2
comments
1
min read
LW
link
Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4)
Davidmanheim
12 Aug 2019 8:07 UTC
15
points
3
comments
4
min read
LW
link
Adjectives from the Future: The Dangers of Result-based Descriptions
Pradeep_Kumar
11 Aug 2019 19:19 UTC
19
points
8
comments
11
min read
LW
link
[Question]
Could we solve this email mess if we all moved to paid emails?
jacobjacob
11 Aug 2019 16:31 UTC
29
points
50
comments
4
min read
LW
link
AI Safety Reading Group
Søren Elverlin
11 Aug 2019 9:01 UTC
16
points
8
comments
1
min read
LW
link
[Question]
Does human choice have to be transitive in order to be rational/consistent?
jmh
11 Aug 2019 1:49 UTC
9
points
6
comments
1
min read
LW
link
Diana Fleischman and Geoffrey Miller—Audience Q&A
Jacob Falkovich
10 Aug 2019 22:37 UTC
38
points
6
comments
9
min read
LW
link
Intransitive Preferences You Can’t Pump
zulupineapple
9 Aug 2019 23:10 UTC
0
points
2
comments
1
min read
LW
link
Categorial preferences and utility functions
DavidHolmes
9 Aug 2019 21:36 UTC
10
points
6
comments
5
min read
LW
link
[Question]
What is the state of the ego depletion field?
Eli Tyre
9 Aug 2019 20:30 UTC
27
points
10
comments
1
min read
LW
link
Why Gradients Vanish and Explode
Matthew Barnett
9 Aug 2019 2:54 UTC
25
points
9
comments
3
min read
LW
link
AI Forecasting Dictionary (Forecasting infrastructure, part 1)
jacobjacob
and
bgold
8 Aug 2019 16:10 UTC
50
points
0
comments
5
min read
LW
link
[Question]
Why do humans not have built-in neural i/o channels?
Richard_Ngo
8 Aug 2019 13:09 UTC
25
points
23
comments
1
min read
LW
link
Which of these five AI alignment research projects ideas are no good?
rmoehn
8 Aug 2019 7:17 UTC
25
points
13
comments
1
min read
LW
link
Calibrating With Cards
lifelonglearner
8 Aug 2019 6:44 UTC
32
points
3
comments
3
min read
LW
link
[Question]
Is there a source/market for LW-related t-shirts?
jooyous
8 Aug 2019 4:30 UTC
8
points
3
comments
1
min read
LW
link
Verification and Transparency
DanielFilan
8 Aug 2019 1:50 UTC
35
points
6
comments
2
min read
LW
link
(danielfilan.com)
Toy model piece #2: Combining short and long range partial preferences
Stuart_Armstrong
8 Aug 2019 0:11 UTC
14
points
0
comments
4
min read
LW
link
Four Ways An Impact Measure Could Help Alignment
Matthew Barnett
8 Aug 2019 0:10 UTC
21
points
1
comment
9
min read
LW
link
Nashville August SSC Meetup
friedelcraftiness
7 Aug 2019 20:11 UTC
1
point
0
comments
1
min read
LW
link
In defense of Oracle (“Tool”) AI research
Steven Byrnes
7 Aug 2019 19:14 UTC
22
points
11
comments
4
min read
LW
link
Help forecast study replication in this social science prediction market
rosiecam
7 Aug 2019 18:18 UTC
29
points
3
comments
1
min read
LW
link
[Question]
Edit Nickname
Luigi Lotti
7 Aug 2019 17:42 UTC
5
points
1
comment
1
min read
LW
link
Self-Supervised Learning and AGI Safety
Steven Byrnes
7 Aug 2019 14:21 UTC
29
points
9
comments
12
min read
LW
link
Emotions are not beliefs
Chris_Leong
7 Aug 2019 6:27 UTC
25
points
2
comments
2
min read
LW
link
Understanding Recent Impact Measures
Matthew Barnett
7 Aug 2019 4:57 UTC
16
points
6
comments
7
min read
LW
link
[Site Update] Behind the scenes data-layer and caching improvements
habryka
7 Aug 2019 0:49 UTC
23
points
3
comments
1
min read
LW
link
Back to top
Next