Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
EA Kansas City planning meetup, discussion & open questions
samstowers
19 Feb 2020 23:12 UTC
7
points
1
comment
2
min read
LW
link
On unfixably unsafe AGI architectures
Steven Byrnes
19 Feb 2020 21:16 UTC
33
points
8
comments
5
min read
LW
link
[AN #87]: What might happen as deep learning scales even further?
Rohin Shah
19 Feb 2020 18:20 UTC
28
points
0
comments
4
min read
LW
link
(mailchi.mp)
Training Regime Day 5: TAPs
Mark Xu
19 Feb 2020 18:11 UTC
26
points
0
comments
7
min read
LW
link
[Question]
Does donating to EA make sense in light of the mere addition paradox ?
George3d6
19 Feb 2020 14:14 UTC
8
points
8
comments
2
min read
LW
link
Stuck Exploration
Chris_Leong
19 Feb 2020 12:31 UTC
14
points
6
comments
1
min read
LW
link
[Question]
Does anyone have a recommended resource about the research on behavioral conditioning, reinforcement, and shaping?
Eli Tyre
19 Feb 2020 3:58 UTC
7
points
3
comments
1
min read
LW
link
Assembling Sets for Contra
jefftk
19 Feb 2020 3:40 UTC
8
points
2
comments
3
min read
LW
link
(www.jefftk.com)
[Question]
Is there an intuitive way to explain how much better superforecasters are than regular forecasters?
William_S
19 Feb 2020 1:07 UTC
16
points
5
comments
1
min read
LW
link
We Want MoR (HPMOR Discussion Podcast) Completes Book One
moridinamael
19 Feb 2020 0:34 UTC
31
points
0
comments
1
min read
LW
link
[Question]
Is there software for goal factoring?
philip_b
18 Feb 2020 19:55 UTC
11
points
4
comments
1
min read
LW
link
What are information hazards?
MichaelA
18 Feb 2020 19:34 UTC
41
points
15
comments
4
min read
LW
link
Blog Post Day (Unofficial)
Daniel Kokotajlo
18 Feb 2020 19:05 UTC
49
points
8
comments
1
min read
LW
link
Big Yellow Tractor (Filk)
Gordon Seidoh Worley
18 Feb 2020 18:43 UTC
14
points
3
comments
1
min read
LW
link
Training Regime Day 4: Murphyjitsu
Mark Xu
18 Feb 2020 17:33 UTC
31
points
0
comments
7
min read
LW
link
[Question]
In a rational world is there a place for ideology?
wachichornia
18 Feb 2020 17:09 UTC
2
points
3
comments
1
min read
LW
link
(In)action rollouts
Stuart_Armstrong
18 Feb 2020 14:48 UTC
11
points
2
comments
2
min read
LW
link
Counterfactuals versus the laws of physics
Stuart_Armstrong
18 Feb 2020 13:21 UTC
16
points
0
comments
1
min read
LW
link
How to actually switch to an artificial body – Gradual remapping
George3d6
18 Feb 2020 13:19 UTC
9
points
3
comments
18
min read
LW
link
(blog.cerebralab.com)
Wireheading and discontinuity
Michele Campolo
18 Feb 2020 10:49 UTC
21
points
4
comments
3
min read
LW
link
Set Ups and Summaries
Elizabeth
18 Feb 2020 5:00 UTC
14
points
1
comment
4
min read
LW
link
(acesounderglass.com)
[Productivity] How not to use “Important // Not Urgent”
aaq
17 Feb 2020 23:42 UTC
22
points
0
comments
1
min read
LW
link
Training Regime Day 3: Tips and Tricks
Mark Xu
17 Feb 2020 18:53 UTC
24
points
5
comments
11
min read
LW
link
Cambridge (UK) SSC meetup
rlms
17 Feb 2020 18:41 UTC
1
point
0
comments
1
min read
LW
link
Jan Bloch’s Impossible War
Slimepriestess
17 Feb 2020 16:14 UTC
112
points
30
comments
5
min read
LW
link
(hivewired.wordpress.com)
Subagents and impact measures: summary tables
Stuart_Armstrong
17 Feb 2020 14:09 UTC
11
points
2
comments
1
min read
LW
link
Appendix: mathematics of indexical impact measures
Stuart_Armstrong
17 Feb 2020 13:22 UTC
12
points
0
comments
4
min read
LW
link
Stepwise inaction and non-indexical impact measures
Stuart_Armstrong
17 Feb 2020 10:32 UTC
12
points
7
comments
1
min read
LW
link
How to Lurk Less (and benefit others while benefiting yourself)
romeostevensit
17 Feb 2020 6:18 UTC
95
points
17
comments
2
min read
LW
link
Attainable Utility Preservation: Concepts
TurnTrout
17 Feb 2020 5:20 UTC
38
points
20
comments
1
min read
LW
link
On the falsifiability of hypercomputation, part 2: finite input streams
jessicata
17 Feb 2020 3:51 UTC
26
points
7
comments
4
min read
LW
link
(unstableontology.com)
[Question]
Wanting More Intellectual Stamina
mr.magpie
17 Feb 2020 2:58 UTC
6
points
7
comments
1
min read
LW
link
A Memetic Mediator Manifesto
Chris_Leong
17 Feb 2020 2:14 UTC
9
points
4
comments
1
min read
LW
link
(docs.google.com)
UML XI: Nearest Neighbor Schemes
Rafael Harth
16 Feb 2020 20:30 UTC
15
points
3
comments
9
min read
LW
link
[Link and commentary] The Offense-Defense Balance of Scientific Knowledge: Does Publishing AI Research Reduce Misuse?
MichaelA
16 Feb 2020 19:56 UTC
24
points
4
comments
3
min read
LW
link
Training Regime Day 2: Searching for bugs
Mark Xu
16 Feb 2020 17:16 UTC
31
points
2
comments
3
min read
LW
link
Taking the Outgroup Seriously
Davis_Kingsley
16 Feb 2020 13:23 UTC
21
points
8
comments
2
min read
LW
link
On characterizing heavy-tailedness
Jsevillamol
16 Feb 2020 0:14 UTC
38
points
6
comments
4
min read
LW
link
Training Regime Day 1: What is applied rationality?
Mark Xu
15 Feb 2020 21:03 UTC
33
points
7
comments
4
min read
LW
link
[Question]
It “wanted” …
jmh
15 Feb 2020 20:52 UTC
4
points
7
comments
1
min read
LW
link
Why Science is slowing down, Universities and Maslow’s hierarchy of needs
George3d6
15 Feb 2020 20:39 UTC
16
points
25
comments
10
min read
LW
link
Exercises in Comprehensive Information Gathering
johnswentworth
15 Feb 2020 17:27 UTC
141
points
18
comments
3
min read
LW
link
1
review
Reference Post: Trivial Decision Theory Problem
Chris_Leong
15 Feb 2020 17:13 UTC
16
points
4
comments
2
min read
LW
link
[Question]
What is the difference between robustness and inner alignment?
JanB
15 Feb 2020 13:28 UTC
9
points
2
comments
1
min read
LW
link
[Question]
Does iterated amplification tackle the inner alignment problem?
JanB
15 Feb 2020 12:58 UTC
7
points
4
comments
1
min read
LW
link
Bayesian Evolving-to-Extinction
abramdemski
14 Feb 2020 23:55 UTC
40
points
13
comments
5
min read
LW
link
[Question]
A ‘Practice of Rationality’ Sequence?
abramdemski
14 Feb 2020 22:56 UTC
78
points
25
comments
3
min read
LW
link
The Catastrophic Convergence Conjecture
TurnTrout
14 Feb 2020 21:16 UTC
45
points
16
comments
8
min read
LW
link
The Reasonable Effectiveness of Mathematics or: AI vs sandwiches
Vanessa Kosoy
14 Feb 2020 18:46 UTC
34
points
8
comments
9
min read
LW
link
1
review
Perceptrons Explained
lifelonglearner
14 Feb 2020 17:34 UTC
13
points
2
comments
1
min read
LW
link
(owenshen24.github.io)
Back to top
Next