LessLong Launch Party | Raemon | 23 Aug 2019 22:18 UTC | 12 points | 1 comment | 1 min read
[Question] Is there a simple parameter that controls human working memory capacity, which has been set tragically low? | Liron | 23 Aug 2019 22:10 UTC | 17 points | 8 comments | 1 min read
Optimization Provenance | Adele Lopez | 23 Aug 2019 20:08 UTC | 38 points | 5 comments | 5 min read
Troll Bridge | abramdemski | 23 Aug 2019 18:36 UTC | 86 points | 59 comments | 12 min read
Understanding understanding | mthq | 23 Aug 2019 18:10 UTC | 24 points | 1 comment | 2 min read
Actually updating | SaraHax | 23 Aug 2019 17:46 UTC | 56 points | 10 comments | 4 min read
When do utility functions constrain? | Hoagy | 23 Aug 2019 17:19 UTC | 30 points | 8 comments | 7 min read
Parables of Constraint and Actualization | Spencer Wyman | 23 Aug 2019 16:56 UTC | 13 points | 0 comments | 6 min read
Thoughts on Retrieving Knowledge from Neural Networks | Jaime Ruiz | 23 Aug 2019 16:41 UTC | 11 points | 2 comments | 5 min read
Algorithmic Similarity | LukasM | 23 Aug 2019 16:39 UTC | 28 points | 10 comments | 11 min read
Soft takeoff can still lead to decisive strategic advantage | Daniel Kokotajlo | 23 Aug 2019 16:39 UTC | 122 points | 47 comments | 8 min read | 4 reviews
Moscow LW meetup in “Nauchka” library | Alexander230 | 23 Aug 2019 12:40 UTC | 3 points | 0 comments | 1 min read
OpenGPT-2: We Replicated GPT-2 Because You Can Too | avturchin | 23 Aug 2019 11:32 UTC | 18 points | 0 comments | 1 min read | (medium.com)
Torture and Dust Specks and Joy—Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces | Louis_Brown | 23 Aug 2019 11:11 UTC | 19 points | 29 comments | 8 min read
Metalignment: Deconfusing metaethics for AI alignment. | Guillaume Corlouer | 23 Aug 2019 10:25 UTC | 13 points | 7 comments | 3 min read
[Question] A basic probability question | Shmi | 23 Aug 2019 7:13 UTC | 11 points | 3 comments | 1 min read
Towards an Intentional Research Agenda | romeostevensit | 23 Aug 2019 5:27 UTC | 21 points | 8 comments | 3 min read
[Question] Why are people so optimistic about superintelligence? | bipolo | 23 Aug 2019 4:25 UTC | 6 points | 3 comments | 1 min read
Vague Thoughts and Questions about Agent Structures | loriphos | 23 Aug 2019 4:01 UTC | 9 points | 3 comments | 2 min read
Formalising decision theory is hard | Lukas Finnveden | 23 Aug 2019 3:27 UTC | 17 points | 19 comments | 2 min read
Creating Environments to Design and Test Embedded Agents | lemonhope | 23 Aug 2019 3:17 UTC | 13 points | 5 comments | 8 min read
Tabooing ‘Agent’ for Prosaic Alignment | Hjalmar_Wijk | 23 Aug 2019 2:55 UTC | 57 points | 10 comments | 6 min read
Vaniver’s View on Factored Cognition | Vaniver | 23 Aug 2019 2:54 UTC | 48 points | 4 comments | 8 min read
Redefining Fast Takeoff | VojtaKovarik | 23 Aug 2019 2:15 UTC | 10 points | 1 comment | 1 min read
[Question] Does Agent-like Behavior Imply Agent-like Architecture? | Scott Garrabrant | 23 Aug 2019 2:01 UTC | 66 points | 8 comments | 1 min read
The Commitment Races problem | Daniel Kokotajlo | 23 Aug 2019 1:58 UTC | 157 points | 56 comments | 5 min read
Analysis of a Secret Hitler Scenario | jaek | 23 Aug 2019 1:24 UTC | 16 points | 6 comments | 4 min read
Thoughts from a Two Boxer | jaek | 23 Aug 2019 0:24 UTC | 18 points | 11 comments | 5 min read
Deconfuse Yourself about Agency | VojtaKovarik | 23 Aug 2019 0:21 UTC | 15 points | 9 comments | 4 min read
Logical Optimizers | Donald Hobson | 22 Aug 2019 23:54 UTC | 11 points | 4 comments | 3 min read
Towards a mechanistic understanding of corrigibility | evhub | 22 Aug 2019 23:20 UTC | 47 points | 26 comments | 4 min read
Response to Glen Weyl on Technocracy and the Rationalist Community | John_Maxwell | 22 Aug 2019 23:14 UTC | 66 points | 9 comments | 10 min read
[Question] Why so much variance in human intelligence? | Ben Pace | 22 Aug 2019 22:36 UTC | 65 points | 28 comments | 4 min read
Logical Counterfactuals and Proposition graphs, Part 1 | Donald Hobson | 22 Aug 2019 22:06 UTC | 20 points | 0 comments | 3 min read
Time Travel, AI and Transparent Newcomb | johnswentworth | 22 Aug 2019 22:04 UTC | 11 points | 7 comments | 1 min read
Embedded Naive Bayes | johnswentworth | 22 Aug 2019 21:40 UTC | 14 points | 6 comments | 3 min read
Intentional Bucket Errors | Scott Garrabrant | 22 Aug 2019 20:02 UTC | 55 points | 6 comments | 3 min read
Computational Model: Causal Diagrams with Symmetry | johnswentworth | 22 Aug 2019 17:54 UTC | 53 points | 29 comments | 4 min read
[AN #62] Are adversarial examples caused by real but imperceptible features? | Rohin Shah | 22 Aug 2019 17:10 UTC | 28 points | 10 comments | 9 min read | (mailchi.mp)
Implications of Quantum Computing for Artificial Intelligence Alignment Research | Jsevillamol and PabloAMC | 22 Aug 2019 10:33 UTC | 24 points | 3 comments | 13 min read
Body Alignment & Balance. Our Midline Anatomy & the Median Plane. | leggi | 22 Aug 2019 10:24 UTC | 15 points | 6 comments | 4 min read
[Question] Simulation Argument: Why aren’t ancestor simulations outnumbered by transhumans? | maximkazhenkov | 22 Aug 2019 9:07 UTC | 9 points | 11 comments | 1 min read
Markets are Universal for Logical Induction | johnswentworth | 22 Aug 2019 6:44 UTC | 75 points | 2 comments | 5 min read
Announcement: Writing Day Today (Thursday) | Ben Pace | 22 Aug 2019 4:48 UTC | 29 points | 5 comments | 1 min read
Western Massachusetts SSC meetup #15 | a_lieb | 22 Aug 2019 0:53 UTC | 1 point | 0 comments | 1 min read
Call for contributors to the Alignment Newsletter | Rohin Shah | 21 Aug 2019 18:21 UTC | 39 points | 0 comments | 4 min read
Two senses of “optimizer” | Joar Skalse | 21 Aug 2019 16:02 UTC | 35 points | 41 comments | 3 min read
Paradoxical Advice Thread | Hazard | 21 Aug 2019 14:50 UTC | 13 points | 10 comments | 1 min read
Three Levels of Motivation | DragonGod | 21 Aug 2019 9:24 UTC | 15 points | 1 comment | 7 min read
Odds are not easier | MrMind | 21 Aug 2019 8:34 UTC | 9 points | 6 comments | 1 min read