Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Embedded Agency via Abstraction
johnswentworth
26 Aug 2019 23:03 UTC
42
points
20
comments
11
min read
LW
link
Reversible changes: consider a bucket of water
Stuart_Armstrong
26 Aug 2019 22:55 UTC
25
points
18
comments
2
min read
LW
link
Toy model piece #3: close and distant situations
Stuart_Armstrong
26 Aug 2019 22:41 UTC
10
points
0
comments
1
min read
LW
link
[Question]
How do you learn foreign language vocabulary, beyond Anki?
Elizabeth
26 Aug 2019 21:00 UTC
9
points
21
comments
1
min read
LW
link
[Question]
How Can People Evaluate Complex Questions Consistently?
Elizabeth
26 Aug 2019 20:33 UTC
46
points
12
comments
1
min read
LW
link
Problems with AI debate
Stuart_Armstrong
26 Aug 2019 19:21 UTC
21
points
3
comments
5
min read
LW
link
Schelling Categories, and Simple Membership Tests
Zack_M_Davis
26 Aug 2019 2:43 UTC
59
points
10
comments
8
min read
LW
link
Limits of and to (artificial) Intelligence
MoritzG
25 Aug 2019 22:16 UTC
1
point
3
comments
7
min read
LW
link
Gratification: a useful concept, maybe new
Stuart_Armstrong
25 Aug 2019 18:58 UTC
17
points
7
comments
3
min read
LW
link
Under a week left to win $1,000! By questioning Oracle AIs.
Stuart_Armstrong
25 Aug 2019 17:02 UTC
12
points
2
comments
1
min read
LW
link
[Question]
I’m interested in a sub-field of AI but don’t know what to call it.
fowlertm
25 Aug 2019 14:55 UTC
9
points
4
comments
1
min read
LW
link
[Question]
Am I going for a job interview with a woo pusher?
CronoDAS
25 Aug 2019 14:39 UTC
6
points
7
comments
1
min read
LW
link
OpenPhil on “GiveWell’s Top Charities Are (Increasingly) Hard to Beat”
Raemon
24 Aug 2019 23:28 UTC
17
points
0
comments
6
min read
LW
link
(www.openphilanthropy.org)
Epistemic Spot Check: The Fate of Rome (Kyle Harper)
Elizabeth
24 Aug 2019 21:40 UTC
39
points
3
comments
5
min read
LW
link
(acesounderglass.com)
[Question]
Performance IQ and higher mathematics
c5pi
24 Aug 2019 17:31 UTC
4
points
5
comments
1
min read
LW
link
[Question]
how should a second version of “rationality: A to Z” look like?
Yoav Ravid
24 Aug 2019 7:01 UTC
6
points
4
comments
1
min read
LW
link
Petrov Day Celebration 2019 - Oxford Campsite
jbeshir
24 Aug 2019 3:42 UTC
8
points
1
comment
1
min read
LW
link
[Question]
How has rationalism helped you?
Sunny from QAD
24 Aug 2019 1:31 UTC
9
points
11
comments
1
min read
LW
link
[Question]
Is LW making progress?
zulupineapple
24 Aug 2019 0:32 UTC
21
points
11
comments
1
min read
LW
link
LessLong Launch Party
Raemon
23 Aug 2019 22:18 UTC
12
points
1
comment
1
min read
LW
link
[Question]
Is there a simple parameter that controls human working memory capacity, which has been set tragically low?
Liron
23 Aug 2019 22:10 UTC
17
points
8
comments
1
min read
LW
link
Optimization Provenance
Adele Lopez
23 Aug 2019 20:08 UTC
38
points
5
comments
5
min read
LW
link
Troll Bridge
abramdemski
23 Aug 2019 18:36 UTC
86
points
59
comments
12
min read
LW
link
Understanding understanding
mthq
23 Aug 2019 18:10 UTC
24
points
1
comment
2
min read
LW
link
Actually updating
SaraHax
23 Aug 2019 17:46 UTC
56
points
10
comments
4
min read
LW
link
When do utility functions constrain?
Hoagy
23 Aug 2019 17:19 UTC
30
points
8
comments
7
min read
LW
link
Parables of Constraint and Actualization
Spencer Wyman
23 Aug 2019 16:56 UTC
13
points
0
comments
6
min read
LW
link
Thoughts on Retrieving Knowledge from Neural Networks
Jaime Ruiz
23 Aug 2019 16:41 UTC
11
points
2
comments
5
min read
LW
link
Algorithmic Similarity
LukasM
23 Aug 2019 16:39 UTC
28
points
10
comments
11
min read
LW
link
Soft takeoff can still lead to decisive strategic advantage
Daniel Kokotajlo
23 Aug 2019 16:39 UTC
122
points
47
comments
8
min read
LW
link
4
reviews
Moscow LW meetup in “Nauchka” library
Alexander230
23 Aug 2019 12:40 UTC
3
points
0
comments
1
min read
LW
link
OpenGPT-2: We Replicated GPT-2 Because You Can Too
avturchin
23 Aug 2019 11:32 UTC
18
points
0
comments
1
min read
LW
link
(medium.com)
Torture and Dust Specks and Joy—Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces
Louis_Brown
23 Aug 2019 11:11 UTC
19
points
29
comments
8
min read
LW
link
Metalignment: Deconfusing metaethics for AI alignment.
Guillaume Corlouer
23 Aug 2019 10:25 UTC
13
points
7
comments
3
min read
LW
link
[Question]
A basic probability question
Shmi
23 Aug 2019 7:13 UTC
11
points
3
comments
1
min read
LW
link
Towards an Intentional Research Agenda
romeostevensit
23 Aug 2019 5:27 UTC
20
points
8
comments
3
min read
LW
link
[Question]
Why are people so optimistic about superintelligence?
bipolo
23 Aug 2019 4:25 UTC
6
points
3
comments
1
min read
LW
link
Vague Thoughts and Questions about Agent Structures
loriphos
23 Aug 2019 4:01 UTC
9
points
3
comments
2
min read
LW
link
Formalising decision theory is hard
Lukas Finnveden
23 Aug 2019 3:27 UTC
17
points
19
comments
2
min read
LW
link
Creating Environments to Design and Test Embedded Agents
lemonhope
23 Aug 2019 3:17 UTC
13
points
5
comments
8
min read
LW
link
Tabooing ‘Agent’ for Prosaic Alignment
Hjalmar_Wijk
23 Aug 2019 2:55 UTC
57
points
10
comments
6
min read
LW
link
Vaniver’s View on Factored Cognition
Vaniver
23 Aug 2019 2:54 UTC
48
points
4
comments
8
min read
LW
link
Redefining Fast Takeoff
VojtaKovarik
23 Aug 2019 2:15 UTC
10
points
1
comment
1
min read
LW
link
[Question]
Does Agent-like Behavior Imply Agent-like Architecture?
Scott Garrabrant
23 Aug 2019 2:01 UTC
66
points
8
comments
1
min read
LW
link
The Commitment Races problem
Daniel Kokotajlo
23 Aug 2019 1:58 UTC
152
points
56
comments
5
min read
LW
link
Analysis of a Secret Hitler Scenario
jaek
23 Aug 2019 1:24 UTC
16
points
6
comments
4
min read
LW
link
Thoughts from a Two Boxer
jaek
23 Aug 2019 0:24 UTC
18
points
11
comments
5
min read
LW
link
Deconfuse Yourself about Agency
VojtaKovarik
23 Aug 2019 0:21 UTC
15
points
9
comments
4
min read
LW
link
Logical Optimizers
Donald Hobson
22 Aug 2019 23:54 UTC
11
points
4
comments
3
min read
LW
link
Towards a mechanistic understanding of corrigibility
evhub
22 Aug 2019 23:20 UTC
47
points
26
comments
4
min read
LW
link
Back to top
Next