Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Wolfram Research v Cook
Kenny
Jul 31, 2022, 1:35 PM
7
points
3
comments
8
min read
LW
link
Wanted: Notation for credal resilience
PeterH
Jul 31, 2022, 7:35 AM
21
points
12
comments
1
min read
LW
link
Anatomy of a Dating Document
squidious
Jul 31, 2022, 2:40 AM
29
points
24
comments
4
min read
LW
link
(opalsandbonobos.blogspot.com)
chinchilla’s wild implications
nostalgebraist
Jul 31, 2022, 1:18 AM
424
points
128
comments
10
min read
LW
link
1
review
AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical
Roman Leventov
Jul 30, 2022, 8:56 PM
24
points
10
comments
1
min read
LW
link
[Question]
What job should I do?
Tom Paine
Jul 30, 2022, 9:15 AM
2
points
8
comments
1
min read
LW
link
How transparency changed over time
ViktoriaMalyasova
Jul 30, 2022, 4:36 AM
21
points
0
comments
6
min read
LW
link
Translating between Latent Spaces
JamesH
,
Jeremy Gillen
and
NickyP
Jul 30, 2022, 3:25 AM
27
points
2
comments
8
min read
LW
link
Drexler’s Nanotech Forecast
PeterMcCluskey
Jul 30, 2022, 12:45 AM
25
points
28
comments
3
min read
LW
link
(www.bayesianinvestor.com)
Humans Reflecting on HRH
leogao
Jul 29, 2022, 9:56 PM
26
points
4
comments
2
min read
LW
link
Comparing Four Approaches to Inner Alignment
Lucas Teixeira
Jul 29, 2022, 9:06 PM
38
points
1
comment
9
min read
LW
link
Questions for a Theory of Narratives
Marv K
Jul 29, 2022, 7:31 PM
5
points
4
comments
4
min read
LW
link
Focusing
CFAR!Duncan
Jul 29, 2022, 7:15 PM
115
points
23
comments
14
min read
LW
link
Conjecture: Internal Infohazard Policy
Connor Leahy
,
Sid Black
,
Chris Scammell
and
Andrea_Miotti
Jul 29, 2022, 7:07 PM
131
points
6
comments
19
min read
LW
link
Abstracting The Hardness of Alignment: Unbounded Atomic Optimization
adamShimi
Jul 29, 2022, 6:59 PM
75
points
3
comments
16
min read
LW
link
Bucket Errors
CFAR!Duncan
Jul 29, 2022, 6:50 PM
43
points
7
comments
11
min read
LW
link
Distillation Contest—Results and Recap
Aris
Jul 29, 2022, 5:40 PM
34
points
0
comments
7
min read
LW
link
The generalized Sierpinski-Mazurkiewicz theorem.
Donald Hobson
Jul 29, 2022, 12:12 AM
11
points
4
comments
1
min read
LW
link
The Conversations We Make Space For
Severin T. Seehrich
Jul 28, 2022, 9:37 PM
21
points
0
comments
3
min read
LW
link
Announcing the AI Safety Field Building Hub, a new effort to provide AISFB projects, mentorship, and funding
Vael Gates
Jul 28, 2022, 9:29 PM
49
points
3
comments
6
min read
LW
link
Defining Optimization in a Deeper Way Part 4
J Bostock
Jul 28, 2022, 5:02 PM
7
points
0
comments
5
min read
LW
link
Covid 7/28/22: Ruining It For Everyone
Zvi
Jul 28, 2022, 3:10 PM
32
points
8
comments
12
min read
LW
link
(thezvi.wordpress.com)
Monkeypox Post #2
Zvi
Jul 28, 2022, 1:20 PM
36
points
3
comments
6
min read
LW
link
(thezvi.wordpress.com)
For Better Commenting, Stop Out Loud
DirectedEvolution
Jul 28, 2022, 1:39 AM
18
points
30
comments
1
min read
LW
link
Seeking beta readers who are ignorant of biology but knowledgeable about AI safety
Holly_Elmore
Jul 27, 2022, 11:02 PM
11
points
6
comments
1
min read
LW
link
Principles of Privacy for Alignment Research
johnswentworth
Jul 27, 2022, 7:53 PM
73
points
31
comments
7
min read
LW
link
Moral strategies at different capability levels
Richard_Ngo
Jul 27, 2022, 6:50 PM
112
points
14
comments
5
min read
LW
link
(thinkingcomplete.blogspot.com)
Progress links and tweets, 2022-07-27
jasoncrawford
Jul 27, 2022, 5:20 PM
18
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
Quantum Advantage in Learning from Experiments
Dennis Towne
Jul 27, 2022, 3:49 PM
5
points
5
comments
1
min read
LW
link
(ai.googleblog.com)
Levels of Pluralism
adamShimi
Jul 27, 2022, 9:35 AM
37
points
0
comments
14
min read
LW
link
Human trials for the Marburg vaccine: funding opportunity?
americanwalrus
Jul 27, 2022, 5:53 AM
3
points
0
comments
1
min read
LW
link
(www.independent.co.uk)
[Question]
“Fanatical” Longtermists: Why is Pascal’s Wager wrong?
Yitz
Jul 27, 2022, 4:16 AM
3
points
7
comments
1
min read
LW
link
Unifying Bargaining Notions (2/2)
Diffractor
Jul 27, 2022, 3:40 AM
118
points
19
comments
21
min read
LW
link
AGI ruin scenarios are likely (and disjunctive)
So8res
Jul 27, 2022, 3:21 AM
177
points
38
comments
6
min read
LW
link
Technocracy and the Space Age
jasoncrawford
Jul 26, 2022, 11:14 PM
25
points
5
comments
2
min read
LW
link
(rootsofprogress.org)
«Boundaries», Part 1: a key missing concept from utility theory
Andrew_Critch
Jul 26, 2022, 11:03 PM
158
points
33
comments
7
min read
LW
link
Incoherence of unbounded selfishness
emmab
Jul 26, 2022, 10:27 PM
−6
points
2
comments
1
min read
LW
link
«Boundaries» Sequence (Index Post)
Andrew_Critch
Jul 26, 2022, 7:12 PM
25
points
1
comment
1
min read
LW
link
Active Inference as a formalisation of instrumental convergence
Roman Leventov
Jul 26, 2022, 5:55 PM
12
points
2
comments
3
min read
LW
link
(direct.mit.edu)
NeurIPS ML Safety Workshop 2022
Dan H
Jul 26, 2022, 3:28 PM
72
points
2
comments
1
min read
LW
link
(neurips2022.mlsafety.org)
AI ethics vs AI alignment
Wei Dai
Jul 26, 2022, 1:08 PM
5
points
1
comment
1
min read
LW
link
Utility functions and probabilities are entangled
Thomas Kwa
Jul 26, 2022, 5:36 AM
15
points
5
comments
1
min read
LW
link
How Promising is Theoretical Research on Rationality? Seeking Career Advice
Aspirant223
Jul 26, 2022, 1:08 AM
3
points
3
comments
3
min read
LW
link
Prediction markets meetup/coworking (hosted by Manifold Markets)
Sinclair Chen
and
Austin Chen
Jul 26, 2022, 12:14 AM
2
points
0
comments
1
min read
LW
link
Alignment being impossible might be better than it being really difficult
Martín Soto
Jul 25, 2022, 11:57 PM
13
points
2
comments
2
min read
LW
link
[Question]
How optimistic should we be about AI figuring out how to interpret itself?
oh54321
Jul 25, 2022, 10:09 PM
3
points
1
comment
1
min read
LW
link
Protectionism in One Country: How Industrial Policy Worked in Canada
Davis Kedrosky
Jul 25, 2022, 10:08 PM
5
points
0
comments
16
min read
LW
link
(daviskedrosky.substack.com)
Mistakes as agency
pchvykov
Jul 25, 2022, 4:17 PM
12
points
8
comments
4
min read
LW
link
My Bitcoin Thesis @2022 - Part 1
aysajan
Jul 25, 2022, 3:49 PM
7
points
6
comments
13
min read
LW
link
The Reader’s Guide to Optimal Monetary Policy
Ege Erdil
Jul 25, 2022, 3:10 PM
57
points
10
comments
14
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel