All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

Humans Reflecting on HRH

leogaoJul 29, 2022, 9:56 PM

27 points

4 comments2 min readLW link

Comparing Four Approaches to Inner Alignment

Lucas TeixeiraJul 29, 2022, 9:06 PM

38 points

1 comment9 min readLW link

Questions for a Theory of Narratives

Marv KJul 29, 2022, 7:31 PM

5 points

4 comments4 min readLW link

Focusing

CFAR!DuncanJul 29, 2022, 7:15 PM

115 points

24 comments14 min readLW link

Conjecture: Internal Infohazard Policy

Connor Leahy, Sid Black, Chris Scammell and Andrea_Miotti

Jul 29, 2022, 7:07 PM

131 points

6 comments19 min readLW link

Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimiJul 29, 2022, 6:59 PM

75 points

3 comments16 min readLW link

Bucket Errors

CFAR!DuncanJul 29, 2022, 6:50 PM

44 points

7 comments11 min readLW link

Distillation Contest—Results and Recap

ArisJul 29, 2022, 5:40 PM

34 points

0 comments7 min readLW link

The generalized Sierpinski-Mazurkiewicz theorem.

Donald HobsonJul 29, 2022, 12:12 AM

11 points

4 comments1 min readLW link

The Conversations We Make Space For

Severin T. SeehrichJul 28, 2022, 9:37 PM

21 points

0 comments3 min readLW link

Announcing the AI Safety Field Building Hub, a new effort to provide AISFB projects, mentorship, and funding

Vael GatesJul 28, 2022, 9:29 PM

49 points

3 comments6 min readLW link

Defining Optimization in a Deeper Way Part 4

J BostockJul 28, 2022, 5:02 PM

7 points

0 comments5 min readLW link

Covid 7/28/22: Ruining It For Everyone

ZviJul 28, 2022, 3:10 PM

32 points

8 comments12 min readLW link

(thezvi.wordpress.com)

Monkeypox Post #2

ZviJul 28, 2022, 1:20 PM

36 points

3 comments6 min readLW link

(thezvi.wordpress.com)

For Better Commenting, Stop Out Loud

DirectedEvolutionJul 28, 2022, 1:39 AM

18 points

30 comments1 min readLW link

Seeking beta readers who are ignorant of biology but knowledgeable about AI safety

Holly_ElmoreJul 27, 2022, 11:02 PM

11 points

6 comments1 min readLW link

Principles of Privacy for Alignment Research

johnswentworthJul 27, 2022, 7:53 PM

73 points

31 comments7 min readLW link

Moral strategies at different capability levels

Richard_NgoJul 27, 2022, 6:50 PM

112 points

14 comments5 min readLW link

(thinkingcomplete.blogspot.com)

Progress links and tweets, 2022-07-27

jasoncrawfordJul 27, 2022, 5:20 PM

18 points

0 comments1 min readLW link

(rootsofprogress.org)

Quantum Advantage in Learning from Experiments

Dennis TowneJul 27, 2022, 3:49 PM

5 points

5 comments1 min readLW link

(ai.googleblog.com)

Levels of Pluralism

adamShimiJul 27, 2022, 9:35 AM

37 points

0 comments14 min readLW link

Human trials for the Marburg vaccine: funding opportunity?

americanwalrusJul 27, 2022, 5:53 AM

3 points

0 comments1 min readLW link

(www.independent.co.uk)

[Question] “Fanatical” Longtermists: Why is Pascal’s Wager wrong?

YitzJul 27, 2022, 4:16 AM

3 points

7 comments1 min readLW link

Unifying Bargaining Notions (2/2)

DiffractorJul 27, 2022, 3:40 AM

118 points

19 comments21 min readLW link

AGI ruin scenarios are likely (and disjunctive)

So8resJul 27, 2022, 3:21 AM

177 points

38 comments6 min readLW link

Technocracy and the Space Age

jasoncrawfordJul 26, 2022, 11:14 PM

25 points

5 comments2 min readLW link

(rootsofprogress.org)

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_CritchJul 26, 2022, 11:03 PM

158 points

33 comments7 min readLW link

Incoherence of unbounded selfishness

emmabJul 26, 2022, 10:27 PM

−6 points

2 comments1 min readLW link

«Boundaries» Sequence (Index Post)

Andrew_CritchJul 26, 2022, 7:12 PM

25 points

1 comment1 min readLW link

Active Inference as a formalisation of instrumental convergence

Roman LeventovJul 26, 2022, 5:55 PM

12 points

2 comments3 min readLW link

(direct.mit.edu)

NeurIPS ML Safety Workshop 2022

Dan HJul 26, 2022, 3:28 PM

72 points

2 comments1 min readLW link

(neurips2022.mlsafety.org)

AI ethics vs AI alignment

Wei DaiJul 26, 2022, 1:08 PM

5 points

1 comment1 min readLW link

Utility functions and probabilities are entangled

Thomas KwaJul 26, 2022, 5:36 AM

15 points

5 comments1 min readLW link

How Promising is Theoretical Research on Rationality? Seeking Career Advice

Aspirant223Jul 26, 2022, 1:08 AM

3 points

3 comments3 min readLW link

Prediction markets meetup/coworking (hosted by Manifold Markets)

Sinclair Chen and Austin Chen

Jul 26, 2022, 12:14 AM

2 points

0 comments1 min readLW link

Alignment being impossible might be better than it being really difficult

Martín SotoJul 25, 2022, 11:57 PM

13 points

2 comments2 min readLW link

[Question] How optimistic should we be about AI figuring out how to interpret itself?

oh54321Jul 25, 2022, 10:09 PM

3 points

1 comment1 min readLW link

Protectionism in One Country: How Industrial Policy Worked in Canada

Davis KedroskyJul 25, 2022, 10:08 PM

5 points

0 comments16 min readLW link

(daviskedrosky.substack.com)

Mistakes as agency

pchvykovJul 25, 2022, 4:17 PM

12 points

8 comments4 min readLW link

My Bitcoin Thesis @2022 - Part 1

aysajanJul 25, 2022, 3:49 PM

7 points

6 comments13 min readLW link

The Reader’s Guide to Optimal Monetary Policy

Ege ErdilJul 25, 2022, 3:10 PM

57 points

10 comments14 min readLW link

AGI Safety Needs People With All Skillsets!

Severin T. SeehrichJul 25, 2022, 1:32 PM

28 points

0 comments2 min readLW link

[Question] Is there any evidence that handwashing does anything to prevent COVID?

mukashiJul 25, 2022, 7:34 AM

4 points

3 comments1 min readLW link

Opening Session Tips & Advice

CFAR!DuncanJul 25, 2022, 3:57 AM

95 points

3 comments14 min readLW link 1 review

How much should we worry about mesa-optimization challenges?

sudoJul 25, 2022, 3:56 AM

4 points

13 comments2 min readLW link

[Question] Does agent foundations cover all future ML systems?

Jonas HallgrenJul 25, 2022, 1:17 AM

4 points

0 comments1 min readLW link

Unifying Bargaining Notions (1/2)

DiffractorJul 25, 2022, 12:28 AM

210 points

41 comments16 min readLW link

Reward is not the optimization target

TurnTroutJul 25, 2022, 12:03 AM

376 points

123 comments10 min readLW link 3 reviews

Brainstorm of things that could force an AI team to burn their lead

So8resJul 24, 2022, 11:58 PM

136 points

8 comments13 min readLW link

Finding Skeletons on Rashomon Ridge

David Udell, Peter S. Park and NickyP

Jul 24, 2022, 10:31 PM

30 points

2 comments7 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer