All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Reward is not the optimization target

TurnTroutJul 25, 2022, 12:03 AM

375 points

123 comments10 min readLW link 3 reviews

Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover

Ajeya CotraJul 18, 2022, 7:06 PM

368 points

95 comments75 min readLW link 1 review

What should you change in response to an “emergency”? And AI risk

AnnaSalamonJul 18, 2022, 1:11 AM

338 points

60 comments6 min readLW link 1 review

Looking back on my alignment PhD

TurnTroutJul 1, 2022, 3:19 AM

334 points

66 comments11 min readLW link

On how various plans miss the hard bits of the alignment challenge

So8resJul 12, 2022, 2:49 AM

313 points

89 comments29 min readLW link 3 reviews

Toni Kurz and the Insanity of Climbing Mountains

GeneSmithJul 3, 2022, 8:51 PM

271 points

67 comments11 min readLW link 2 reviews

Changing the world through slack & hobbies

Steven ByrnesJul 21, 2022, 6:11 PM

261 points

13 comments10 min readLW link

Safetywashing

Adam SchollJul 1, 2022, 11:56 AM

260 points

20 comments1 min readLW link 2 reviews

Sexual Abuse attitudes might be infohazardous

Pseudonymous OtterJul 19, 2022, 6:06 PM

256 points

72 comments1 min readLW link

Humans provide an untapped wealth of evidence about alignment

TurnTrout and Quintin Pope

Jul 14, 2022, 2:31 AM

211 points

94 comments9 min readLW link 1 review

Unifying Bargaining Notions (1/2)

DiffractorJul 25, 2022, 12:28 AM

210 points

41 comments16 min readLW link

A note about differential technological development

So8resJul 15, 2022, 4:46 AM

197 points

33 comments6 min readLW link

Connor Leahy on Dying with Dignity, EleutherAI and Conjecture

Michaël TrazziJul 22, 2022, 6:44 PM

195 points

29 comments14 min readLW link

(theinsideview.ai)

AGI ruin scenarios are likely (and disjunctive)

So8resJul 27, 2022, 3:21 AM

175 points

38 comments6 min readLW link

ITT-passing and civility are good; “charity” is bad; steelmanning is niche

Rob BensingerJul 5, 2022, 12:15 AM

163 points

36 comments6 min readLW link 1 review

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_CritchJul 26, 2022, 11:03 PM

158 points

33 comments7 min readLW link

Resolve Cycles

CFAR!DuncanJul 16, 2022, 11:17 PM

140 points

8 comments10 min readLW link

Carrying the Torch: A Response to Anna Salamon by the Guild of the Rose

moridinamaelJul 6, 2022, 2:20 PM

136 points

16 comments6 min readLW link

Brainstorm of things that could force an AI team to burn their lead

So8resJul 24, 2022, 11:58 PM

134 points

8 comments13 min readLW link

AI Forecasting: One Year In

jsteinhardtJul 4, 2022, 5:10 AM

132 points

12 comments6 min readLW link

(bounded-regret.ghost.io)

Conjecture: Internal Infohazard Policy

Connor Leahy, Sid Black, Chris Scammell and Andrea_Miotti

Jul 29, 2022, 7:07 PM

131 points

6 comments19 min readLW link

Limerence Messes Up Your Rationality Real Bad, Yo

RaemonJul 1, 2022, 4:53 PM

128 points

41 comments3 min readLW link 2 reviews

Principles for Alignment/Agency Projects

johnswentworthJul 7, 2022, 2:07 AM

122 points

20 comments4 min readLW link

Unifying Bargaining Notions (2/2)

DiffractorJul 27, 2022, 3:40 AM

118 points

19 comments21 min readLW link

Focusing

CFAR!DuncanJul 29, 2022, 7:15 PM

114 points

23 comments14 min readLW link

Circumventing interpretability: How to defeat mind-readers

Lee SharkeyJul 14, 2022, 4:59 PM

114 points

15 comments33 min readLW link

Moral strategies at different capability levels

Richard_NgoJul 27, 2022, 6:50 PM

112 points

14 comments5 min readLW link

(thinkingcomplete.blogspot.com)

Criticism of EA Criticism Contest

ZviJul 14, 2022, 2:30 PM

108 points

17 comments31 min readLW link 1 review

(thezvi.wordpress.com)

Examples of AI Increasing AI Progress

TW123Jul 17, 2022, 8:06 PM

107 points

14 comments1 min readLW link

Safety Implications of LeCun’s path to machine intelligence

Ivan VendrovJul 15, 2022, 9:47 PM

102 points

18 comments6 min readLW link

Comment on “Propositions Concerning Digital Minds and Society”

Zack_M_DavisJul 10, 2022, 5:48 AM

99 points

12 comments8 min readLW link

Marriage, the Giving What We Can Pledge, and the damage caused by vague public commitments

Jeffrey LadishJul 11, 2022, 7:38 PM

98 points

27 comments6 min readLW link 1 review

Naive Hypotheses on AI Alignment

Shoshannah TekofskyJul 2, 2022, 7:03 PM

98 points

29 comments5 min readLW link

A summary of every “Highlights from the Sequences” post

Orpheus16Jul 15, 2022, 11:01 PM

97 points

7 comments17 min readLW link

Opening Session Tips & Advice

CFAR!DuncanJul 25, 2022, 3:57 AM

95 points

3 comments14 min readLW link 1 review

Help ARC evaluate capabilities of current language models (still need people)

Beth BarnesJul 19, 2022, 4:55 AM

95 points

6 comments2 min readLW link

MATS Models

johnswentworthJul 9, 2022, 12:14 AM

94 points

5 comments16 min readLW link

Human values & biases are inaccessible to the genome

TurnTroutJul 7, 2022, 5:29 PM

94 points

54 comments6 min readLW link 1 review

Internal Double Crux

CFAR!DuncanJul 22, 2022, 4:34 AM

93 points

15 comments12 min readLW link

Immanuel Kant and the Decision Theory App Store

Daniel KokotajloJul 10, 2022, 4:04 PM

92 points

12 comments5 min readLW link

Goal Factoring

CFAR!DuncanJul 5, 2022, 7:10 AM

92 points

2 comments8 min readLW link

How to Diversify Conceptual Alignment: the Model Behind Refine

adamShimiJul 20, 2022, 10:44 AM

87 points

11 comments8 min readLW link

Don’t use ‘infohazard’ for collectively destructive info

Eliezer YudkowskyJul 15, 2022, 5:13 AM

86 points

33 comments1 min readLW link 2 reviews

(www.facebook.com)

Trigger-Action Planning

CFAR!DuncanJul 3, 2022, 1:42 AM

86 points

14 comments13 min readLW link 2 reviews

Trends in GPU price-performance

Marius Hobbhahn and Tamay

Jul 1, 2022, 3:51 PM

85 points

13 comments1 min readLW link 1 review

(epochai.org)

All AGI safety questions welcome (especially basic ones) [July 2022]

plex and Robert Miles

Jul 16, 2022, 12:57 PM

84 points

132 comments3 min readLW link

Benchmark for successful concept extrapolation/avoiding goal misgeneralization

Stuart_ArmstrongJul 4, 2022, 8:48 PM

82 points

12 comments4 min readLW link

Addendum: A non-magical explanation of Jeffrey Epstein

lcJul 18, 2022, 5:40 PM

81 points

21 comments11 min readLW link

Decision theory and dynamic inconsistency

paulfchristianoJul 3, 2022, 10:20 PM

80 points

33 comments10 min readLW link

(sideways-view.com)

[Question] How do AI timelines affect how you live your life?

Quadratic ReciprocityJul 11, 2022, 1:54 PM

80 points

50 comments1 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer