All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30 31

Shard Theory in Nine Theses: a Distillation and Critical Appraisal

LawrenceCDec 19, 2022, 10:52 PM

150 points

30 comments18 min readLW link

[Question] Will research in AI risk jinx it? Consequences of training AI on AI risk arguments

Yann DuboisDec 19, 2022, 10:42 PM

5 points

6 comments1 min readLW link

AGI Timelines in Governance: Different Strategies for Different Timeframes

simeon_c and AmberDawn

Dec 19, 2022, 9:31 PM

65 points

28 comments10 min readLW link

Towards Hodge-podge Alignment

Cleo NardoDec 19, 2022, 8:12 PM

95 points

30 comments9 min readLW link

Computational signatures of psychopathy

Cameron BergDec 19, 2022, 5:01 PM

30 points

3 comments20 min readLW link

Results from a survey on tool use and workflows in alignment research

jacquesthibs, Jan, janus and Logan Riggs

Dec 19, 2022, 3:19 PM

79 points

2 comments19 min readLW link

Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]

Bill BenzonDec 19, 2022, 3:12 PM

13 points

5 comments4 min readLW link

(new-savanna.blogspot.com)

Conditions for Superrationality-motivated Cooperation in a one-shot Prisoner’s Dilemma

Jim BuhlerDec 19, 2022, 3:00 PM

24 points

4 comments5 min readLW link

Next Level Seinfeld

ZviDec 19, 2022, 1:30 PM

50 points

8 comments1 min readLW link

(thezvi.wordpress.com)

CEA Disambiguation

jefftkDec 19, 2022, 1:20 PM

25 points

0 comments1 min readLW link

(www.jefftk.com)

Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend)

RemmeltDec 19, 2022, 12:02 PM

−3 points

9 comments31 min readLW link

Hacker-AI and Cyberwar 2.0+

Erland WittkotterDec 19, 2022, 11:46 AM

2 points

0 comments15 min readLW link

Non-Technical Preparation for Hacker-AI and Cyberwar 2.0+

Erland WittkotterDec 19, 2022, 11:42 AM

2 points

0 comments25 min readLW link

An Effective Grab Bag

stavrosDec 19, 2022, 10:29 AM

28 points

2 comments7 min readLW link

Slick hyperfinite Ramsey theory proof

Alok SinghDec 19, 2022, 8:40 AM

8 points

3 comments1 min readLW link

(alok.github.io)

The True Spirit of Solstice?

RaemonDec 19, 2022, 8:00 AM

69 points

31 comments9 min readLW link

The Risk of Orbital Debris and One (Cheap) Way to Mitigate It

clansDec 19, 2022, 3:16 AM

13 points

1 comment4 min readLW link

(locationtbd.home.blog)

Why I think that teaching philosophy is high impact

Eleni AngelouDec 19, 2022, 3:11 AM

5 points

0 comments2 min readLW link

A template for doing annual reviews

peterslatteryDec 19, 2022, 3:09 AM

2 points

0 comments1 min readLW link

Event [Berkeley]: Alignment Collaborator Speed-Meeting

AlexMennen and Carson Jones

Dec 19, 2022, 2:24 AM

18 points

2 comments1 min readLW link

An easier(?) end to the electoral college

ejacobDec 19, 2022, 2:09 AM

2 points

2 comments2 min readLW link

How Death Feels

sisyphusDec 18, 2022, 11:47 PM

−7 points

9 comments1 min readLW link

Why Are Women Hot?

Jacob FalkovichDec 18, 2022, 11:20 PM

17 points

19 comments11 min readLW link

[Question] Can we, in principle, know the measure of counterfactual quantum branches?

sisyphusDec 18, 2022, 10:07 PM

1 point

15 comments1 min readLW link

Boston Solstice 2022 Retrospective

jefftkDec 18, 2022, 7:00 PM

19 points

3 comments5 min readLW link

(www.jefftk.com)

Take 11: “Aligning language models” should be weirder.

Charlie SteinerDec 18, 2022, 2:14 PM

34 points

0 comments2 min readLW link

Bad at Arithmetic, Promising at Math

cohenmacaulayDec 18, 2022, 5:40 AM

100 points

19 comments20 min readLW link 1 review

Overconfidence bubbles

kaputmiDec 18, 2022, 2:07 AM

3 points

0 comments2 min readLW link

Positive values seem more robust and lasting than prohibitions

TurnTroutDec 17, 2022, 9:43 PM

52 points

13 comments2 min readLW link

What we owe the microbiome

weverkaDec 17, 2022, 7:40 PM

2 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Why write more: improve your epistemics, self-care, & 28 other reasons

KatWoodsDec 17, 2022, 7:25 PM

24 points

1 comment6 min readLW link

Looking for an alignment tutor

JanBDec 17, 2022, 7:08 PM

15 points

2 comments1 min readLW link

[Question] How to Convince my Son that Drugs are Bad

concerned_dadDec 17, 2022, 6:47 PM

140 points

84 comments2 min readLW link

Ordinary human life

David Hugh-JonesDec 17, 2022, 4:46 PM

24 points

3 comments14 min readLW link

(wyclif.substack.com)

Predictive Processing, Heterosexuality and Delusions of Grandeur

lsusrDec 17, 2022, 7:37 AM

37 points

13 comments5 min readLW link

[Link] Escape the Echo Chamber (2018)

CronoDASDec 17, 2022, 6:14 AM

13 points

0 comments2 min readLW link

(aeon.co)

“Starry Night” Solstice Cookies

maiaDec 17, 2022, 5:31 AM

26 points

7 comments1 min readLW link

There have been 3 planes (billionaire donors) and 2 have crashed

trevorDec 17, 2022, 3:58 AM

16 points

10 comments2 min readLW link

[Question] What about non-degree seeking?

Lao MeinDec 17, 2022, 2:22 AM

5 points

5 comments1 min readLW link

Using Information Theory to tackle AI Alignment: A Practical Approach

Daniel SalamiDec 17, 2022, 1:37 AM

10 points

4 comments7 min readLW link

Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)

LawrenceCDec 16, 2022, 10:12 PM

68 points

11 comments1 min readLW link

(www.anthropic.com)

Vaguely interested in Effective Altruism? Please Take the Official 2022 EA Survey

Peter WildefordDec 16, 2022, 9:07 PM

22 points

4 comments1 min readLW link

(rethinkpriorities.qualtrics.com)

Abstract concepts and metalingual definition: Does ChatGPT understand justice and charity?

Bill BenzonDec 16, 2022, 9:01 PM

2 points

0 comments13 min readLW link

Beyond the moment of invention

jasoncrawfordDec 16, 2022, 8:18 PM

35 points

0 comments2 min readLW link

(rootsofprogress.org)

[Question] What’s the best time-efficient alternative to the Sequences?

trevorDec 16, 2022, 8:17 PM

7 points

7 comments1 min readLW link

Can we efficiently explain model behaviors?

paulfchristianoDec 16, 2022, 7:40 PM

64 points

3 comments9 min readLW link

(ai-alignment.com)

Proper scoring rules don’t guarantee predicting fixed points

Johannes Treutlein, Rubi J. Hudson and Caspar Oesterheld

Dec 16, 2022, 6:22 PM

79 points

8 comments21 min readLW link

A learned agent is not the same as a learning agent

Ben AmitayDec 16, 2022, 5:27 PM

4 points

5 comments4 min readLW link

[Question] College Selection Advice for Technical Alignment

TempCollegeAskDec 16, 2022, 5:11 PM

11 points

8 comments1 min readLW link

How important are accurate AI timelines for the optimal spending schedule on AI risk interventions?

Tristan CookDec 16, 2022, 4:05 PM

27 points

2 comments LW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer