Is checking that a state of the world is not dystopian easier than constructing a non-dystopian state?

No77e · Dec 27, 2022, 8:57 PM
5 points
3 comments · 1 min read · LW link

Crypto-currency as pro-alignment mechanism

False Name · Dec 27, 2022, 5:45 PM
−10 points
2 comments · 2 min read · LW link

My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)

Robert_AIZI · Dec 27, 2022, 5:27 PM
50 points
0 comments · 4 min read · LW link
(aizi.substack.com)

Things that can kill you quickly: What everyone should know about first aid

jasoncrawford · Dec 27, 2022, 4:23 PM
166 points
21 comments · 2 min read · LW link · 1 review
(jasoncrawford.org)

[Question] Why The Focus on Expected Utility Maximisers?

DragonGod · Dec 27, 2022, 3:49 PM
118 points
84 comments · 3 min read · LW link

Presumptive Listening: sticking to familiar concepts and missing the outer reasoning paths

Remmelt · Dec 27, 2022, 3:40 PM
−16 points
8 comments · 2 min read · LW link
(mflb.com)

Mere exposure effect: Bias in Evaluating AGI X-Risks

Dec 27, 2022, 2:05 PM
0 points
2 comments · 1 min read · LW link

Housing and Transportation Roundup #2

Zvi · Dec 27, 2022, 1:10 PM
25 points
0 comments · 12 min read · LW link
(thezvi.wordpress.com)

[Question] Are tulpas moral patients?

ChristianKl · Dec 27, 2022, 11:30 AM
16 points
28 comments · 1 min read · LW link

Reflections on my 5-month alignment upskilling grant

Jay Bailey · Dec 27, 2022, 10:51 AM
82 points
4 comments · 8 min read · LW link

Institutions Cannot Restrain Dark-Triad AI Exploitation

Dec 27, 2022, 10:34 AM
5 points
0 comments · 5 min read · LW link
(mflb.com)

Introduction: Bias in Evaluating AGI X-Risks

Dec 27, 2022, 10:27 AM
1 point
0 comments · 3 min read · LW link

MDPs and the Bellman Equation, Intuitively Explained

Jack O'Brien · Dec 27, 2022, 5:50 AM
11 points
3 comments · 14 min read · LW link

How ‘Human-Human’ dynamics give way to ‘Human-AI’ and then ‘AI-AI’ dynamics

Dec 27, 2022, 3:16 AM
−2 points
5 comments · 2 min read · LW link
(mflb.com)

Nine Points of Collective Insanity

Dec 27, 2022, 3:14 AM
−2 points
3 comments · 1 min read · LW link
(mflb.com)

Fractional Resignation

jefftk · Dec 27, 2022, 2:30 AM
19 points
6 comments · 1 min read · LW link
(www.jefftk.com)

[Question] What policies have most thoroughly crippled (otherwise-promising) industries or technologies?

benwr · Dec 27, 2022, 2:25 AM
40 points
4 comments · 1 min read · LW link

Recent advances in Natural Language Processing—Some Woolly speculations (2019 essay on semantics and language models)

philosophybear · Dec 27, 2022, 2:11 AM
1 point
0 comments · 7 min read · LW link

Against Agents as an Approach to Aligned Transformative AI

DragonGod · Dec 27, 2022, 12:47 AM
12 points
9 comments · 2 min read · LW link

Can we efficiently distinguish different mechanisms?

paulfchristiano · Dec 27, 2022, 12:20 AM
88 points
30 comments · 16 min read · LW link
(ai-alignment.com)

Air-gapping evaluation and support

Ryan Kidd · Dec 26, 2022, 10:52 PM
53 points
1 comment · 2 min read · LW link

Slightly against aligning with neo-luddites

Matthew Barnett · Dec 26, 2022, 10:46 PM
104 points
31 comments · 4 min read · LW link

Avoiding perpetual risk from TAI

scasper · Dec 26, 2022, 10:34 PM
15 points
6 comments · 5 min read · LW link

Announcing: The Independent AI Safety Registry

Shoshannah Tekofsky · Dec 26, 2022, 9:22 PM
53 points
9 comments · 1 min read · LW link

Are men harder to help?

braces · Dec 26, 2022, 9:11 PM
35 points
1 comment · 2 min read · LW link

[Question] How much should I update on the fact that my dentist is named Dennis?

MichaelDickens · Dec 26, 2022, 7:11 PM
2 points
3 comments · 1 min read · LW link

Theodicy and the simulation hypothesis, or: The problem of simulator evil

philosophybear · Dec 26, 2022, 6:55 PM
12 points
12 comments · 19 min read · LW link
(philosophybear.substack.com)

Safety of Self-Assembled Neuromorphic Hardware

Can · Dec 26, 2022, 6:51 PM
16 points
2 comments · 10 min read · LW link
(forum.effectivealtruism.org)

Coherent extrapolated dreaming

Alex Flint · Dec 26, 2022, 5:29 PM
38 points
10 comments · 17 min read · LW link

An overview of some promising work by junior alignment researchers

Akash · Dec 26, 2022, 5:23 PM
34 points
0 comments · 4 min read · LW link

Solstice song: Here Lies the Dragon

jchan · Dec 26, 2022, 4:08 PM
8 points
1 comment · 2 min read · LW link

The Usefulness Paradigm

Aprillion · Dec 26, 2022, 1:23 PM
4 points
4 comments · 1 min read · LW link

Looking Back on Posts From 2022

Zvi · Dec 26, 2022, 1:20 PM
50 points
8 comments · 17 min read · LW link
(thezvi.wordpress.com)

Analogies between Software Reverse Engineering and Mechanistic Interpretability

Dec 26, 2022, 12:26 PM
34 points
6 comments · 11 min read · LW link
(www.neelnanda.io)

Mlyyrczo

lsusr · Dec 26, 2022, 7:58 AM
41 points
14 comments · 3 min read · LW link

Causal abstractions vs infradistributions

Pablo Villalobos · Dec 26, 2022, 12:21 AM
24 points
0 comments · 6 min read · LW link

Concrete Steps to Get Started in Transformer Mechanistic Interpretability

Neel Nanda · Dec 25, 2022, 10:21 PM
57 points
7 comments · 12 min read · LW link
(www.neelnanda.io)

It’s time to worry about online privacy again

Malmesbury · Dec 25, 2022, 9:05 PM
67 points
23 comments · 6 min read · LW link

[Hebbian Natural Abstractions] Mathematical Foundations

Dec 25, 2022, 8:58 PM
15 points
2 comments · 6 min read · LW link
(www.snellessen.com)

[Question] Oracle AGI—How can it escape, other than security issues? (Steganography?)

RationalSieve · Dec 25, 2022, 8:14 PM
3 points
6 comments · 1 min read · LW link

YCombinator fraud rates

Xodarap · Dec 25, 2022, 7:21 PM
56 points
3 comments · 1 min read · LW link

How evolutionary lineages of LLMs can plan their own future and act on these plans

Roman Leventov · Dec 25, 2022, 6:11 PM
39 points
16 comments · 8 min read · LW link

Accurate Models of AI Risk Are Hyperexistential Exfohazards

Thane Ruthenis · Dec 25, 2022, 4:50 PM
32 points
38 comments · 9 min read · LW link

ChatGPT is our Wright Brothers moment

Ron J · Dec 25, 2022, 4:26 PM
10 points
9 comments · 1 min read · LW link

The Meditation on Winter

Raemon · Dec 25, 2022, 4:12 PM
59 points
3 comments · 3 min read · LW link

I’ve updated towards AI boxing being surprisingly easy

Noosphere89 · Dec 25, 2022, 3:40 PM
8 points
20 comments · 2 min read · LW link

Take 14: Corrigibility isn’t that great.

Charlie Steiner · Dec 25, 2022, 1:04 PM
15 points
3 comments · 3 min read · LW link

Simplified Level Up

jefftk · Dec 25, 2022, 1:00 PM
12 points
16 comments · 2 min read · LW link
(www.jefftk.com)

Hyperfinite graphs ~ manifolds

Alok Singh · Dec 25, 2022, 12:24 PM
11 points
5 comments · 2 min read · LW link

Inconsistent math is great

Alok Singh · Dec 25, 2022, 3:20 AM
1 point
2 comments · 1 min read · LW link