LessWrong — Archive: December 2022
- [Question] Will the first AGI agent have been designed as an agent (in addition to an AGI)? (nahoj, Dec 3, 2022, 8:32 PM) · 1 point · 8 comments · 1 min read · LW link
- Logical induction for software engineers (Alex Flint, Dec 3, 2022, 7:55 PM) · 161 points · 8 comments · 27 min read · LW link · 1 review
- Utilitarianism is the only option (aelwood, Dec 3, 2022, 5:14 PM) · −13 points · 7 comments · 1 min read · LW link
- Our 2022 Giving (jefftk, Dec 3, 2022, 3:40 PM) · 33 points · 0 comments · 1 min read · LW link (www.jefftk.com)
- [Question] Is school good or bad? (tailcalled, Dec 3, 2022, 1:14 PM) · 10 points · 76 comments · 1 min read · LW link
- MrBeast’s Squid Game Tricked Me (lsusr, Dec 3, 2022, 5:50 AM) · 75 points · 1 comment · 2 min read · LW link
- Great Cryonics Survey of 2022 (Mati_Roy, Dec 3, 2022, 5:10 AM) · 16 points · 0 comments · 1 min read · LW link
- Causal scrubbing: results on induction heads (LawrenceC, Adrià Garriga-alonso, Nicholas Goldowsky-Dill, ryan_greenblatt, Tao Lin, jenny, Ansh Radhakrishnan, Buck, and Nate Thomas, Dec 3, 2022, 12:59 AM) · 34 points · 1 comment · 17 min read · LW link
- Causal scrubbing: results on a paren balance checker (LawrenceC, Adrià Garriga-alonso, Nicholas Goldowsky-Dill, ryan_greenblatt, Tao Lin, jenny, Ansh Radhakrishnan, Buck, and Nate Thomas, Dec 3, 2022, 12:59 AM) · 34 points · 2 comments · 30 min read · LW link
- Causal scrubbing: Appendix (LawrenceC, Adrià Garriga-alonso, Nicholas Goldowsky-Dill, ryan_greenblatt, jenny, Ansh Radhakrishnan, Buck, and Nate Thomas, Dec 3, 2022, 12:58 AM) · 18 points · 4 comments · 20 min read · LW link
- Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research] (LawrenceC, Adrià Garriga-alonso, Nicholas Goldowsky-Dill, ryan_greenblatt, jenny, Ansh Radhakrishnan, Buck, and Nate Thomas, Dec 3, 2022, 12:58 AM) · 205 points · 35 comments · 20 min read · LW link · 1 review
- Take 2: Building tools to help build FAI is a legitimate strategy, but it’s dual-use. (Charlie Steiner, Dec 3, 2022, 12:54 AM) · 17 points · 1 comment · 2 min read · LW link
- D&D.Sci December 2022: The Boojumologist (abstractapplic, Dec 2, 2022, 11:39 PM) · 32 points · 9 comments · 2 min read · LW link
- Subsets and quotients in interpretability (Erik Jenner, Dec 2, 2022, 11:13 PM) · 26 points · 1 comment · 7 min read · LW link
- Research Principles for 6 Months of AI Alignment Studies (Shoshannah Tekofsky, Dec 2, 2022, 10:55 PM) · 23 points · 3 comments · 6 min read · LW link
- Three Fables of Magical Girls and Longtermism (Ulisse Mini, Dec 2, 2022, 10:01 PM) · 31 points · 11 comments · 2 min read · LW link
- Brun’s theorem and sieve theory (Ege Erdil, Dec 2, 2022, 8:57 PM) · 31 points · 1 comment · 73 min read · LW link
- Apply for the ML Upskilling Winter Camp in Cambridge, UK [2-10 Jan] (hannah wing-yee, Dec 2, 2022, 8:45 PM) · 3 points · 0 comments · 2 min read · LW link
- Takeoff speeds, the chimps analogy, and the Cultural Intelligence Hypothesis (NickGabs, Dec 2, 2022, 7:14 PM) · 16 points · 2 comments · 4 min read · LW link
- [ASoT] Finetuning, RL, and GPT’s world prior (Jozdien, Dec 2, 2022, 4:33 PM) · 44 points · 8 comments · 5 min read · LW link
- NeurIPS Safety & ChatGPT. MLAISU W48 (Esben Kran and Steinthal, Dec 2, 2022, 3:50 PM) · 3 points · 0 comments · 4 min read · LW link (newsletter.apartresearch.com)
- [Question] Is ChatGPT rigth when advising to brush the tongue when brushing teeth? (ChristianKl, Dec 2, 2022, 2:53 PM) · 13 points · 14 comments · 2 min read · LW link
- Jailbreaking ChatGPT on Release Day (Zvi, Dec 2, 2022, 1:10 PM) · 242 points · 77 comments · 6 min read · LW link · 1 review (thezvi.wordpress.com)
- Deconfusing Direct vs Amortised Optimization (beren, Dec 2, 2022, 11:30 AM) · 134 points · 19 comments · 10 min read · LW link
- Inner and outer alignment decompose one hard problem into two extremely hard problems (TurnTrout, Dec 2, 2022, 2:43 AM) · 148 points · 22 comments · 47 min read · LW link · 3 reviews
- New Feature: Collaborative editing now supports logged-out users (RobertM, Dec 2, 2022, 2:41 AM) · 10 points · 0 comments · 1 min read · LW link
- Mastering Stratego (Deepmind) (svemirski, Dec 2, 2022, 2:21 AM) · 6 points · 0 comments · 1 min read · LW link (www.deepmind.com)
- Update on Harvard AI Safety Team and MIT AI Alignment (Xander Davies, Sam Marks, kaivu, tlevin, eleni, maxnadeau, and Naomi Bashkansky, Dec 2, 2022, 12:56 AM) · 60 points · 4 comments · 8 min read · LW link
- Quick look: cognitive damage from well-administered anesthesia (Elizabeth, Dec 2, 2022, 12:40 AM) · 28 points · 0 comments · 4 min read · LW link (acesounderglass.com)
- Against meta-ethical hedonism (Joe Carlsmith, Dec 2, 2022, 12:23 AM) · 24 points · 4 comments · 35 min read · LW link
- Lumenators for very lazy British people (shakeelh, Dec 2, 2022, 12:18 AM) · 16 points · 3 comments · 1 min read · LW link
- Understanding goals in complex systems (Johannes C. Mayer, Dec 1, 2022, 11:49 PM) · 9 points · 0 comments · 1 min read · LW link (www.youtube.com)
- A challenge for AGI organizations, and a challenge for readers (Rob Bensinger and Eliezer Yudkowsky, Dec 1, 2022, 11:11 PM) · 301 points · 33 comments · 2 min read · LW link
- Playing with Aerial Photos (jefftk, Dec 1, 2022, 10:50 PM) · 9 points · 0 comments · 1 min read · LW link (www.jefftk.com)
- Take 1: We’re not going to reverse-engineer the AI. (Charlie Steiner, Dec 1, 2022, 10:41 PM) · 38 points · 4 comments · 4 min read · LW link
- Re-Examining LayerNorm (Eric Winsor, Dec 1, 2022, 10:20 PM) · 127 points · 12 comments · 5 min read · LW link
- The LessWrong 2021 Review: Intellectual Circle Expansion (Ruby and Raemon, Dec 1, 2022, 9:17 PM) · 95 points · 55 comments · 8 min read · LW link
- The Plan − 2022 Update (johnswentworth, Dec 1, 2022, 8:43 PM) · 239 points · 37 comments · 8 min read · LW link · 1 review
- Finding gliders in the game of life (paulfchristiano, Dec 1, 2022, 8:40 PM) · 104 points · 8 comments · 16 min read · LW link (ai-alignment.com)
- The Machine Stops (Chapter 9) (Justin Bullock, Dec 1, 2022, 7:20 PM) · 3 points · 0 comments · 47 min read · LW link
- Covid 12/1/22: China Protests (Zvi, Dec 1, 2022, 5:10 PM) · 38 points · 2 comments · 10 min read · LW link (thezvi.wordpress.com)
- ChatGPT: First Impressions (specbug, Dec 1, 2022, 4:36 PM) · 18 points · 2 comments · 13 min read · LW link (sixeleven.in)
- [LINK] ChatGPT discussion (JanB, Dec 1, 2022, 3:04 PM) · 13 points · 8 comments · 1 min read · LW link (openai.com)
- Research request (alignment strategy): Deep dive on “making AI solve alignment for us” (JanB, Dec 1, 2022, 2:55 PM) · 16 points · 3 comments · 1 min read · LW link
- Theories of impact for Science of Deep Learning (Marius Hobbhahn, Dec 1, 2022, 2:39 PM) · 24 points · 0 comments · 11 min read · LW link
- Safe Development of Hacker-AI Countermeasures – What if we are too late? (Erland Wittkotter, Dec 1, 2022, 7:59 AM) · 3 points · 0 comments · 14 min read · LW link
- Did ChatGPT just gaslight me? (TW123, Dec 1, 2022, 5:41 AM UTC) · 123 points · 45 comments · 9 min read · LW link (aiwatchtower.substack.com)
- SBF’s comments on ethics are no surprise to virtue ethicists (c.trout, Dec 1, 2022, 4:18 AM UTC) · 36 points · 30 comments · 16 min read · LW link
- Notes on Caution (David Gross, Dec 1, 2022, 3:05 AM UTC) · 14 points · 0 comments · 19 min read · LW link
- Reestablishing Reliable Sources: A System for Tagging URLs (Riley Mueller, Dec 1, 2022, 2:27 AM UTC) · 7 points · 1 comment · 3 min read · LW link