Concrete Steps to Get Started in Transformer Mechanistic Interpretability

Neel Nanda

Dec 25, 2022, 10:21 PM

57 points

7 comments 12 min read LW link

(www.neelnanda.io)

It’s time to worry about online privacy again

Malmesbury

Dec 25, 2022, 9:05 PM

67 points

23 comments 6 min read LW link

[Hebbian Natural Abstractions] Mathematical Foundations

Samuel Nellessen and Jan

Dec 25, 2022, 8:58 PM

15 points

2 comments 6 min read LW link

(www.snellessen.com)

[Question] Oracle AGI—How can it escape, other than security issues? (Steganography?)

RationalSieve

Dec 25, 2022, 8:14 PM

3 points

6 comments 1 min read LW link

YCombinator fraud rates

Xodarap

Dec 25, 2022, 7:21 PM

56 points

3 comments LW link

How evolutionary lineages of LLMs can plan their own future and act on these plans

Roman Leventov

Dec 25, 2022, 6:11 PM

39 points

16 comments 8 min read LW link

Accurate Models of AI Risk Are Hyperexistential Exfohazards

Thane Ruthenis

Dec 25, 2022, 4:50 PM

33 points

38 comments 9 min read LW link

ChatGPT is our Wright Brothers moment

Ron J

Dec 25, 2022, 4:26 PM

10 points

9 comments 1 min read LW link

The Meditation on Winter

Raemon

Dec 25, 2022, 4:12 PM

59 points

3 comments 3 min read LW link

I’ve updated towards AI boxing being surprisingly easy

Noosphere89

Dec 25, 2022, 3:40 PM

8 points

20 comments 2 min read LW link

Take 14: Corrigibility isn’t that great.

Charlie Steiner

Dec 25, 2022, 1:04 PM

15 points

3 comments 3 min read LW link

Simplified Level Up

jefftk

Dec 25, 2022, 1:00 PM

12 points

16 comments 2 min read LW link

(www.jefftk.com)

Hyperfinite graphs ~ manifolds

Alok Singh

Dec 25, 2022, 12:24 PM

11 points

5 comments 2 min read LW link

Inconsistent math is great

Alok Singh

Dec 25, 2022, 3:20 AM

1 point

2 comments 1 min read LW link

A hundredth of a bit of extra entropy

Adam Scherlis

Dec 24, 2022, 9:12 PM

84 points

4 comments 3 min read LW link

Shared reality: a key driver of human behavior

kdbscott

Dec 24, 2022, 7:35 PM

126 points

25 comments 4 min read LW link

Contra Steiner on Too Many Natural Abstractions

DragonGod

Dec 24, 2022, 5:42 PM

10 points

6 comments 1 min read LW link

Three reasons to cooperate

paulfchristiano

Dec 24, 2022, 5:40 PM

86 points

14 comments 10 min read LW link

(sideways-view.com)

Practical AI risk I: Watching large compute

Gustavo Ramires

Dec 24, 2022, 1:25 PM

3 points

0 comments 1 min read LW link

Non-Elevated Air Purifiers

jefftk

Dec 24, 2022, 12:40 PM

10 points

2 comments 1 min read LW link

(www.jefftk.com)

The Case for Chip-Backed Dollars

AnthonyRepetto

Dec 24, 2022, 10:28 AM

0 points

1 comment 4 min read LW link

List #3: Why not to assume on prior that AGI-alignment workarounds are available

Remmelt

Dec 24, 2022, 9:54 AM

4 points

1 comment 3 min read LW link

List #2: Why coordinating to align as humans to not develop AGI is a lot easier than, well… coordinating as humans with AGI coordinating to be aligned with humans

Remmelt

Dec 24, 2022, 9:53 AM

1 point

0 comments 3 min read LW link

List #1: Why stopping the development of AGI is hard but doable

Remmelt

Dec 24, 2022, 9:52 AM

6 points

11 comments 5 min read LW link

The case against AI alignment

andrew sauer

Dec 24, 2022, 6:57 AM

126 points

110 comments 5 min read LW link

Content and Takeaways from SERI MATS Training Program with John Wentworth

RohanS

Dec 24, 2022, 4:17 AM

28 points

3 comments 12 min read LW link

Löb’s Lemma: an easier approach to Löb’s Theorem

Andrew_Critch

Dec 24, 2022, 2:02 AM

30 points

16 comments 3 min read LW link

Durkon, an open-source tool for Inherently Interpretable Modelling

abstractapplic

Dec 24, 2022, 1:49 AM

37 points

0 comments 4 min read LW link

Issues with uneven AI resource distribution

User_Luke

Dec 24, 2022, 1:18 AM

3 points

9 comments 5 min read LW link

(temporal.substack.com)

Loose Threads on Intelligence

Shoshannah Tekofsky

Dec 24, 2022, 12:38 AM

11 points

3 comments 8 min read LW link

[Question] If you factor out next token prediction, what are the remaining salient features of human cognition?

Shmi

Dec 24, 2022, 12:38 AM

9 points

7 comments 1 min read LW link

[Question] Why is “Argument Mapping” Not More Common in EA/Rationality (And What Objections Should I Address in a Post on the Topic?)

HarrisonDurland

Dec 23, 2022, 9:58 PM

10 points

5 comments 1 min read LW link

The Fear [Fiction]

Yitz

Dec 23, 2022, 9:21 PM

7 points

0 comments 1 min read LW link

To err is neural: select logs with ChatGPT

VipulNaik

Dec 23, 2022, 8:26 PM

22 points

2 comments 38 min read LW link

AISER—AIS Europe Retreat

Carolin

Dec 23, 2022, 7:03 PM

5 points

0 comments 1 min read LW link

Two Truths and a Prediction Market

Screwtape

Dec 23, 2022, 6:52 PM

22 points

2 comments 6 min read LW link

ChatGPT understands, but largely does not generate Spanglish (and other code-mixed) text

Milan W

Dec 23, 2022, 5:40 PM

15 points

5 comments 4 min read LW link

On sincerity

Joe Carlsmith

Dec 23, 2022, 5:13 PM

75 points

6 comments 42 min read LW link

Epigenetics of the mammalian germline

Metacelsus

Dec 23, 2022, 3:21 PM

37 points

0 comments 7 min read LW link

(denovo.substack.com)

Boston Solstice Songs

jefftk

Dec 23, 2022, 1:00 PM

9 points

0 comments 1 min read LW link

(www.jefftk.com)

Are there any reliable CAPTCHAs? Competition for CAPTCHA ideas that AIs can’t solve.

MrThink

Dec 23, 2022, 12:52 PM

7 points

37 comments 1 min read LW link

“Search” is dead. What is the new paradigm?

Shmi

Dec 23, 2022, 10:33 AM

15 points

9 comments 1 min read LW link

Article Review: Discovering Latent Knowledge (Burns, Ye, et al)

Robert_AIZI

Dec 22, 2022, 6:16 PM

13 points

4 comments 6 min read LW link

(aizi.substack.com)

Let’s think about slowing down AI

KatjaGrace

Dec 22, 2022, 5:40 PM

551 points

182 comments 38 min read LW link 3 reviews

(aiimpacts.org)

Some Notes on the mathematics of Toy Autoencoding Problems

carboniferous_umbraculum

Dec 22, 2022, 5:21 PM

18 points

1 comment 12 min read LW link

December 2022 updates and fundraising

AI Impacts

Dec 22, 2022, 5:20 PM

39 points

1 comment 3 min read LW link

(aiimpacts.org)

Covid 12/22/22: Reevaluating Past Options

Zvi

Dec 22, 2022, 4:50 PM

30 points

2 comments 9 min read LW link

(thezvi.wordpress.com)

China Covid #4

Zvi

Dec 22, 2022, 4:30 PM

50 points

2 comments 11 min read LW link

(thezvi.wordpress.com)

Racing through a minefield: the AI deployment problem

HoldenKarnofsky

Dec 22, 2022, 4:10 PM

38 points

2 comments 13 min read LW link

(www.cold-takes.com)

Lead in Chocolate?

jefftk

Dec 22, 2022, 4:10 PM

41 points

6 comments 2 min read LW link

(www.jefftk.com)
