All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Notes on control evaluations for safety cases

ryan_greenblatt, Buck and Fabien Roger

Feb 28, 2024, 4:15 PM

49 points

0 comments32 min readLW link

Corporate Governance for Frontier AI Labs: A Research Agenda

Matthew WeardenFeb 28, 2024, 11:29 AM

4 points

0 comments16 min readLW link

(matthewwearden.co.uk)

How AI Will Change Education

robotelvisFeb 28, 2024, 5:30 AM

6 points

3 comments5 min readLW link

(messyprogress.substack.com)

Band Lessons?

jefftkFeb 28, 2024, 3:00 AM

13 points

3 comments1 min readLW link

(www.jefftk.com)

New LessWrong review winner UI (“The LeastWrong” section and full-art post pages)

kaveFeb 28, 2024, 2:42 AM

105 points

64 comments1 min readLW link

Counting arguments provide no evidence for AI doom

Nora Belrose and Quintin Pope

Feb 27, 2024, 11:03 PM

95 points

188 comments14 min readLW link

Which animals realize which types of subjective welfare?

MichaelStJulesFeb 27, 2024, 7:31 PM

4 points

0 comments1 min readLW link

Biosecurity and AI: Risks and Opportunities

Steve NewmanFeb 27, 2024, 6:45 PM

11 points

1 comment7 min readLW link

(www.safe.ai)

The Gemini Incident Continues

ZviFeb 27, 2024, 4:00 PM

45 points

6 comments48 min readLW link

(thezvi.wordpress.com)

How I internalized my achievements to better deal with negative feelings

Raymond KoopmanschapFeb 27, 2024, 3:10 PM

42 points

7 comments6 min readLW link

On Frustration and Regret

silentbobFeb 27, 2024, 12:19 PM

8 points

0 comments4 min readLW link

Facts vs Interpretations—An Exercise in Cognitive Reframing

Declan MolonyFeb 27, 2024, 7:57 AM

15 points

0 comments3 min readLW link

San Francisco ACX Meetup “Third Saturday”

Nate Sternberg and guenael

Feb 27, 2024, 7:07 AM

7 points

0 comments1 min readLW link

Examining Language Model Performance with Reconstructed Activations using Sparse Autoencoders

Evan Anders and Joseph Bloom

Feb 27, 2024, 2:43 AM

42 points

16 comments15 min readLW link

Project idea: an iterated prisoner’s dilemma competition/game

Adam ZernerFeb 26, 2024, 11:06 PM

8 points

0 comments5 min readLW link

Acting Wholesomely

owencbFeb 26, 2024, 9:49 PM

58 points

64 comments1 min readLW link

Getting rational now or later: navigating procrastination and time-inconsistent preferences for new rationalists

milo_thoughtsFeb 26, 2024, 7:38 PM

1 point

0 comments8 min readLW link

[Question] Whom Do You Trust?

JackOfAllTradesFeb 26, 2024, 7:38 PM

1 point

0 comments1 min readLW link

Boundary Violations vs Boundary Dissolution

ChipmonkFeb 26, 2024, 6:59 PM

8 points

4 comments1 min readLW link

[Question] Can we get an AI to “do our alignment homework for us”?

Chris_LeongFeb 26, 2024, 7:56 AM

53 points

33 comments1 min readLW link

How I build and run behavioral interviews

benkuhnFeb 26, 2024, 5:50 AM

32 points

6 comments4 min readLW link

(www.benkuhn.net)

Hidden Cognition Detection Methods and Benchmarks

Paul CologneseFeb 26, 2024, 5:31 AM

22 points

11 comments4 min readLW link

Cellular respiration as a steam engine

dkl9Feb 25, 2024, 8:17 PM

24 points

1 comment1 min readLW link

(dkl9.net)

[Question] Rationalism and Dependent Origination?

BaometrusFeb 25, 2024, 6:16 PM

2 points

3 comments1 min readLW link

China-AI forecasts

NathanBarnardFeb 25, 2024, 4:49 PM

39 points

29 comments6 min readLW link

Ideological Bayesians

Kevin DorstFeb 25, 2024, 2:17 PM

95 points

4 comments10 min readLW link

(kevindorst.substack.com)

Deconfusing In-Context Learning

Arjun PanicksseryFeb 25, 2024, 9:48 AM

37 points

1 comment2 min readLW link

Everett branches, inter-light cone trade and other alien matters: Appendix to “An ECL explainer”

Chi Nguyen and _will_

Feb 24, 2024, 11:09 PM

17 points

0 comments1 min readLW link

Cooperating with aliens and AGIs: An ECL explainer

Chi Nguyen, _will_ and Akash

Feb 24, 2024, 10:58 PM

51 points

8 comments1 min readLW link

Choosing My Quest (Part 2 of “The Sense Of Physical Necessity”)

LoganStrohlFeb 24, 2024, 9:31 PM

40 points

7 comments12 min readLW link

Rationality Research Report: Towards 10x OODA Looping?

RaemonFeb 24, 2024, 9:06 PM

114 points

21 comments15 min readLW link

Let’s ask some of the largest LLMs for tips and ideas on how to take over the world

Super AGIFeb 24, 2024, 8:35 PM

1 point

0 comments7 min readLW link

Exercise: Planmaking, Surprise Anticipation, and “Baba is You”

RaemonFeb 24, 2024, 8:33 PM

50 points

19 comments6 min readLW link

In search of God.

Spiritus DeiFeb 24, 2024, 6:59 PM

−19 points

3 comments7 min readLW link

Impossibility of Anthropocentric-Alignment

False NameFeb 24, 2024, 6:31 PM

−8 points

2 comments39 min readLW link

The Inner Alignment Problem

Jakub HalmešFeb 24, 2024, 5:55 PM

1 point

1 comment3 min readLW link

(jakubhalmes.substack.com)

We Need Major, But Not Radical, FDA Reform

Maxwell TabarrokFeb 24, 2024, 4:54 PM

42 points

12 comments7 min readLW link

(www.maximum-progress.com)

After Overmorrow: Scattered Musings on the Immediate Post-AGI World

Yuli_BanFeb 24, 2024, 3:49 PM

−3 points

0 comments26 min readLW link

[Question] CDT vs. EDT on Deterrence

notfnofnFeb 24, 2024, 3:41 PM

1 point

9 comments1 min readLW link

Balancing Games

jefftkFeb 24, 2024, 2:40 PM

61 points

18 comments1 min readLW link

(www.jefftk.com)

How well do truth probes generalise?

mishajwFeb 24, 2024, 2:12 PM

87 points

11 comments9 min readLW link

Rawls’s Veil of Ignorance Doesn’t Make Any Sense

Arjun PanicksseryFeb 24, 2024, 1:18 PM

10 points

9 comments1 min readLW link

[Question] Can someone explain to me what went wrong with ChatGPT?

Valentin BaltadzhievFeb 24, 2024, 11:50 AM

9 points

1 comment1 min readLW link

The Sense Of Physical Necessity: A Naturalism Demo (Introduction)

LoganStrohlFeb 24, 2024, 2:56 AM

59 points

1 comment6 min readLW link

Instrumental deception and manipulation in LLMs—a case study

Olli JärviniemiFeb 24, 2024, 2:07 AM

39 points

13 comments12 min readLW link

A starting point for making sense of task structure (in machine learning)

Kaarel, RP and jake_mendel

Feb 24, 2024, 1:51 AM

45 points

2 comments12 min readLW link

Why you, personally, should want a larger human population

jasoncrawfordFeb 23, 2024, 7:48 PM

32 points

32 comments5 min readLW link

(rootsofprogress.org)

Deliberative Cognitive Algorithms as Scaffolding

Cole WyethFeb 23, 2024, 5:15 PM

19 points

4 comments3 min readLW link

The Shutdown Problem: Incomplete Preferences as a Solution

EJTFeb 23, 2024, 4:01 PM

52 points

28 comments42 min readLW link

In set theory, everything is a set

Jacob G-WFeb 23, 2024, 2:35 PM

11 points

9 comments2 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer