
A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps

Linch

3 Dec 2024 21:57 UTC

64 points

2 comments, 1 min read, LW link

Deep Causal Transcoding: A Framework for Mechanistically Eliciting Latent Behaviors in Language Models

Andrew Mack and TurnTrout

3 Dec 2024 21:19 UTC

83 points

7 comments, 41 min read, LW link

“Alignment at Large”: Bending the Arc of History Towards Life-Affirming Futures

welfvh

3 Dec 2024 21:17 UTC

5 points

0 comments, 4 min read, LW link

Roots of Progress is hiring an event manager

jasoncrawford

3 Dec 2024 20:46 UTC

10 points

0 comments, 7 min read, LW link

(rootsofprogress.notion.site)

Do simulacra dream of digital sheep?

EuanMcLean

3 Dec 2024 20:25 UTC

16 points

36 comments, 10 min read, LW link

Orca communication project—seeking feedback (and collaborators)

Towards_Keeperhood

3 Dec 2024 17:29 UTC

31 points

16 comments, 2 min read, LW link

Book a Time to Chat about Interp Research

Logan Riggs

3 Dec 2024 17:27 UTC

47 points

3 comments, 1 min read, LW link

Balsa Research 2024 Update

Zvi

3 Dec 2024 12:30 UTC

19 points

0 comments, 5 min read, LW link

(thezvi.wordpress.com)

First Solo Bus Ride

jefftk

3 Dec 2024 12:20 UTC

28 points

1 comment, 1 min read, LW link

(www.jefftk.com)

How to make evals for the AISI evals bounty

TheManxLoiner

3 Dec 2024 10:44 UTC

8 points

0 comments, 5 min read, LW link

Should there be just one western AGI project?

rosehadshar and Tom Davidson

3 Dec 2024 10:11 UTC

78 points

72 comments, 15 min read, LW link

Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft

Andrew_Critch

3 Dec 2024 9:29 UTC

46 points

2 comments, 5 min read, LW link

[Question] What is your opinion of Dr. Angelo Dilullo(meditation)?

Suh_Prance_Alot

3 Dec 2024 5:54 UTC

0 points

0 comments, 1 min read, LW link

Chemical Turing Machines

Yudhister Kumar

3 Dec 2024 5:26 UTC

10 points

2 comments, 4 min read, LW link

(www.yudhister.me)

MIRI’s 2024 End-of-Year Update

Rob Bensinger

3 Dec 2024 4:33 UTC

98 points

2 comments, 4 min read, LW link

Linkpost: Rat Traps by Sheon Han in Asterisk Mag

Chris_Leong

3 Dec 2024 3:22 UTC

12 points

5 comments, 1 min read, LW link

(asteriskmag.com)

[Question] Who are the worthwhile non-European pre-Industrial thinkers?

Lorec

3 Dec 2024 1:45 UTC

12 points

4 comments, 1 min read, LW link

A Paradox of Simulated Suffering

arusarda

2 Dec 2024 23:44 UTC

−1 points

3 comments, 1 min read, LW link

Levels of Thought: from Points to Fields

HNX

2 Dec 2024 20:25 UTC

4 points

2 comments, 23 min read, LW link

From Code to Managing: Why Being a ‘Force Multiplier’ Matters to Me More Than Being a Coding Wizard

cloak

2 Dec 2024 20:10 UTC

−3 points

0 comments, 1 min read, LW link

(www.reddit.com)

A case for donating to AI risk reduction (including if you work in AI)

tlevin

2 Dec 2024 19:05 UTC

61 points

2 comments, 1 min read, LW link

Fertility Roundup #4

Zvi

2 Dec 2024 14:30 UTC

35 points

16 comments, 49 min read, LW link

(thezvi.wordpress.com)

Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI

Connor Leahy and Gabriel Alfour

2 Dec 2024 13:28 UTC

43 points

9 comments, 29 min read, LW link

(www.conjecture.dev)

2024 Unofficial LessWrong Census/Survey

Screwtape

2 Dec 2024 5:30 UTC

91 points

42 comments, 1 min read, LW link

Drexler’s Nanotech Software

PeterMcCluskey

2 Dec 2024 4:55 UTC

65 points

9 comments, 4 min read, LW link

(bayesianinvestor.com)

Sorry for the downtime, looks like we got DDosd

habryka

2 Dec 2024 4:14 UTC

109 points

13 comments, 1 min read, LW link

[Question] Is malice a real emotion?

landscape_kiwi

1 Dec 2024 23:47 UTC

7 points

5 comments, 1 min read, LW link

Teaching My Younger Self to Program: A case study of how I’d pass on my skill at self-learning

Shoshannah Tekofsky

1 Dec 2024 21:05 UTC

25 points

1 comment, 7 min read, LW link

(thinkfeelplay.substack.com)

[Question] Which Biases are most important to Overcome?

abstractapplic

1 Dec 2024 15:40 UTC

35 points

24 comments, 1 min read, LW link

Commenting Patterns by Platform

jefftk

1 Dec 2024 11:50 UTC

12 points

0 comments, 1 min read, LW link

(www.jefftk.com)

[Letter] Chinese Quickstart

lsusr

1 Dec 2024 6:38 UTC

31 points

0 comments, 5 min read, LW link

AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment

DanielFilan

1 Dec 2024 6:00 UTC

41 points

0 comments, 67 min read, LW link

Magnitudes: Let’s Comprehend the Incomprehensible!

joec

1 Dec 2024 3:08 UTC

21 points

8 comments, 3 min read, LW link

[Question] Why does ChatGPT throw an error when outputting “David Mayer”?

Archimedes

1 Dec 2024 0:11 UTC

6 points

9 comments, 1 min read, LW link

Introducing the Anthropic Fellows Program

Miranda Zhang and Ethan Perez

30 Nov 2024 23:47 UTC

26 points

0 comments, 4 min read, LW link

(alignment.anthropic.com)

The Shape of Heaven

ejk64

30 Nov 2024 23:38 UTC

15 points

1 comment, 5 min read, LW link

AI Training Opt-Outs Reinforce Global Power Asymmetries

kushagra

30 Nov 2024 22:08 UTC

3 points

0 comments, 6 min read, LW link

Visual demonstration of Optimizer’s curse

Roman Malov

30 Nov 2024 19:34 UTC

24 points

3 comments, 7 min read, LW link

CAIDP Statement on Lethal Autonomous Weapons Systems

Heramb

30 Nov 2024 18:16 UTC

−1 points

0 comments, 1 min read, LW link

(forum.effectivealtruism.org)

Launching Applications for the Global AI Safety Fellowship 2025!

Aditya_SK

30 Nov 2024 14:02 UTC

11 points

4 comments, 1 min read, LW link

Exporting Facebook Comments, Again

jefftk

30 Nov 2024 12:40 UTC

10 points

6 comments, 1 min read, LW link

(www.jefftk.com)

Mathematical Futurology: From Pseudoscience to Rigorous Framework

Wenitte Apiou

30 Nov 2024 3:27 UTC

−1 points

1 comment, 2 min read, LW link

(The) Lightcone is nothing without its people: LW + Lighthaven’s big fundraiser

habryka

30 Nov 2024 2:55 UTC

570 points

194 comments, 41 min read, LW link

Sexual Selection as a Mesa-Optimizer

Lorec

29 Nov 2024 23:34 UTC

3 points

0 comments, 37 min read, LW link

INTELLECT-1 Release: The First Globally Trained 10B Parameter Model

Matrice Jacobine

29 Nov 2024 23:05 UTC

16 points

1 comment, 1 min read, LW link

(www.primeintellect.ai)

How to bet on AI, without helping AGI?

Nicholas / Heather Kross

29 Nov 2024 22:46 UTC

24 points

0 comments, 1 min read, LW link

You should consider applying to PhDs (soon!)

bilalchughtai

29 Nov 2024 20:33 UTC

112 points

19 comments, 6 min read, LW link

Understanding Emergence in Large Language Models

egek92

29 Nov 2024 19:42 UTC

3 points

1 comment, 2 min read, LW link

I’m a rationalist but....

ninney

29 Nov 2024 19:41 UTC

−19 points

0 comments, 1 min read, LW link

The ‘Road Not Taken’ in the Multiverse

Jonah Wilberg

29 Nov 2024 19:01 UTC

2 points

0 comments, 7 min read, LW link