How Your Physiology Affects the Mind’s Projection Fallacy

YanLyutnev 14 Dec 2024 21:10 UTC

0 points

0 comments 6 min read LW link

Introducing the Evidence Color Wheel

Larry Lee 14 Dec 2024 16:08 UTC

6 points

0 comments 3 min read LW link

An Illustrated Summary of “Robust Agents Learn Causal World Model”

Dalcy 14 Dec 2024 15:02 UTC

63 points

2 comments 10 min read LW link

Best-of-N Jailbreaking

John Hughes, saraprice, Aengus Lynch, Rylan Schaeffer, Fazl, Henry Sleight, Ethan Perez and mrinank_sharma

14 Dec 2024 4:58 UTC

78 points

5 comments 2 min read LW link

(arxiv.org)

D&D.Sci Dungeonbuilding: the Dungeon Tournament

aphyer 14 Dec 2024 4:30 UTC

49 points

16 comments 3 min read LW link

Creating Interpretable Latent Spaces with Gradient Routing

Jacob G-W 14 Dec 2024 4:00 UTC

26 points

6 comments 2 min read LW link

(jacobgw.com)

Probability of death by suicide by a 26 year old

John Wiseman 14 Dec 2024 3:33 UTC

−25 points

4 comments 1 min read LW link

Matryoshka Sparse Autoencoders

Noa Nabeshima 14 Dec 2024 2:52 UTC

91 points

15 comments 11 min read LW link

[Question] What is MIRI currently doing?

Roko 14 Dec 2024 2:39 UTC

32 points

14 comments 1 min read LW link

The o1 System Card Is Not About o1

Zvi 13 Dec 2024 20:30 UTC

116 points

5 comments 16 min read LW link

(thezvi.wordpress.com)

Arch-anarchy and The Fable of the Dragon-Tyrant

Peter lawless 13 Dec 2024 20:15 UTC

−10 points

0 comments 1 min read LW link

Communications in Hard Mode (My new job at MIRI)

tanagrabeast 13 Dec 2024 20:13 UTC

202 points

25 comments 5 min read LW link

First Thoughts on Detachmentism

Jacob Peterson 13 Dec 2024 1:19 UTC

−11 points

5 comments 9 min read LW link

How to Build Heaven: A Constrained Boltzmann Brain Generator

High Tides 13 Dec 2024 1:04 UTC

−8 points

3 comments 5 min read LW link

Representing Irrationality in Game Theory

Larry Lee 13 Dec 2024 0:50 UTC

−1 points

3 comments 11 min read LW link

“Charity” as a conflationary alliance term

Jan_Kulveit 12 Dec 2024 21:49 UTC

34 points

2 comments 5 min read LW link

Just one more exposure bro

Chipmonk 12 Dec 2024 21:37 UTC

51 points

6 comments 2 min read LW link

(chrislakin.blog)

The Dangers of Mirrored Life

Niko_McCarty and fin

12 Dec 2024 20:58 UTC

119 points

7 comments 29 min read LW link

(www.asimov.press)

Effective Networking as Sending Hard to Fake Signals

vaishnav92 12 Dec 2024 20:32 UTC

25 points

2 comments 7 min read LW link

(www.optimaloutliers.com)

Mini PAPR Review

jefftk 12 Dec 2024 19:10 UTC

10 points

0 comments 2 min read LW link

(www.jefftk.com)

Biological risk from the mirror world

jasoncrawford 12 Dec 2024 19:07 UTC

333 points

37 comments 7 min read LW link

(newsletter.rootsofprogress.org)

Naturalistic dualism

Arturo Macias 12 Dec 2024 16:19 UTC

−4 points

0 comments 4 min read LW link

AI #94: Not Now, Google

Zvi 12 Dec 2024 15:40 UTC

49 points

3 comments 64 min read LW link

(thezvi.wordpress.com)

Consciousness, Intelligence, and AI – Some Quick Notes [call it a mini-ramble]

Bill Benzon 12 Dec 2024 15:04 UTC

−3 points

0 comments 4 min read LW link

The Dissolution of AI Safety

Roko 12 Dec 2024 10:34 UTC

8 points

44 comments 1 min read LW link

(www.transhumanaxiology.com)

Is Optimization Correct?

Yoshinori Okamoto 12 Dec 2024 10:27 UTC

−9 points

0 comments 2 min read LW link

AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead

DanielFilan 12 Dec 2024 5:40 UTC

20 points

0 comments 16 min read LW link

Public computers can make addictive tools safe

dkl9 11 Dec 2024 19:55 UTC

23 points

0 comments 1 min read LW link

(dkl9.net)

Solving Newcomb’s Paradox In Real Life

Alice Wanderland 11 Dec 2024 19:48 UTC

3 points

0 comments 1 min read LW link

(open.substack.com)

The “Think It Faster” Exercise

Raemon 11 Dec 2024 19:14 UTC

142 points

35 comments 13 min read LW link

Forecast With GiveWell

ChristianWilliams 11 Dec 2024 17:52 UTC

11 points

0 comments 1 min read LW link

(www.metaculus.com)

A shortcoming of concrete demonstrations as AGI risk advocacy

Steven Byrnes 11 Dec 2024 16:48 UTC

103 points

27 comments 2 min read LW link

Why Isn’t Tesla Level 3?

jefftk 11 Dec 2024 14:50 UTC

22 points

7 comments 2 min read LW link

(www.jefftk.com)

Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks

Tom DAVID, Pierre Peigné, Quentin FEUILLADE--MONTIXI, Kay Kozaronek and Miailhe Nicolas

11 Dec 2024 13:37 UTC

4 points

3 comments 2 min read LW link

Post-Quantum Investing: Dump Crypto for Index Funds and Real Estate?

G 11 Dec 2024 11:59 UTC

8 points

5 comments 1 min read LW link

Low-effort review of “AI For Humanity”

Charlie Steiner 11 Dec 2024 9:54 UTC

13 points

0 comments 4 min read LW link

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders

Can, Adam Karvonen, Johnny Lin, Curt Tigges, Joseph Bloom, chanind, Yeu-Tong Lau, Eoin Farrell, Arthur Conmy, CallumMcDougall, Kola Ayonrinde, Matthew Wearden, Sam Marks and Neel Nanda

11 Dec 2024 6:30 UTC

82 points

6 comments 2 min read LW link

(www.neuronpedia.org)

Zombies! Substance Dualist Zombies?

Ape in the coat 11 Dec 2024 6:10 UTC

15 points

7 comments 6 min read LW link

My thoughts on correlation and causation

Victor Porton 11 Dec 2024 5:08 UTC

−13 points

3 comments 1 min read LW link

Why empiricists should believe in AI risk

Knight Lee 11 Dec 2024 3:51 UTC

5 points

0 comments 1 min read LW link

[Question] fake alignment solutions????

KvmanThinking 11 Dec 2024 3:31 UTC

1 point

6 comments 1 min read LW link

Second-Time Free

jefftk 11 Dec 2024 3:30 UTC

24 points

4 comments 1 min read LW link

(www.jefftk.com)

Frontier AI systems have surpassed the self-replicating red line

aproteinengine 11 Dec 2024 3:06 UTC

9 points

4 comments 1 min read LW link

(github.com)

The Technist Reformation: A Discussion with o1 About The Coming Economic Event Horizon

Yuli_Ban 11 Dec 2024 2:34 UTC

5 points

2 comments 17 min read LW link

LessWrong audio: help us choose the new voice

PeterH and TYPE III AUDIO

11 Dec 2024 2:24 UTC

23 points

1 comment 1 min read LW link

Apply to attend a Global Challenges Project workshop in 2025!

LiamE 11 Dec 2024 0:41 UTC

6 points

0 comments 2 min read LW link

(forum.effectivealtruism.org)

The MVO and The MVP

kwang 10 Dec 2024 23:17 UTC

0 points

0 comments 7 min read LW link

(kevw.substack.com)

What is Confidence—in Game Theory and Life?

James Stephen Brown 10 Dec 2024 23:06 UTC

3 points

0 comments 8 min read LW link

(nonzerosum.games)

Computational functionalism probably can’t explain phenomenal consciousness

EuanMcLean 10 Dec 2024 17:11 UTC

17 points

36 comments 12 min read LW link

o1 Turns Pro

Zvi 10 Dec 2024 17:00 UTC

59 points

3 comments 14 min read LW link

(thezvi.wordpress.com)