Scaling prediction markets with meta-markets

Dentosal · 10 Oct 2024 21:17 UTC
1 point
0 comments · 2 min read · LW link

Startup Success Rates Are So Low Because the Rewards Are So Large

AppliedDivinityStudies · 10 Oct 2024 20:22 UTC
42 points
6 comments · 2 min read · LW link

Can AI Outpredict Humans? Results From Metaculus’s Q3 AI Forecasting Benchmark

ChristianWilliams · 10 Oct 2024 18:58 UTC
50 points
2 comments · 1 min read · LW link
(www.metaculus.com)

Rationality Quotes—Fall 2024

Screwtape · 10 Oct 2024 18:37 UTC
75 points
22 comments · 1 min read · LW link

[Question] why won’t this alignment plan work?

KvmanThinking · 10 Oct 2024 15:44 UTC
6 points
7 comments · 1 min read · LW link

AI #85: AI Wins the Nobel Prize

Zvi · 10 Oct 2024 13:40 UTC
30 points
6 comments · 31 min read · LW link
(thezvi.wordpress.com)

Behavioral red-teaming is unlikely to produce clear, strong evidence that models aren’t scheming

Buck · 10 Oct 2024 13:36 UTC
100 points
4 comments · 13 min read · LW link

Joshua Achiam Public Statement Analysis

Zvi · 10 Oct 2024 12:50 UTC
73 points
14 comments · 21 min read · LW link
(thezvi.wordpress.com)

Do you want to do a debate on youtube? I’m looking for polite, truth-seeking participants.

Nathan Young · 10 Oct 2024 9:32 UTC
12 points
0 comments · 1 min read · LW link

Rationalist Gnosticism

tailcalled · 10 Oct 2024 9:06 UTC
9 points
10 comments · 3 min read · LW link

The deepest atheist: Sam Altman

Trey Edwin · 10 Oct 2024 3:27 UTC
14 points
2 comments · 4 min read · LW link

Values Are Real Like Harry Potter

9 Oct 2024 23:42 UTC
81 points
17 comments · 5 min read · LW link

Momentum of Light in Glass

Ben · 9 Oct 2024 20:19 UTC
141 points
44 comments · 11 min read · LW link

vgillioz’s Shortform

vgillioz · 9 Oct 2024 19:31 UTC
1 point
2 comments · 1 min read · LW link

Hamiltonian Dynamics in AI: A Novel Approach to Optimizing Reasoning in Language Models

Javier Marin Valenzuela · 9 Oct 2024 19:14 UTC
3 points
0 comments · 10 min read · LW link

Triangulating My Interpretation of Methods: Black Boxes by Marco J. Nathan

adamShimi · 9 Oct 2024 19:13 UTC
8 points
0 comments · 6 min read · LW link
(formethods.substack.com)

Scaffolding for “Noticing Metacognition”

Raemon · 9 Oct 2024 17:54 UTC
78 points
4 comments · 17 min read · LW link

Safe Predictive Agents with Joint Scoring Rules

Rubi J. Hudson · 9 Oct 2024 16:38 UTC
55 points
10 comments · 17 min read · LW link

Demis Hassabis and Geoffrey Hinton Awarded Nobel Prizes

Anna Gajdova · 9 Oct 2024 12:56 UTC
47 points
14 comments · 1 min read · LW link

Humans are (mostly) metarational

Yair Halberstadt · 9 Oct 2024 5:51 UTC
14 points
6 comments · 3 min read · LW link

[Job Ad] MATS is hiring!

9 Oct 2024 2:17 UTC
10 points
0 comments · 5 min read · LW link

Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead

Charlie Rogers-Smith · 9 Oct 2024 0:04 UTC
11 points
0 comments · 4 min read · LW link

AGI & Consciousness—Joscha Bach

Rahul Chand · 8 Oct 2024 22:51 UTC
1 point
0 comments · 10 min read · LW link

Video and transcript of presentation on Otherness and control in the age of AGI

Joe Carlsmith · 8 Oct 2024 22:30 UTC
35 points
1 comment · 27 min read · LW link

From seeded complexity to consciousness—yes, it’s all the same.

eschatail · 8 Oct 2024 21:31 UTC
−23 points
0 comments · 2 min read · LW link

Limits of safe and aligned AI

Shivam · 8 Oct 2024 21:30 UTC
2 points
0 comments · 4 min read · LW link

[Question] What constitutes an infohazard?

K1r4d4rk.v1 · 8 Oct 2024 21:29 UTC
−4 points
8 comments · 1 min read · LW link

[Question] What makes one a “rationalist”?

mathyouf · 8 Oct 2024 20:25 UTC
7 points
5 comments · 3 min read · LW link

[Intuitive self-models] 4. Trance

Steven Byrnes · 8 Oct 2024 13:30 UTC
63 points
7 comments · 24 min read · LW link

Schelling game evaluations for AI control

Olli Järviniemi · 8 Oct 2024 12:01 UTC
65 points
5 comments · 11 min read · LW link

Thinking About a Pedalboard

jefftk · 8 Oct 2024 11:50 UTC
9 points
2 comments · 1 min read · LW link
(www.jefftk.com)

Overview of strong human intelligence amplification methods

TsviBT · 8 Oct 2024 8:37 UTC
264 points
141 comments · 10 min read · LW link

Near-death experiences

Declan Molony · 8 Oct 2024 6:34 UTC
3 points
1 comment · 3 min read · LW link

The unreasonable effectiveness of plasmid sequencing as a service

Abhishaike Mahajan · 8 Oct 2024 2:02 UTC
23 points
2 comments · 13 min read · LW link
(www.owlposting.com)

There is a globe in your LLM

jacob_drori · 8 Oct 2024 0:43 UTC
86 points
4 comments · 1 min read · LW link

MATS AI Safety Strategy Curriculum v2

7 Oct 2024 22:44 UTC
42 points
6 comments · 13 min read · LW link

2025 Color Trends

sarahconstantin · 7 Oct 2024 21:20 UTC
40 points
7 comments · 6 min read · LW link
(sarahconstantin.substack.com)

Clarifying Alignment Fundamentals Through the Lens of Ontology

eternal/ephemera · 7 Oct 2024 20:57 UTC
12 points
4 comments · 24 min read · LW link

Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox

MrFantastic · 7 Oct 2024 20:56 UTC
−12 points
0 comments · 1 min read · LW link

Domain-specific SAEs

jacob_drori · 7 Oct 2024 20:15 UTC
27 points
0 comments · 5 min read · LW link

Metaculus Is Open Source

ChristianWilliams · 7 Oct 2024 19:55 UTC
13 points
0 comments · 1 min read · LW link
(www.metaculus.com)

Research update: Towards a Law of Iterated Expectations for Heuristic Estimators

Eric Neyman · 7 Oct 2024 19:29 UTC
87 points
2 comments · 22 min read · LW link

AI Model Registries: A Foundational Tool for AI Governance

7 Oct 2024 19:27 UTC
20 points
1 comment · 4 min read · LW link
(www.convergenceanalysis.org)

Evaluating the truth of statements in a world of ambiguous language.

Hastings · 7 Oct 2024 18:08 UTC
48 points
19 comments · 2 min read · LW link

Advice for journalists

Nathan Young · 7 Oct 2024 16:46 UTC
100 points
53 comments · 9 min read · LW link
(nathanpmyoung.substack.com)

Time Efficient Resistance Training

romeostevensit · 7 Oct 2024 15:15 UTC
42 points
8 comments · 3 min read · LW link

A Narrow Path: a plan to deal with AI extinction risk

7 Oct 2024 13:02 UTC
73 points
11 comments · 2 min read · LW link
(www.narrowpath.co)

Toy Models of Feature Absorption in SAEs

7 Oct 2024 9:56 UTC
46 points
7 comments · 10 min read · LW link

An argument that consequentialism is incomplete

cousin_it · 7 Oct 2024 9:45 UTC
32 points
27 comments · 1 min read · LW link

An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

7 Oct 2024 8:53 UTC
38 points
0 comments · 5 min read · LW link
(arxiv.org)