Binary encoding as a simple explicit construction for superposition

tailcalled 12 Oct 2024 21:18 UTC

12 points

0 comments · 1 min read · LW link

[Question] How Should We Use Limited Time to Maximize Long-Term Impact?

queelius 12 Oct 2024 20:02 UTC

10 points

3 comments · 1 min read · LW link

A Percentage Model of a Person

Sable 12 Oct 2024 17:55 UTC

37 points

3 comments · 9 min read · LW link

(affablyevil.substack.com)

AI Compute governance: Verifying AI chip location

Farhan 12 Oct 2024 17:36 UTC

5 points

0 comments · 6 min read · LW link

Geoffrey Hinton on the Past, Present, and Future of AI

Stephen McAleese 12 Oct 2024 16:41 UTC

22 points

5 comments · 18 min read · LW link

[Question] I = W/T?

HNX 12 Oct 2024 15:15 UTC

0 points

3 comments · 1 min read · LW link

AI research assistants competition 2024Q3: Tie between Elicit and You.com

Elizabeth 12 Oct 2024 15:10 UTC

64 points

4 comments · 3 min read · LW link

(acesounderglass.com)

SAE features for refusal and sycophancy steering vectors

neverix, Dmitrii Kharlapenko, Arthur Conmy and Neel Nanda

12 Oct 2024 14:54 UTC

26 points

4 comments · 7 min read · LW link

Prices are Bounties

Maxwell Tabarrok 12 Oct 2024 14:51 UTC

51 points

13 comments · 2 min read · LW link

(www.maximum-progress.com)

Differential knowledge interconnection

Roman Leventov 12 Oct 2024 12:52 UTC

5 points

0 comments · 7 min read · LW link

Most arguments for AI Doom are either bad or weak

Logan Zoellner 12 Oct 2024 11:57 UTC

2 points

97 comments · 3 min read · LW link

Kassel ACX/LW Meetup

Fernand0 12 Oct 2024 7:47 UTC

2 points

0 comments · 1 min read · LW link

Neural Network And Newton’s Second Law

Max Ma 12 Oct 2024 6:25 UTC

−10 points

0 comments · 1 min read · LW link

[Question] If the DoJ goes through with the Google breakup, where does Deepmind end up?

O O 12 Oct 2024 5:06 UTC

5 points

1 comment · 1 min read · LW link

My motivation and theory of change for working in AI healthtech

Andrew_Critch 12 Oct 2024 0:36 UTC

169 points

37 comments · 14 min read · LW link

HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix

Jaehyuk Lim, Kanishk Tantia and Sinem

11 Oct 2024 23:06 UTC

8 points

2 comments · 10 min read · LW link

Changing the Mind of an LLM

testingthewaters 11 Oct 2024 22:25 UTC

2 points

0 comments · 5 min read · LW link

EIS XIV: Is mechanistic interpretability about to be practically useful?

scasper 11 Oct 2024 22:13 UTC

68 points

4 comments · 7 min read · LW link

Dario Amodei — Machines of Loving Grace

Matrice Jacobine 11 Oct 2024 21:43 UTC

62 points

26 comments · 1 min read · LW link

(darioamodei.com)

“Deep Galactic Chillout” a space to relax during SF tech week & meet wholesome, fun people

Jared Phillip Mantell 11 Oct 2024 19:50 UTC

1 point

0 comments · 1 min read · LW link

Open letter to young EAs

Leif Wenar 11 Oct 2024 19:49 UTC

9 points

10 comments · 1 min read · LW link

The Great Bootstrap

KristianRonn 11 Oct 2024 19:46 UTC

11 points

0 comments · 15 min read · LW link

Embracing complexity when developing and evaluating AI responsibly

Aliya Amirova 11 Oct 2024 17:46 UTC

2 points

9 comments · 9 min read · LW link

How much I’m paying for AI productivity software (and the future of AI use)

jacquesthibs 11 Oct 2024 17:11 UTC

57 points

16 comments · 8 min read · LW link

(jacquesthibodeau.com)

AI: The Philosopher’s Stone of the 21st Century

HNX 11 Oct 2024 16:55 UTC

0 points

2 comments · 29 min read · LW link

[Question] Who created the Less Wrong Gather Town?

Arepo 11 Oct 2024 8:53 UTC

2 points

1 comment · 1 min read · LW link

A Heuristic Proof of Practical Aligned Superintelligence

Roko 11 Oct 2024 5:05 UTC

7 points

6 comments · 1 min read · LW link

(transhumanaxiology.substack.com)

An AI crash is our best bet for restricting AI

Remmelt 11 Oct 2024 2:12 UTC

27 points

3 comments · 1 min read · LW link

A Triple Decker for Elfland

jefftk 11 Oct 2024 1:50 UTC

25 points

0 comments · 1 min read · LW link

(www.jefftk.com)

OODA your OODA Loop

Raemon 11 Oct 2024 0:50 UTC

37 points

3 comments · 3 min read · LW link

Scaling prediction markets with meta-markets

Dentosal 10 Oct 2024 21:17 UTC

1 point

0 comments · 2 min read · LW link

Startup Success Rates Are So Low Because the Rewards Are So Large

AppliedDivinityStudies 10 Oct 2024 20:22 UTC

42 points

6 comments · 2 min read · LW link

Can AI Outpredict Humans? Results From Metaculus’s Q3 AI Forecasting Benchmark

ChristianWilliams 10 Oct 2024 18:58 UTC

50 points

2 comments · 1 min read · LW link

(www.metaculus.com)

Rationality Quotes—Fall 2024

Screwtape 10 Oct 2024 18:37 UTC

78 points

26 comments · 1 min read · LW link

[Question] why won’t this alignment plan work?

KvmanThinking 10 Oct 2024 15:44 UTC

6 points

7 comments · 1 min read · LW link

AI #85: AI Wins the Nobel Prize

Zvi 10 Oct 2024 13:40 UTC

30 points

6 comments · 31 min read · LW link

(thezvi.wordpress.com)

Behavioral red-teaming is unlikely to produce clear, strong evidence that models aren’t scheming

Buck 10 Oct 2024 13:36 UTC

100 points

4 comments · 13 min read · LW link

Joshua Achiam Public Statement Analysis

Zvi 10 Oct 2024 12:50 UTC

73 points

14 comments · 21 min read · LW link

(thezvi.wordpress.com)

Do you want to do a debate on youtube? I’m looking for polite, truth-seeking participants.

Nathan Young 10 Oct 2024 9:32 UTC

12 points

0 comments · 1 min read · LW link

Rationalist Gnosticism

tailcalled 10 Oct 2024 9:06 UTC

9 points

10 comments · 3 min read · LW link

The deepest atheist: Sam Altman

Trey Edwin 10 Oct 2024 3:27 UTC

14 points

2 comments · 4 min read · LW link

Values Are Real Like Harry Potter

johnswentworth and David Lorell

9 Oct 2024 23:42 UTC

81 points

17 comments · 5 min read · LW link

Momentum of Light in Glass

Ben 9 Oct 2024 20:19 UTC

144 points

44 comments · 11 min read · LW link

vgillioz’s Shortform

vgillioz 9 Oct 2024 19:31 UTC

1 point

2 comments · 1 min read · LW link

Hamiltonian Dynamics in AI: A Novel Approach to Optimizing Reasoning in Language Models

Javier Marin Valenzuela 9 Oct 2024 19:14 UTC

3 points

0 comments · 10 min read · LW link

Triangulating My Interpretation of Methods: Black Boxes by Marco J. Nathan

adamShimi 9 Oct 2024 19:13 UTC

8 points

0 comments · 6 min read · LW link

(formethods.substack.com)

Scaffolding for “Noticing Metacognition”

Raemon 9 Oct 2024 17:54 UTC

80 points

4 comments · 17 min read · LW link

Safe Predictive Agents with Joint Scoring Rules

Rubi J. Hudson 9 Oct 2024 16:38 UTC

55 points

10 comments · 17 min read · LW link

Demis Hassabis and Geoffrey Hinton Awarded Nobel Prizes

Anna Gajdova 9 Oct 2024 12:56 UTC

48 points

14 comments · 1 min read · LW link

Humans are (mostly) metarational

Yair Halberstadt 9 Oct 2024 5:51 UTC

14 points

6 comments · 3 min read · LW link