D&D.Sci: Whom Shall You Call?

abstractapplic

5 Jul 2024 20:53 UTC

38 points

6 comments 2 min read LW link

[Interim research report] Activation plateaus & sensitive directions in GPT2

StefanHex and jake_mendel

5 Jul 2024 17:05 UTC

65 points

2 comments 5 min read LW link

Minimalist And Maximalist Type Systems

adamShimi

5 Jul 2024 16:25 UTC

17 points

6 comments 3 min read LW link

(epistemologicalfascinations.substack.com)

ML4Good Summer Bootcamps—Applications Open [deadline extended]

YM

5 Jul 2024 13:59 UTC

12 points

0 comments 1 min read LW link

[Question] Are there any plans to launch a paperback version of “Rationality: From AI to Zombies”?

m_arj

5 Jul 2024 11:14 UTC

2 points

1 comment 1 min read LW link

Doomsday Argument and the False Dilemma of Anthropic Reasoning

Ape in the coat

5 Jul 2024 5:38 UTC

37 points

55 comments 7 min read LW link

Finding the Wisdom to Build Safe AI

Gordon Seidoh Worley

4 Jul 2024 19:04 UTC

36 points

10 comments 9 min read LW link

Libs vs Frameworks, Middle-Level Regularities vs Theories

adamShimi

4 Jul 2024 19:01 UTC

23 points

0 comments 2 min read LW link

(epistemologicalfascinations.substack.com)

The Potential Impossibility of Subjective Death

VictorLJZ

4 Jul 2024 18:17 UTC

2 points

34 comments 1 min read LW link

Consider the humble rock (or: why the dumb thing kills you)

pleiotroth

4 Jul 2024 13:54 UTC

62 points

11 comments 4 min read LW link

AI #71: Farewell to Chevron

Zvi

4 Jul 2024 13:40 UTC

53 points

9 comments 36 min read LW link

(thezvi.wordpress.com)

The Dumbification of our smart screens

Itay Dreyfus

4 Jul 2024 6:32 UTC

18 points

0 comments 5 min read LW link

(productidentity.co)

Introduction to French AI Policy

Lucie Philippon

4 Jul 2024 3:39 UTC

110 points

12 comments 6 min read LW link

How predictive processing solved my wrist pain

max_shen

4 Jul 2024 1:56 UTC

35 points

8 comments 8 min read LW link

80,000 hours should remove OpenAI from the Job Board (and similar EA orgs should do similarly)

Raemon

3 Jul 2024 20:34 UTC

274 points

71 comments 1 min read LW link

Notes on Tuning Metacognition

JoNeedsSleep

3 Jul 2024 19:54 UTC

8 points

0 comments 5 min read LW link

When Are Results from Computational Complexity Not Too Coarse?

Dalcy

3 Jul 2024 19:06 UTC

41 points

8 comments 3 min read LW link

Musings on LLM Scale (Jul 2024)

Vladimir_Nesov

3 Jul 2024 18:35 UTC

34 points

0 comments 3 min read LW link

Static Analysis As A Lifestyle

adamShimi

3 Jul 2024 18:29 UTC

65 points

11 comments 3 min read LW link

(epistemologicalfascinations.substack.com)

AI development is an act of social revolution

artemiocobb

3 Jul 2024 18:00 UTC

3 points

0 comments 3 min read LW link

[Question] What percent of the sun would a Dyson Sphere cover?

Raemon

3 Jul 2024 17:27 UTC

24 points

26 comments 1 min read LW link

[Question] Isomorphisms don’t preserve subjective experience… right?

notfnofn

3 Jul 2024 14:22 UTC

5 points

26 comments 1 min read LW link

3C’s: A Recipe For Mathing Concepts

johnswentworth and David Lorell

3 Jul 2024 1:06 UTC

81 points

5 comments 7 min read LW link

Announcing the AI Forecasting Benchmark Series | July 8, $120k in Prizes

ChristianWilliams

2 Jul 2024 22:33 UTC

15 points

0 comments 1 min read LW link

(www.metaculus.com)

Open Sourcing Metaculus

ChristianWilliams

2 Jul 2024 22:30 UTC

44 points

0 comments 1 min read LW link

(www.metaculus.com)

[Question] Why Can’t Sub-AGI Solve AI Alignment? Or: Why Would Sub-AGI AI Not be Aligned?

MrThink

2 Jul 2024 20:13 UTC

4 points

23 comments 1 min read LW link

[Question] Why haven’t there been assassination attempts against high profile AI accelerationists like sam altman yet?

louisTrem

2 Jul 2024 18:16 UTC

−13 points

4 comments 2 min read LW link

How ARENA course material gets made

CallumMcDougall

2 Jul 2024 18:04 UTC

41 points

2 comments 7 min read LW link

An AI Race With China Can Be Better Than Not Racing

niplav

2 Jul 2024 17:57 UTC

69 points

33 comments 11 min read LW link

List of Collective Intelligence Projects

Chipmonk

2 Jul 2024 14:10 UTC

40 points

9 comments 2 min read LW link

(chrislakin.blog)

Decomposing the QK circuit with Bilinear Sparse Dictionary Learning

keith_wynroe and Lee Sharkey

2 Jul 2024 13:17 UTC

86 points

7 comments 12 min read LW link

Economics Roundup #2

Zvi

2 Jul 2024 12:40 UTC

35 points

5 comments 23 min read LW link

(thezvi.wordpress.com)

How Congressional Offices Process Constituent Communication

Tristan Williams

2 Jul 2024 12:38 UTC

24 points

0 comments 1 min read LW link

OthelloGPT learned a bag of heuristics

jylin04, JackS, Adam Karvonen and Can

2 Jul 2024 9:12 UTC

109 points

10 comments 9 min read LW link

Blueprint for a Brighter Future

Alex Beyman

2 Jul 2024 6:15 UTC

−1 points

0 comments 5 min read LW link

Covert Malicious Finetuning

Tony Wang and dannyhalawi

2 Jul 2024 2:41 UTC

89 points

4 comments 3 min read LW link

Interpreting Preference Models w/ Sparse Autoencoders

Logan Riggs and Jannik Brinkmann

1 Jul 2024 21:35 UTC

74 points

12 comments 9 min read LW link

Honest science is spirituality

pchvykov

1 Jul 2024 20:33 UTC

−1 points

10 comments 4 min read LW link

New Executive Team & Board — PIBBSS

Nora_Ammann

1 Jul 2024 19:30 UTC

43 points

1 comment 1 min read LW link

Uncursing Civilization

Lorec

1 Jul 2024 18:44 UTC

−6 points

2 comments 5 min read LW link

[Question] Self-censoring on AI x-risk discussions?

Decaeneus

1 Jul 2024 18:24 UTC

17 points

2 comments 1 min read LW link

Rationalists As People Who Build Piles Of Rocks

Sable

1 Jul 2024 10:32 UTC

9 points

0 comments 5 min read LW link

(affablyevil.substack.com)

How good are LLMs at doing ML on an unknown dataset?

Håvard Tveit Ihle

1 Jul 2024 9:04 UTC

33 points

4 comments 13 min read LW link

Whirlwind Tour of Chain of Thought Literature Relevant to Automating Alignment Research.

sevdeawesome

1 Jul 2024 5:50 UTC

25 points

0 comments 17 min read LW link

Probabilistic Logic ⇔ Oracles?

Yudhister Kumar

1 Jul 2024 5:36 UTC

15 points

0 comments 4 min read LW link

Important open problems in voting

Closed Limelike Curves

1 Jul 2024 2:53 UTC

33 points

1 comment 1 min read LW link

Anti-Circumcision Essay 3 of 3: Now That I Think About It, Is There Actually a Space Between “Info” and “Hazard”? Isn’t It Just One Word?

Harry Stevenage

1 Jul 2024 2:21 UTC

12 points

0 comments 7 min read LW link

In Defense of Lawyers Playing Their Part

Isaac King

1 Jul 2024 1:32 UTC

32 points

9 comments 9 min read LW link

Anti-circumcision Essay 2 of 3: Physical and Psychological Realities

Harry Stevenage

30 Jun 2024 22:13 UTC

12 points

5 comments 9 min read LW link

Review of METR’s public evaluation protocol

nahoj and JaimeRV

30 Jun 2024 22:03 UTC

10 points

0 comments 5 min read LW link