Panology

JenniferRM · 23 Dec 2024 21:40 UTC

11 points

8 comments · 5 min read · LW link

Aristotle, Aquinas, and the Evolution of Teleology: From Purpose to Meaning.

Spiritus Dei · 23 Dec 2024 19:37 UTC

−7 points

0 comments · 6 min read · LW link

People aren’t properly calibrated on FrontierMath

cakubilo · 23 Dec 2024 19:35 UTC

30 points

4 comments · 3 min read · LW link

Near- and medium-term AI Control Safety Cases

Martín Soto · 23 Dec 2024 17:37 UTC

9 points

0 comments · 6 min read · LW link

[Rationality Malaysia] 2024 year-end meetup!

Doris Liew · 23 Dec 2024 16:02 UTC

1 point

0 comments · 1 min read · LW link

Printable book of some rationalist creative writing (from Scott A. & Eliezer)

CounterBlunder · 23 Dec 2024 15:44 UTC

5 points

0 comments · 1 min read · LW link

Monthly Roundup #25: December 2024

Zvi · 23 Dec 2024 14:20 UTC

18 points

3 comments · 26 min read · LW link

(thezvi.wordpress.com)

Exploring the petertodd / Leilan duality in GPT-2 and GPT-J

mwatkins · 23 Dec 2024 13:17 UTC

10 points

0 comments · 17 min read · LW link

[Question] What are the strongest arguments for very short timelines?

Kaj_Sotala · 23 Dec 2024 9:38 UTC

94 points

73 comments · 1 min read · LW link

Reduce AI Self-Allegiance by saying “he” instead of “I”

Knight Lee · 23 Dec 2024 9:32 UTC

6 points

4 comments · 2 min read · LW link

Funding Case: AI Safety Camp 11

Remmelt, Robert Kralisch and Linda Linsefors · 23 Dec 2024 8:51 UTC

23 points

0 comments · 6 min read · LW link

(manifund.org)

What is compute governance?

Vishakha · 23 Dec 2024 6:32 UTC

6 points

0 comments · 2 min read · LW link

(aisafety.info)

Stop Making Sense

JenniferRM · 23 Dec 2024 5:16 UTC

15 points

0 comments · 3 min read · LW link

Hire (or Become) a Thinking Assistant

Raemon · 23 Dec 2024 3:58 UTC

119 points

42 comments · 8 min read · LW link

Non-Obvious Benefits of Insurance

jefftk · 23 Dec 2024 3:40 UTC

21 points

5 comments · 2 min read · LW link

(www.jefftk.com)

Vision of a positive Singularity

RussellThor · 23 Dec 2024 2:19 UTC

4 points

0 comments · 4 min read · LW link

Ideologies are slow and necessary, for now

Gabriel Alfour · 23 Dec 2024 1:57 UTC

9 points

1 comment · 1 min read · LW link

(cognition.cafe)

Propaganda Is Everywhere—LLM Models Are No Exception

Yanling Guo · 23 Dec 2024 1:39 UTC

−13 points

0 comments · 3 min read · LW link

[Question] Has Anthropic checked if Claude fakes alignment for intended values too?

Maloew · 23 Dec 2024 0:43 UTC

4 points

1 comment · 1 min read · LW link

Vegans need to eat just enough Meat—emperically evaluate the minimum ammount of meat that maximizes utility

Johannes C. Mayer · 22 Dec 2024 22:08 UTC

55 points

34 comments · 3 min read · LW link

We are in a New Paradigm of AI Progress—OpenAI’s o3 model makes huge gains on the toughest AI benchmarks in the world

garrison · 22 Dec 2024 21:45 UTC

17 points

3 comments · 1 min read · LW link

(garrisonlovely.substack.com)

My AI timelines

xpostah · 22 Dec 2024 21:06 UTC

12 points

2 comments · 5 min read · LW link

(samuelshadrach.com)

A breakdown of AI capability levels focused on AI R&D labor acceleration

ryan_greenblatt · 22 Dec 2024 20:56 UTC

92 points

5 comments · 6 min read · LW link

How I saved 1 human life (in expectation) without overthinking it

Christopher King · 22 Dec 2024 20:53 UTC

14 points

0 comments · 4 min read · LW link

Towards mutually assured cooperation

mikko · 22 Dec 2024 20:46 UTC

4 points

0 comments · 2 min read · LW link

Checking in on Scott’s composition image bet with imagen 3

Dave Orr · 22 Dec 2024 19:04 UTC

61 points

0 comments · 1 min read · LW link

Woloch & Wosatan

JackOfAllTrades · 22 Dec 2024 15:46 UTC

−11 points

0 comments · 2 min read · LW link

A primer on machine learning in cryo-electron microscopy (cryo-EM)

Abhishaike Mahajan · 22 Dec 2024 15:11 UTC

17 points

0 comments · 25 min read · LW link

(www.owlposting.com)

Notes from Copenhagen Secular Solstice 2024

Søren Elverlin · 22 Dec 2024 15:08 UTC

9 points

0 comments · 3 min read · LW link

Proof Explained for “Robust Agents Learn Causal World Model”

Dalcy · 22 Dec 2024 15:06 UTC

18 points

0 comments · 15 min read · LW link

subfunctional overlaps in attentional selection history implies momentum for decision-trajectories

Emrik · 22 Dec 2024 14:12 UTC

19 points

1 comment · 2 min read · LW link

It looks like there are some good funding opportunities in AI safety right now

Benjamin_Todd · 22 Dec 2024 12:41 UTC

20 points

0 comments · 4 min read · LW link

(benjamintodd.substack.com)

What o3 Becomes by 2028

Vladimir_Nesov · 22 Dec 2024 12:37 UTC

123 points

15 comments · 5 min read · LW link

The Alignment Simulator

Yair Halberstadt · 22 Dec 2024 11:45 UTC

24 points

3 comments · 2 min read · LW link

(yairhalberstadt.github.io)

Theoretical Alignment’s Second Chance

lunatic_at_large · 22 Dec 2024 5:03 UTC

19 points

0 comments · 2 min read · LW link

Orienting to 3 year AGI timelines

Nikola Jurkovic · 22 Dec 2024 1:15 UTC

218 points

37 comments · 8 min read · LW link

ARC-AGI is a genuine AGI test but o3 cheated :(

Knight Lee · 22 Dec 2024 0:58 UTC

0 points

2 comments · 2 min read · LW link

When AI 10x’s AI R&D, What Do We Do?

Logan Riggs · 21 Dec 2024 23:56 UTC

70 points

14 comments · 4 min read · LW link

AI as systems, not just models

Andy Arditi · 21 Dec 2024 23:19 UTC

24 points

0 comments · 7 min read · LW link

(andyrdt.com)

Towards a Unified Interpretability of Artificial and Biological Neural Networks

jan_bauer · 21 Dec 2024 23:10 UTC

1 point

0 comments · 1 min read · LW link

Robbin’s Farm Sledding Route

jefftk · 21 Dec 2024 22:10 UTC

13 points

1 comment · 1 min read · LW link

(www.jefftk.com)

AGI with RL is Bad News for Safety

Nadav Brandes · 21 Dec 2024 19:36 UTC

19 points

22 comments · 2 min read · LW link

Better difference-making views

MichaelStJules · 21 Dec 2024 18:27 UTC

7 points

0 comments · 1 min read · LW link

Review: Good Strategy, Bad Strategy

L Rudolf L · 21 Dec 2024 17:17 UTC

40 points

0 comments · 23 min read · LW link

(nosetgauge.substack.com)

Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria

Ulrik Horn · 21 Dec 2024 8:28 UTC

11 points

19 comments · 21 min read · LW link

Elon Musk and Solar Futurism

transhumanist_atom_understander · 21 Dec 2024 2:55 UTC

22 points

27 comments · 5 min read · LW link

Good Reasons for Alts

jefftk · 21 Dec 2024 1:30 UTC

24 points

2 comments · 1 min read · LW link

(www.jefftk.com)

Updating on Bad Arguments

Guive · 21 Dec 2024 1:19 UTC

10 points

2 comments · 2 min read · LW link

(guive.substack.com)

Bird’s eye view: An interactive representation to see large collection of text “from above”.

Alexandre Variengien · 21 Dec 2024 0:15 UTC

10 points

4 comments · 5 min read · LW link

(alexandrevariengien.com)

The nihilism of NeurIPS

charlieoneill · 20 Dec 2024 23:58 UTC

98 points

7 comments · 4 min read · LW link