All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Cognitive Work and AI Safety: A Thermodynamic Perspective

Daniel Murfet8 Dec 2024 21:42 UTC

61 points

9 comments4 min readLW link

Causal Undertow: A Work of Seed Fiction

Daniel Murfet8 Dec 2024 21:41 UTC

41 points

0 comments3 min readLW link

Misfortune and Many Worlds

Jonah Wilberg8 Dec 2024 20:25 UTC

10 points

4 comments9 min readLW link

Luck Based Medicine: No Good Very Bad Winter Cured My Hypothyroidism

Elizabeth8 Dec 2024 20:10 UTC

54 points

3 comments2 min readLW link

(acesounderglass.com)

Densing Law of LLMs

Bogdan Ionut Cirstea8 Dec 2024 19:35 UTC

9 points

2 comments1 min readLW link

(arxiv.org)

[Question] Are there ways to artificially fix laziness?

Aidar8 Dec 2024 18:26 UTC

4 points

2 comments1 min readLW link

Fred the Heretic, a GPT for poetry

Bill Benzon8 Dec 2024 16:52 UTC

4 points

0 comments1 min readLW link

Rethink Wellbeing’s Year 2 Update: Foster Sustainable High Performance for Ambitious Altruists

Inga G.8 Dec 2024 14:32 UTC

11 points

1 comment1 min readLW link

Alternatives to Masks for Infectious Aerosols

jefftk8 Dec 2024 14:00 UTC

25 points

9 comments7 min readLW link

(www.jefftk.com)

Parable of the vanilla ice cream curse (and how it would prevent a car from starting!)

Mati_Roy8 Dec 2024 6:57 UTC

89 points

21 comments3 min readLW link

A good way to build many air filters on the cheap

winstonBosan8 Dec 2024 1:47 UTC

14 points

5 comments3 min readLW link

Historical Net Worth

jefftk7 Dec 2024 23:10 UTC

19 points

1 comment1 min readLW link

(www.jefftk.com)

RL, but don’t do anything I wouldn’t do

Gunnar_Zarncke7 Dec 2024 22:54 UTC

63 points

5 comments1 min readLW link

(arxiv.org)

Litigate-for-Impact: Preparing Legal Action against an AGI Frontier Lab Leader

Sonia Joseph7 Dec 2024 21:42 UTC

38 points

7 comments2 min readLW link

Algebraic Linguistics

abstractapplic7 Dec 2024 19:18 UTC

34 points

27 comments5 min readLW link

Paper Highlights, November ’24

gasteigerjo7 Dec 2024 19:15 UTC

7 points

0 comments8 min readLW link

(aisafetyfrontier.substack.com)

Intricacies of Feature Geometry in Large Language Models

7vik, Lucius Bushnaq and Nandi

7 Dec 2024 18:10 UTC

68 points

0 comments12 min readLW link

The Way According To Zvi

Sable7 Dec 2024 17:35 UTC

38 points

3 comments32 min readLW link

(affablyevil.substack.com)

Deep Learning is cheap Solomonoff induction?

Lucius Bushnaq, Kaarel and Dmitry Vaintrob

7 Dec 2024 11:00 UTC

44 points

1 comment17 min readLW link

minifest

Austin Chen7 Dec 2024 3:50 UTC

19 points

1 comment1 min readLW link

Mask and Respirator Intelligibility Comparison

jefftk7 Dec 2024 3:20 UTC

26 points

5 comments1 min readLW link

(www.jefftk.com)

Broadening Horizons: Rethinking Social Mobility Through Skill Diversification

Yanling Guo7 Dec 2024 0:04 UTC

−1 points

0 comments2 min readLW link

Backdoors have universal representations across large language models

Amirali Abdullah, Narmeen, Dhruv Nathawani and nirmalendu prakash

6 Dec 2024 22:56 UTC

14 points

0 comments16 min readLW link

Gradient Routing: Masking Gradients to Localize Computation in Neural Networks

cloud, Jacob G-W, Evzen, Joseph Miller and TurnTrout

6 Dec 2024 22:19 UTC

161 points

12 comments11 min readLW link

(arxiv.org)

Understanding Shapley Values with Venn Diagrams

Carson L6 Dec 2024 21:56 UTC

213 points

34 comments1 min readLW link

(medium.com)

Model Integrity

ryan.lowe, Oliver Klingefjord and Joe Edelman

6 Dec 2024 21:28 UTC

4 points

1 comment18 min readLW link

Can AI improve the current state of molecular simulation?

Abhishaike Mahajan6 Dec 2024 20:22 UTC

5 points

0 comments1 min readLW link

(www.owlposting.com)

Low Temperature Solomonoff Induction

dil-leik-og6 Dec 2024 18:55 UTC

10 points

4 comments11 min readLW link

Experiments are in the territory, results are in the map

Tahp6 Dec 2024 15:44 UTC

5 points

1 comment6 min readLW link

A car journey with conservative evangelicals—Understanding some British political-religious beliefs

Nathan Young6 Dec 2024 11:22 UTC

41 points

8 comments6 min readLW link

(nathanpmyoung.substack.com)

Frontier Models are Capable of In-context Scheming

Marius Hobbhahn, AlexMeinke, Bronson Schoen, rusheb, Jérémy Scheurer and Mikita Balesni

5 Dec 2024 22:11 UTC

203 points

24 comments7 min readLW link

Should you be worried about H5N1?

gw5 Dec 2024 21:11 UTC

89 points

2 comments5 min readLW link

(www.georgeyw.com)

o1 tried to avoid being shut down

Raelifin5 Dec 2024 19:52 UTC

10 points

5 comments1 min readLW link

(www.transformernews.ai)

More Growth, Melancholy, and MindCraft @3QD [revised and updated]

Bill Benzon5 Dec 2024 19:36 UTC

4 points

0 comments4 min readLW link

Expevolu, a laissez-faire approach to country creation

Fernando5 Dec 2024 19:29 UTC

4 points

4 comments44 min readLW link

(expevolu.substack.com)

Are SAE features from the Base Model still meaningful to LLaVA?

Shan23Chen5 Dec 2024 19:24 UTC

5 points

2 comments10 min readLW link

OpenAI o1 + ChatGPT Pro release

anaguma5 Dec 2024 19:13 UTC

5 points

0 comments1 min readLW link

(openai.com)

Smart people should do biology

Haotian5 Dec 2024 19:11 UTC

10 points

2 comments3 min readLW link

Announcement: AI for Math Fund

sarahconstantin5 Dec 2024 18:33 UTC

20 points

9 comments2 min readLW link

(renaissancephilanthropy.org)

Detection of Asymptomatically Spreading Pathogens

jefftk5 Dec 2024 18:20 UTC

45 points

8 comments7 min readLW link

(www.jefftk.com)

Model Integrity: MAI on Value Alignment

Jonas Hallgren5 Dec 2024 17:11 UTC

6 points

11 comments1 min readLW link

(meaningalignment.substack.com)

Social Science in its epistemological context

Arturo Macias5 Dec 2024 16:12 UTC

3 points

0 comments1 min readLW link

(www.theseedsofscience.pub)

Higher and lower pleasures

Chris_Leong5 Dec 2024 13:13 UTC

19 points

3 comments1 min readLW link

Sam Harris’s Argument For Objective Morality

Zero Contradictions5 Dec 2024 10:19 UTC

7 points

5 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Morality as Cooperation Part III: Failure Modes

DeLesley Hutchins5 Dec 2024 9:39 UTC

4 points

0 comments20 min readLW link

Morality as Cooperation Part II: Theory and Experiment

DeLesley Hutchins5 Dec 2024 9:04 UTC

2 points

0 comments17 min readLW link

Morality as Cooperation Part I: Humans

DeLesley Hutchins5 Dec 2024 8:16 UTC

5 points

0 comments19 min readLW link

I Finally Worked Through Bayes’ Theorem (Personal Achievement)

keltan5 Dec 2024 2:04 UTC

51 points

6 comments9 min readLW link

The Dream Machine

sarahconstantin5 Dec 2024 0:00 UTC

117 points

6 comments12 min readLW link

(sarahconstantin.substack.com)

Should you have children? A decision framework for a crucial life choice that affects yourself, your child and the world

Sherrinford4 Dec 2024 23:14 UTC

0 points

1 comment20 min readLW link