15 Nov 2024 18:55 UTC

201 points

68 comments7 min readLW link

The Case For Giving To The Shrimp Welfare Project

omnizoid15 Nov 2024 16:03 UTC

−6 points

14 comments7 min readLW link

Win/continue/lose scenarios and execute/replace/audit protocols

Buck15 Nov 2024 15:47 UTC

54 points

2 comments7 min readLW link

Antonym Heads Predict Semantic Opposites in Language Models

Jake Ward15 Nov 2024 15:32 UTC

3 points

0 comments5 min readLW link

Proposing the Conditional AI Safety Treaty (linkpost TIME)

otto.barten15 Nov 2024 13:59 UTC

10 points

8 comments3 min readLW link

(time.com)

A Theory of Equilibrium in the Offense-Defense Balance

Maxwell Tabarrok15 Nov 2024 13:51 UTC

25 points

6 comments2 min readLW link

(www.maximum-progress.com)

Boston Secular Solstice 2024: Call for Singers and Musicans

jefftk15 Nov 2024 13:50 UTC

22 points

0 comments1 min readLW link

(www.jefftk.com)

An Uncanny Moat

Adam Newgas15 Nov 2024 11:39 UTC

8 points

0 comments4 min readLW link

(www.boristhebrave.com)

[Question] What are some positive developments in AI safety in 2024?

Satron15 Nov 2024 10:32 UTC

10 points

5 comments1 min readLW link

If I care about measure, choices have additional burden (+AI generated LW-comments)

avturchin15 Nov 2024 10:27 UTC

5 points

11 comments2 min readLW link

What are Emotions?

Myles H15 Nov 2024 4:20 UTC

4 points

13 comments8 min readLW link

The Third Fundamental Question

Screwtape15 Nov 2024 4:01 UTC

66 points

7 comments6 min readLW link

Dance Differentiation

jefftk15 Nov 2024 2:30 UTC

14 points

0 comments1 min readLW link

(www.jefftk.com)

Breaking beliefs about saving the world

Oxidize15 Nov 2024 0:46 UTC

2 points

3 comments9 min readLW link

College technical AI safety hackathon retrospective—Georgia Tech

yix15 Nov 2024 0:22 UTC

39 points

2 comments5 min readLW link

(open.substack.com)

Gwern Branwen interview on Dwarkesh Patel’s podcast: “How an Anonymous Researcher Predicted AI’s Trajectory”

Said Achmiz14 Nov 2024 23:53 UTC

80 points

0 comments1 min readLW link

(www.dwarkeshpatel.com)

Internal music player: phenomenology of earworms

dkl914 Nov 2024 23:29 UTC

6 points

4 comments2 min readLW link

(dkl9.net)

The Foraging (Ex-)Bandit [Ruleset & Reflections]

abstractapplic14 Nov 2024 20:16 UTC

27 points

3 comments2 min readLW link

Seven lessons I didn’t learn from election day

Eric Neyman14 Nov 2024 18:39 UTC

97 points

33 comments13 min readLW link

(ericneyman.wordpress.com)

Effects of Non-Uniform Sparsity on Superposition in Toy Models

Shreyans Jain14 Nov 2024 16:59 UTC

4 points

3 comments6 min readLW link

AI #90: The Wall

Zvi14 Nov 2024 14:10 UTC

32 points

6 comments42 min readLW link

(thezvi.wordpress.com)

Evolutionary prompt optimization for SAE feature visualization

neverix, Daniel Tan, Dmitrii Kharlapenko, Neel Nanda and Arthur Conmy

14 Nov 2024 13:06 UTC

16 points

0 comments9 min readLW link

AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems

DanielFilan14 Nov 2024 7:00 UTC

14 points

0 comments12 min readLW link

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

Tamay14 Nov 2024 6:13 UTC

39 points

0 comments3 min readLW link

(epoch.ai)

Concrete Methods for Heuristic Estimation on Neural Networks

Oliver Daniels14 Nov 2024 5:07 UTC

28 points

0 comments27 min readLW link

Heresies in the Shadow of the Sequences

Cole Wyeth14 Nov 2024 5:01 UTC

17 points

12 comments2 min readLW link

literally Hitler

David Gross14 Nov 2024 3:20 UTC

−13 points

0 comments4 min readLW link

Thoughts after the Wolfram and Yudkowsky discussion

Tahp14 Nov 2024 1:43 UTC

25 points

13 comments6 min readLW link

[Question] Why would ASI share any resources with us?

Satron13 Nov 2024 23:38 UTC

6 points

8 comments1 min readLW link

Neutrality

sarahconstantin13 Nov 2024 23:10 UTC

158 points

27 comments11 min readLW link

(sarahconstantin.substack.com)

Anvil Problems

Screwtape13 Nov 2024 22:57 UTC

89 points

13 comments3 min readLW link

[Question] Using hex to get murder advice from GPT-4o

Laurence Freeman13 Nov 2024 18:30 UTC

10 points

5 comments6 min readLW link

Confronting the legion of doom.

Spiritus Dei13 Nov 2024 17:03 UTC

−18 points

2 comments5 min readLW link

Is Deep Learning Actually Hitting a Wall? Evaluating Ilya Sutskever’s Recent Claims

garrison13 Nov 2024 17:00 UTC

84 points

14 comments1 min readLW link

(garrisonlovely.substack.com)

MIT FutureTech are hiring ‍a Product and Data Visualization Designer

peterslattery13 Nov 2024 14:48 UTC

2 points

0 comments4 min readLW link

Sparks of Consciousness

Charlie Sanders13 Nov 2024 4:58 UTC

2 points

0 comments3 min readLW link

(www.dailymicrofiction.com)

Contra Musician Gender II

jefftk13 Nov 2024 3:30 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Flipping Out: The Cosmic Coinflip Thought Experiment Is Bad Philosophy

Joe Rogero12 Nov 2024 23:55 UTC

34 points

17 comments4 min readLW link

Incentive design and capability elicitation

Joe Carlsmith12 Nov 2024 20:56 UTC

31 points

0 comments12 min readLW link

The Humanitarian Economy

kylefurlong12 Nov 2024 18:25 UTC

−7 points

14 comments6 min readLW link

Current Attitudes Toward AI Provide Little Data Relevant to Attitudes Toward AGI

Seth Herd12 Nov 2024 18:23 UTC

16 points

2 comments4 min readLW link

Basics of Handling Disagreements with People

Camille Berger 12 Nov 2024 17:55 UTC

34 points

4 comments6 min readLW link

Registrations Open for 2024 NYC Secular Solstice & Megameetup

Joe Rogero and Screwtape

12 Nov 2024 17:50 UTC

13 points

0 comments1 min readLW link

2024 NYC Secular Solstice & Megameetup

Joe Rogero and Screwtape

12 Nov 2024 17:46 UTC

18 points

0 comments1 min readLW link

2025 Q1 Pivotal Research Fellowship (Technical & Policy)

Tobias H and tilmanr

12 Nov 2024 10:56 UTC

6 points

0 comments2 min readLW link

Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms

Lorec12 Nov 2024 6:45 UTC

5 points

5 comments8 min readLW link

The lying p value

kqr12 Nov 2024 6:12 UTC

13 points

7 comments1 min readLW link

(entropicthoughts.com)

Modeling AI-driven occupational change over the next 10 years and beyond

2120eth12 Nov 2024 4:58 UTC

1 point

0 comments2 min readLW link

How to Live Well: My Philosophy of Life

Philosofer12312 Nov 2024 4:05 UTC

−5 points

2 comments1 min readLW link

The Packaging and the Payload

Screwtape12 Nov 2024 3:07 UTC

76 points

1 comment5 min readLW link