Funny Anecdote of Eliezer From His Sister

Noah Birnbaum 22 Apr 2024 22:05 UTC

202 points

6 comments 2 min read LW link

How LLMs Work, in the Style of The Economist

utilistrutil 22 Apr 2024 19:06 UTC

0 points

0 comments 2 min read LW link

Measuring Coherence and Goal-Directedness in RL Policies

dx26 22 Apr 2024 18:26 UTC

10 points

0 comments 7 min read LW link

AI Regulation is Unsafe

Maxwell Tabarrok 22 Apr 2024 16:37 UTC

40 points

41 comments 4 min read LW link

(www.maximum-progress.com)

Priors and Prejudice

MathiasKB 22 Apr 2024 15:00 UTC

150 points

31 comments 7 min read LW link

Forget Everything (Statistical Mechanics Part 1)

J Bostock 22 Apr 2024 13:33 UTC

39 points

6 comments 3 min read LW link

On Llama-3 and Dwarkesh Patel’s Podcast with Zuckerberg

Zvi 22 Apr 2024 13:10 UTC

63 points

4 comments 47 min read LW link

(thezvi.wordpress.com)

Motivation gaps: Why so much EA criticism is hostile and lazy

titotal 22 Apr 2024 11:49 UTC

69 points

5 comments 1 min read LW link

(titotal.substack.com)

Should we break up Google DeepMind?

Hauke Hillebrandt 22 Apr 2024 9:16 UTC

3 points

0 comments 1 min read LW link

What should our containers do?

Richard Henage 22 Apr 2024 6:17 UTC

1 point

1 comment 2 min read LW link

Goal oriented cognition in “a single forward pass”

dxu and habryka

22 Apr 2024 5:03 UTC

20 points

15 comments 26 min read LW link

Time complexity for deterministic string machines

alcatal 21 Apr 2024 22:35 UTC

21 points

0 comments 21 min read LW link

Transfer Learning in Humans

niplav 21 Apr 2024 20:49 UTC

57 points

1 comment 13 min read LW link

I created an Asi Alignment Tier List

TimeGoat 21 Apr 2024 18:44 UTC

−6 points

0 comments 1 min read LW link

The losing identity of Twitter

Itay Dreyfus 21 Apr 2024 13:43 UTC

20 points

1 comment 12 min read LW link

(productidentity.co)

Good Bings copy, great Bings steal

dr_s 21 Apr 2024 9:52 UTC

31 points

6 comments 9 min read LW link

Paper: “The Ethics of Advanced AI Assistants” -Google DeepMind

Tristan Wegner 21 Apr 2024 6:45 UTC

20 points

0 comments 1 min read LW link

(storage.googleapis.com)

Contra Chord Simplification

jefftk 21 Apr 2024 2:30 UTC

9 points

0 comments 1 min read LW link

(www.jefftk.com)

A couple productivity tips for overthinkers

Steven Byrnes 20 Apr 2024 16:05 UTC

78 points

13 comments 4 min read LW link

“You’re the most beautiful girl in the world” and Wittgensteinian Language Games

Chris_Leong 20 Apr 2024 14:54 UTC

5 points

18 comments 1 min read LW link

Past Tense Features

Can 20 Apr 2024 14:34 UTC

12 points

0 comments 4 min read LW link

Thoughts on seed oil

dynomight 20 Apr 2024 12:29 UTC

347 points

129 comments 17 min read LW link

(dynomight.net)

How to know whether you are an idealist or a physicalist/materialist

JackOfAllTrades 20 Apr 2024 11:53 UTC

−3 points

2 comments 1 min read LW link

How I Think, Part Four: Money is Weird

Richard Henage 20 Apr 2024 6:21 UTC

0 points

3 comments 5 min read LW link

The power of finite and the weakness of infinite binary point numbers

AxiomWriter 20 Apr 2024 6:03 UTC

−3 points

6 comments 2 min read LW link

WISDOMISM A Moral Theory for the Age of Information

Peter lawless 19 Apr 2024 23:06 UTC

2 points

0 comments 9 min read LW link

Inducing Unprompted Misalignment in LLMs

Sam Svenningsen, evhub and Henry Sleight

19 Apr 2024 20:00 UTC

38 points

6 comments 16 min read LW link

Introspection

A* 19 Apr 2024 19:10 UTC

7 points

0 comments 1 min read LW link

[Full Post] Progress Update #1 from the GDM Mech Interp Team

Neel Nanda, Arthur Conmy, lewis smith, Senthooran Rajamanoharan, Tom Lieberum, János Kramár and Vikrant Varma

19 Apr 2024 19:06 UTC

77 points

10 comments 8 min read LW link

[Summary] Progress Update #1 from the GDM Mech Interp Team

Neel Nanda, Arthur Conmy, lewis smith, Senthooran Rajamanoharan, Tom Lieberum, János Kramár and Vikrant Varma

19 Apr 2024 19:06 UTC

72 points

0 comments 3 min read LW link

Daniel Dennett has died (1942-2024)

kave 19 Apr 2024 16:17 UTC

150 points

5 comments 1 min read LW link

(dailynous.com)

Events Booking New Callers?

jefftk 19 Apr 2024 15:50 UTC

9 points

0 comments 1 min read LW link

(www.jefftk.com)

[Question] What is the best way to talk about probabilities you expect to change with evidence/experiments?

Will_Pearson 19 Apr 2024 15:35 UTC

14 points

11 comments 1 min read LW link

CTMU insight: maybe consciousness can affect quantum outcomes?

zhukeepa 19 Apr 2024 15:23 UTC

12 points

11 comments 5 min read LW link

Demonstrate and evaluate risks from AI to society at the AI x Democracy research hackathon

Esben Kran 19 Apr 2024 14:46 UTC

5 points

0 comments 1 min read LW link

(www.apartresearch.com)

[Question] How to Model the Future of Open-Source LLMs?

Joel Burget 19 Apr 2024 14:28 UTC

25 points

9 comments 1 min read LW link

What’s up with all the non-Mormons? Weirdly specific universalities across LLMs

mwatkins 19 Apr 2024 13:43 UTC

40 points

13 comments 27 min read LW link

[Question] If digital goods in virtual worlds increase GDP, do we actually become richer?

No77e 19 Apr 2024 10:06 UTC

6 points

10 comments 1 min read LW link

Experiment on repeating choices

KatjaGrace 19 Apr 2024 4:20 UTC

56 points

1 comment 3 min read LW link

(worldspiritsockpuppet.com)

Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.

gilch 19 Apr 2024 4:16 UTC

6 points

1 comment 1 min read LW link

(youtu.be)

Cohesion and business problems

Adam Zerner 19 Apr 2024 0:45 UTC

12 points

8 comments 4 min read LW link

The Thermodynamics of Death

Peter lawless 19 Apr 2024 0:36 UTC

6 points

0 comments 10 min read LW link

Backyard Office

jefftk 19 Apr 2024 0:31 UTC

13 points

0 comments 1 min read LW link

(www.jefftk.com)

hydrogen tube transport

bhauth 18 Apr 2024 22:47 UTC

34 points

12 comments 5 min read LW link

(www.bhauth.com)

LessOnline Festival Updates Thread

Ben Pace 18 Apr 2024 21:55 UTC

59 points

26 comments 1 min read LW link

A Review of In-Context Learning Hypotheses for Automated AI Alignment Research

alamerton 18 Apr 2024 18:29 UTC

25 points

4 comments 16 min read LW link

I’m open for projects (sort of)

cousin_it 18 Apr 2024 18:05 UTC

46 points

13 comments 1 min read LW link

Blessed information, garbage information, cursed information

tailcalled 18 Apr 2024 16:56 UTC

23 points

8 comments 3 min read LW link

[Fiction] A Confession

Arjun Panickssery 18 Apr 2024 16:28 UTC

38 points

2 comments 5 min read LW link

(arjunpanickssery.substack.com)

Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight

Sam Marks 18 Apr 2024 16:17 UTC

107 points

10 comments 12 min read LW link