All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28 29 30

Time complexity for deterministic string machines

alcatal21 Apr 2024 22:35 UTC

21 points

0 comments21 min readLW link

Transfer Learning in Humans

niplav21 Apr 2024 20:49 UTC

57 points

1 comment13 min readLW link

I created an Asi Alignment Tier List

TimeGoat21 Apr 2024 18:44 UTC

−6 points

0 comments1 min readLW link

The losing identity of Twitter

Itay Dreyfus21 Apr 2024 13:43 UTC

20 points

1 comment12 min readLW link

(productidentity.co)

Good Bings copy, great Bings steal

dr_s21 Apr 2024 9:52 UTC

31 points

6 comments9 min readLW link

Paper: “The Ethics of Advanced AI Assistants” -Google DeepMind

Tristan Wegner21 Apr 2024 6:45 UTC

20 points

0 comments1 min readLW link

(storage.googleapis.com)

Contra Chord Simplification

jefftk21 Apr 2024 2:30 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

A couple productivity tips for overthinkers

Steven Byrnes20 Apr 2024 16:05 UTC

78 points

13 comments4 min readLW link

“You’re the most beautiful girl in the world” and Wittgensteinian Language Games

Chris_Leong20 Apr 2024 14:54 UTC

5 points

18 comments1 min readLW link

Past Tense Features

Can20 Apr 2024 14:34 UTC

12 points

0 comments4 min readLW link

Thoughts on seed oil

dynomight20 Apr 2024 12:29 UTC

347 points

129 comments17 min readLW link

(dynomight.net)

How to know whether you are an idealist or a physicalist/materialist

JackOfAllTrades20 Apr 2024 11:53 UTC

−3 points

2 comments1 min readLW link

How I Think, Part Four: Money is Weird

Richard Henage20 Apr 2024 6:21 UTC

0 points

3 comments5 min readLW link

The power of finite and the weakness of infinite binary point numbers

AxiomWriter20 Apr 2024 6:03 UTC

−3 points

6 comments2 min readLW link

WISDOMISM A Moral Theory for the Age of Information

Peter lawless 19 Apr 2024 23:06 UTC

2 points

0 comments9 min readLW link

Inducing Unprompted Misalignment in LLMs

Sam Svenningsen, evhub and Henry Sleight

19 Apr 2024 20:00 UTC

38 points

7 comments16 min readLW link

Introspection

A*19 Apr 2024 19:10 UTC

7 points

0 comments1 min readLW link

[Full Post] Progress Update #1 from the GDM Mech Interp Team

Neel Nanda, Arthur Conmy, lewis smith, Senthooran Rajamanoharan, Tom Lieberum, János Kramár and Vikrant Varma

19 Apr 2024 19:06 UTC

77 points

10 comments8 min readLW link

[Summary] Progress Update #1 from the GDM Mech Interp Team

Neel Nanda, Arthur Conmy, lewis smith, Senthooran Rajamanoharan, Tom Lieberum, János Kramár and Vikrant Varma

19 Apr 2024 19:06 UTC

72 points

0 comments3 min readLW link

Daniel Dennett has died (1942-2024)

kave19 Apr 2024 16:17 UTC

150 points

5 comments1 min readLW link

(dailynous.com)

Events Booking New Callers?

jefftk19 Apr 2024 15:50 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

[Question] What is the best way to talk about probabilities you expect to change with evidence/experiments?

Will_Pearson19 Apr 2024 15:35 UTC

14 points

11 comments1 min readLW link

CTMU insight: maybe consciousness can affect quantum outcomes?

zhukeepa19 Apr 2024 15:23 UTC

13 points

11 comments5 min readLW link

Demonstrate and evaluate risks from AI to society at the AI x Democracy research hackathon

Esben Kran19 Apr 2024 14:46 UTC

5 points

0 comments1 min readLW link

(www.apartresearch.com)

[Question] How to Model the Future of Open-Source LLMs?

Joel Burget19 Apr 2024 14:28 UTC

25 points

9 comments1 min readLW link

What’s up with all the non-Mormons? Weirdly specific universalities across LLMs

mwatkins19 Apr 2024 13:43 UTC

40 points

13 comments27 min readLW link

[Question] If digital goods in virtual worlds increase GDP, do we actually become richer?

No77e19 Apr 2024 10:06 UTC

6 points

10 comments1 min readLW link

Experiment on repeating choices

KatjaGrace19 Apr 2024 4:20 UTC

56 points

1 comment3 min readLW link

(worldspiritsockpuppet.com)

Effective Altruists and Rationalists Views & The case for using marketing to highlight AI risks.

gilch19 Apr 2024 4:16 UTC

6 points

1 comment1 min readLW link

(youtu.be)

Cohesion and business problems

Adam Zerner19 Apr 2024 0:45 UTC

12 points

8 comments4 min readLW link

The Thermodynamics of Death

Peter lawless 19 Apr 2024 0:36 UTC

6 points

0 comments10 min readLW link

Backyard Office

jefftk19 Apr 2024 0:31 UTC

13 points

0 comments1 min readLW link

(www.jefftk.com)

hydrogen tube transport

bhauth18 Apr 2024 22:47 UTC

34 points

12 comments5 min readLW link

(www.bhauth.com)

LessOnline Festival Updates Thread

Ben Pace18 Apr 2024 21:55 UTC

59 points

26 comments1 min readLW link

A Review of In-Context Learning Hypotheses for Automated AI Alignment Research

alamerton18 Apr 2024 18:29 UTC

25 points

4 comments16 min readLW link

I’m open for projects (sort of)

cousin_it18 Apr 2024 18:05 UTC

46 points

13 comments1 min readLW link

Blessed information, garbage information, cursed information

tailcalled18 Apr 2024 16:56 UTC

23 points

8 comments3 min readLW link

[Fiction] A Confession

Arjun Panickssery18 Apr 2024 16:28 UTC

38 points

2 comments5 min readLW link

(arjunpanickssery.substack.com)

Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight

Sam Marks18 Apr 2024 16:17 UTC

107 points

10 comments12 min readLW link

Cooperation is optimal, with weaker agents too - tldr

Ryo 18 Apr 2024 15:03 UTC

12 points

22 comments4 min readLW link

(medium.com)

How to coordinate despite our biases? - tldr

Ryo 18 Apr 2024 15:03 UTC

3 points

2 comments3 min readLW link

(medium.com)

Knowledge Base 7: Long-tail knowledge and collective intelligence

iwis18 Apr 2024 14:21 UTC

−6 points

0 comments1 min readLW link

AI #60: Oh the Humanity

Zvi18 Apr 2024 14:10 UTC

44 points

7 comments62 min readLW link

(thezvi.wordpress.com)

UDT1.01: Logical Inductors and Implicit Beliefs (5/10)

Diffractor18 Apr 2024 8:39 UTC

33 points

2 comments19 min readLW link

An examination of GPT-2′s boring yet effective glitch

MiguelDev18 Apr 2024 5:26 UTC

5 points

3 comments3 min readLW link

[Question] What if Ethics is Provably Self-Contradictory?

Yitz18 Apr 2024 5:12 UTC

3 points

7 comments2 min readLW link

The Mom Test: Summary and Thoughts

Adam Zerner18 Apr 2024 3:34 UTC

48 points

3 comments10 min readLW link

Express interest in an “FHI of the West”

habryka18 Apr 2024 3:32 UTC

268 points

41 comments3 min readLW link

Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer

johnswentworth and David Lorell

18 Apr 2024 0:27 UTC

184 points

21 comments7 min readLW link

AXRP Episode 28 - Suing Labs for AI Risk with Gabriel Weil

DanielFilan17 Apr 2024 21:42 UTC

12 points

0 comments65 min readLW link