All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Executable philosophy as a failed totalizing meta-worldview

jessicata4 Sep 2024 22:50 UTC

93 points

40 comments4 min readLW link

(unstableontology.com)

Against Explosive Growth

c.trout4 Sep 2024 21:45 UTC

14 points

1 comment5 min readLW link

The Fragility of Life Hypothesis and the Evolution of Cooperation

KristianRonn4 Sep 2024 21:04 UTC

50 points

6 comments11 min readLW link

Emotion-Informed Valuation Mechanism for Improved AI Alignment in Large Language Models

Javier Marin Valenzuela4 Sep 2024 17:00 UTC

2 points

4 comments6 min readLW link

What happens if you present 500 people with an argument that AI is risky?

KatjaGrace and Nathan Young

4 Sep 2024 16:40 UTC

102 points

7 comments3 min readLW link

(blog.aiimpacts.org)

Automating LLM Auditing with Developmental Interpretability

htlou and evhub

4 Sep 2024 15:50 UTC

17 points

0 comments3 min readLW link

Michael Dickens’ Caffeine Tolerance Research

niplav4 Sep 2024 15:41 UTC

46 points

3 comments2 min readLW link

(mdickens.me)

[Question] Are UV-C Air purifiers so useful?

JohnBuridan4 Sep 2024 14:16 UTC

9 points

0 comments1 min readLW link

AI and the Technological Richter Scale

Zvi4 Sep 2024 14:00 UTC

48 points

8 comments13 min readLW link

(thezvi.wordpress.com)

[Question] Is there any rigorous work on using anthropic uncertainty to prevent situational awareness / deception?

David Scott Krueger (formerly: capybaralet)4 Sep 2024 12:40 UTC

19 points

7 comments1 min readLW link

A Comparison Between The Pragmatosphere And Less Wrong

Zero Contradictions4 Sep 2024 9:39 UTC

−18 points

10 comments2 min readLW link

(zerocontradictions.net)

Announcing the Ultimate Jailbreaking Championship

InnerHufflepuff4 Sep 2024 0:35 UTC

15 points

1 comment1 min readLW link

AI Safety at the Frontier: Paper Highlights, August ’24

gasteigerjo3 Sep 2024 19:17 UTC

28 points

0 comments6 min readLW link

(aisafetyfrontier.substack.com)

The Checklist: What Succeeding at AI Safety Will Involve

Sam Bowman3 Sep 2024 18:18 UTC

142 points

49 comments22 min readLW link

(sleepinyourhat.github.io)

Democracy beyond majoritarianism

Arturo Macias3 Sep 2024 15:10 UTC

5 points

2 comments4 min readLW link

On the UBI Paper

Zvi3 Sep 2024 14:50 UTC

57 points

6 comments19 min readLW link

(thezvi.wordpress.com)

An Opinionated Look at Inference Rules

Gianluca Calcagni3 Sep 2024 13:32 UTC

−5 points

2 comments13 min readLW link

Announcing the PIBBSS Symposium ’24!

DusanDNesic and clem_acs

3 Sep 2024 11:19 UTC

19 points

0 comments3 min readLW link

Reducing global AI competition through the Commerce Control List and Immigration reform: a dual-pronged approach

Ben Smith3 Sep 2024 5:28 UTC

16 points

2 comments1 min readLW link

How I got 4.2M YouTube views without making a single video

Closed Limelike Curves3 Sep 2024 3:52 UTC

376 points

36 comments1 min readLW link

Duped: AI and the Making of a Global Suicide Cult

izzyness2 Sep 2024 18:51 UTC

−8 points

0 comments1 min readLW link

A gentle introduction to sparse autoencoders

Nick Jiang2 Sep 2024 18:11 UTC

9 points

0 comments6 min readLW link

What makes math problems hard for reinforcement learning: a case study

Anibal, Bartek, Sergei, Shehper and Piotr2 Sep 2024 18:11 UTC

1 point

0 comments2 min readLW link

(arxiv.org)

Survey: How Do Elite Chinese Students Feel About the Risks of AI?

Nick Corvino2 Sep 2024 18:11 UTC

141 points

13 comments10 min readLW link

Data-driven donations to help Democrats win federal elections: an update

Michael Cohn2 Sep 2024 16:32 UTC

−1 points

2 comments1 min readLW link

(perplexedguide.net)

[Question] What are the effective utilitarian pros and cons of having children (in rich countries)?

SpectrumDT2 Sep 2024 10:01 UTC

2 points

4 comments1 min readLW link

My decomposition of the alignment problem

Daniel C2 Sep 2024 0:21 UTC

20 points

22 comments13 min readLW link

DC Forecasting & Prediction Markets Meetup

David Glidden2 Sep 2024 0:00 UTC

1 point

0 comments1 min readLW link

A primer on the next generation of antibodies

Abhishaike Mahajan1 Sep 2024 22:37 UTC

25 points

0 comments19 min readLW link

(www.owlposting.com)

[Question] Who looked into extreme nuclear meltdowns?

Remmelt1 Sep 2024 21:38 UTC

2 points

8 comments1 min readLW link

Redundant Attention Heads in Large Language Models For In Context Learning

skunnavakkam1 Sep 2024 20:08 UTC

7 points

1 comment4 min readLW link

(skunnavakkam.github.io)

The Role of Transparency and Explainability in Responsible NLP

RAMEBC781 Sep 2024 20:08 UTC

−3 points

1 comment5 min readLW link

Book Review: What Even Is Gender?

Joey Marcellino1 Sep 2024 16:09 UTC

31 points

14 comments12 min readLW link

Can a Bayesian Oracle Prevent Harm from an Agent? (Bengio et al. 2024)

mattmacdermott1 Sep 2024 7:46 UTC

26 points

0 comments5 min readLW link

(yoshuabengio.org)

San Francisco ACX Meetup “First Saturday”

Nate Sternberg1 Sep 2024 4:48 UTC

2 points

1 comment1 min readLW link

Forecasting One-Shot Games

Raemon31 Aug 2024 23:10 UTC

46 points

0 comments7 min readLW link

On epistemic autonomy

sanyer31 Aug 2024 18:50 UTC

11 points

0 comments2 min readLW link

Epistemic states as a potential benign prior

Tamsin Leake31 Aug 2024 18:26 UTC

31 points

2 comments8 min readLW link

(carado.moe)

My Model of Epistemology

adamShimi31 Aug 2024 17:01 UTC

35 points

0 comments8 min readLW link

(epistemologicalfascinations.substack.com)

Verification methods for international AI agreements

Akash31 Aug 2024 14:58 UTC

14 points

1 comment4 min readLW link

(arxiv.org)

Fake Blog Posts as a Problem Solving Device

silentbob31 Aug 2024 9:22 UTC

7 points

0 comments2 min readLW link

Actually Rational & Kind Sequences Reading Group

segfault 31 Aug 2024 4:21 UTC

−55 points

1 comment1 min readLW link

Anthropic is being sued for copying books to train Claude

Remmelt31 Aug 2024 2:57 UTC

20 points

4 comments2 min readLW link

(fingfx.thomsonreuters.com)

Book review: On the Edge

PeterMcCluskey30 Aug 2024 22:18 UTC

34 points

0 comments9 min readLW link

(bayesianinvestor.com)

Can Large Language Models effectively identify cybersecurity risks?

emile delcourt30 Aug 2024 20:20 UTC

18 points

0 comments11 min readLW link

Singular learning theory: exercises

Zach Furman30 Aug 2024 20:00 UTC

88 points

5 comments14 min readLW link

AI for Bio: State Of The Field

sarahconstantin30 Aug 2024 18:00 UTC

73 points

2 comments15 min readLW link

(sarahconstantin.substack.com)

Multi-Tiered AI

Timothy Bruneau30 Aug 2024 17:46 UTC

1 point

0 comments2 min readLW link

I universally trying to reject the Mind Projection Fallacy—consequences

YanLyutnev30 Aug 2024 17:42 UTC

−4 points

0 comments9 min readLW link

AIS terminology proposal: standardize terms for probability ranges

eggsyntax30 Aug 2024 15:43 UTC

30 points

12 comments2 min readLW link