Philosophers wrestling with evil, as a social media feed

David Gross

3 Jun 2024 22:25 UTC

48 points

2 comments, 16 min read

ACI#8: Value as a Function of Possible Worlds

Akira Pyinya

3 Jun 2024 21:49 UTC

6 points

2 comments, 7 min read

in defense of Linus Pauling

bhauth

3 Jun 2024 21:27 UTC

49 points

8 comments, 2 min read

(www.bhauth.com)

Finding the estimate of the value of a state in RL agents

Clément Dumas, Walter Laurito, KlaRo and Kaarel

3 Jun 2024 20:26 UTC

7 points

4 comments, 4 min read

Searching Magic Cards

jefftk

3 Jun 2024 17:40 UTC

9 points

2 comments, 1 min read

(www.jefftk.com)

The Standard Analogy

Zack_M_Davis

3 Jun 2024 17:15 UTC

118 points

28 comments, 12 min read

[Question] How was Less Online for you?

Gordon Seidoh Worley

3 Jun 2024 17:10 UTC

22 points

4 comments, 1 min read

AI catastrophes and rogue deployments

Buck

3 Jun 2024 17:04 UTC

119 points

16 comments, 8 min read

Companies’ safety plans neglect risks from scheming AI

Zach Stein-Perlman

3 Jun 2024 15:00 UTC

73 points

4 comments, 6 min read

ACX Meetup

svfritz

3 Jun 2024 13:02 UTC

1 point

0 comments, 1 min read

Comments on Anthropic’s Scaling Monosemanticity

Robert_AIZI

3 Jun 2024 12:15 UTC

97 points

8 comments, 7 min read

Politics is the mind-killer, but maybe we should talk about it anyway

Chris_Leong

3 Jun 2024 6:37 UTC

14 points

33 comments, 3 min read

[Question] How do you shut down an escaped model?

quetzal_rainbow

2 Jun 2024 19:51 UTC

15 points

8 comments, 1 min read

How to Better Report Sparse Autoencoder Performance

J Bostock

2 Jun 2024 19:34 UTC

20 points

4 comments, 3 min read

[Question] List of arguments for Bayesianism

Aryeh Englander

2 Jun 2024 19:06 UTC

9 points

3 comments, 1 min read

Origins of the Lab Mouse

Niko_McCarty

2 Jun 2024 15:40 UTC

16 points

0 comments, 20 min read

(press.asimov.com)

Why write down the basics of logic if they are so evident?

Crazy philosopher

2 Jun 2024 12:02 UTC

3 points

9 comments, 1 min read

How it All Went Down: The Puzzle Hunt that took us way, way Less Online

A*

2 Jun 2024 8:01 UTC

134 points

5 comments, 5 min read

Simulations and Altruism

FateGrinder

2 Jun 2024 2:45 UTC

−7 points

2 comments, 25 min read

Scanning your Brain with 100,000,000,000 wires?

Johannes C. Mayer

1 Jun 2024 18:37 UTC

6 points

6 comments, 2 min read

[Question] Turning latexed notes into blog posts

notfnofn

1 Jun 2024 18:03 UTC

5 points

2 comments, 1 min read

How do you know you are right when debating? Calculate your AmIRight score.

MrThink

1 Jun 2024 15:55 UTC

2 points

5 comments, 2 min read

Links for May

Kaj_Sotala

1 Jun 2024 10:20 UTC

20 points

16 comments, 18 min read

(kajsotala.fi)

[Question] What do coherence arguments actually prove about agentic behavior?

sunwillrise

1 Jun 2024 9:37 UTC

123 points

35 comments, 6 min read

AI Safety: A Climb To Armageddon?

kmenou

1 Jun 2024 6:02 UTC

8 points

3 comments, 1 min read

(arxiv.org)

When does external behaviour imply internal structure?

Tyler Tracy

31 May 2024 16:41 UTC

6 points

5 comments, 7 min read

[Question] We might be dropping the ball on Autonomous Replication and Adaptation.

Charbel-Raphaël and Épiphanie Gédéon

31 May 2024 13:49 UTC

61 points

30 comments, 4 min read

Tax Cuts and Innovation

Maxwell Tabarrok

31 May 2024 12:58 UTC

3 points

0 comments, 6 min read

(www.maximum-progress.com)

The Gemini 1.5 Report

Zvi

31 May 2024 12:20 UTC

18 points

0 comments, 17 min read

(thezvi.wordpress.com)

Less Anti-Dakka

Mateusz Bagiński

31 May 2024 9:07 UTC

23 points

5 comments, 3 min read

Web-surfing tips for strange times

eukaryote

31 May 2024 7:10 UTC

48 points

19 comments, 9 min read

(eukaryotewritesblog.substack.com)

There Should Be More Alignment-Driven Startups

Vaniver, Judd Rosenblatt, Cameron Berg and phgubbins

31 May 2024 2:05 UTC

60 points

14 comments, 11 min read

[Question] How likely is it that AI will torture us until the end of time?

Damilo

31 May 2024 1:26 UTC

4 points

24 comments, 2 min read

Twin Peaks: under the air

KatjaGrace

31 May 2024 1:20 UTC

25 points

2 comments, 2 min read

(worldspiritsockpuppet.com)

Is suffering like shit?

KatjaGrace

31 May 2024 1:20 UTC

32 points

5 comments, 1 min read

(worldspiritsockpuppet.com)

Foresight Vision Weekend Europe 2024

Allison Duettmann

31 May 2024 0:07 UTC

3 points

0 comments, 1 min read

[Question] How have analogous Industries solved Interested > Trained > Employed bottlenecks?

yanni kyriacos

30 May 2024 23:59 UTC

4 points

1 comment, 1 min read

Duckbill Masks Better?

jefftk

30 May 2024 23:40 UTC

20 points

3 comments, 1 min read

(www.jefftk.com)

OpenAI: Helen Toner Speaks

Zvi

30 May 2024 21:10 UTC

86 points

8 comments, 13 min read

(thezvi.wordpress.com)

Non-Disparagement Canaries for OpenAI

aysja and Adam Scholl

30 May 2024 19:20 UTC

287 points

51 comments, 2 min read

Clarifying METR’s Auditing Role

Beth Barnes

30 May 2024 18:41 UTC

108 points

1 comment, 2 min read

A civilization ran by amateurs

Olli Järviniemi

30 May 2024 17:57 UTC

61 points

7 comments, 6 min read

One week left to apply for the Roots of Progress Blog-Building Intensive

jasoncrawford

30 May 2024 16:55 UTC

8 points

0 comments, 3 min read

(rootsofprogress.org)

Getting started with AI Alignment research: how to reproduce an experiment from research paper

Alexander230

30 May 2024 14:51 UTC

3 points

0 comments, 3 min read

AI #66: Oh to Be Less Online

Zvi

30 May 2024 14:20 UTC

37 points

6 comments, 56 min read

(thezvi.wordpress.com)

The 27 papers

WitheringWeights

30 May 2024 8:46 UTC

18 points

2 comments, 1 min read

Help me to become “less wrong”

milanrosko

30 May 2024 8:29 UTC

10 points

7 comments, 2 min read

The Market Singularity: A New Perspective

azsantosk

30 May 2024 7:05 UTC

1 point

0 comments, 15 min read

Awakening

lsusr

30 May 2024 7:03 UTC

119 points

79 comments, 9 min read

Value Claims (In Particular) Are Usually Bullshit

johnswentworth

30 May 2024 6:26 UTC

143 points

18 comments, 2 min read