All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

When does external behaviour imply interal structure?

Tyler Tracy31 May 2024 16:41 UTC

6 points

5 comments7 min readLW link

[Question] We might be dropping the ball on Autonomous Replication and Adaptation.

Charbel-Raphaël and Épiphanie Gédéon

31 May 2024 13:49 UTC

61 points

30 comments4 min readLW link

Tax Cuts and Innovation

Maxwell Tabarrok31 May 2024 12:58 UTC

3 points

0 comments6 min readLW link

(www.maximum-progress.com)

The Gemini 1.5 Report

Zvi31 May 2024 12:20 UTC

18 points

0 comments17 min readLW link

(thezvi.wordpress.com)

Less Anti-Dakka

Mateusz Bagiński31 May 2024 9:07 UTC

23 points

5 comments3 min readLW link

Web-surfing tips for strange times

eukaryote31 May 2024 7:10 UTC

48 points

19 comments9 min readLW link

(eukaryotewritesblog.substack.com)

There Should Be More Alignment-Driven Startups

Vaniver, Judd Rosenblatt, Cameron Berg and phgubbins

31 May 2024 2:05 UTC

60 points

14 comments11 min readLW link

[Question] How likely is it that AI will torture us until the end of time?

Damilo31 May 2024 1:26 UTC

4 points

24 comments2 min readLW link

Twin Peaks: under the air

KatjaGrace31 May 2024 1:20 UTC

25 points

2 comments2 min readLW link

(worldspiritsockpuppet.com)

Is suffering like shit?

KatjaGrace31 May 2024 1:20 UTC

32 points

5 comments1 min readLW link

(worldspiritsockpuppet.com)

Foresight Vision Weekend Europe 2024

Allison Duettmann31 May 2024 0:07 UTC

3 points

0 comments1 min readLW link

[Question] How have analogous Industries solved Interested > Trained > Employed bottlenecks?

yanni kyriacos30 May 2024 23:59 UTC

4 points

1 comment1 min readLW link

Duckbill Masks Better?

jefftk30 May 2024 23:40 UTC

20 points

3 comments1 min readLW link

(www.jefftk.com)

OpenAI: Helen Toner Speaks

Zvi30 May 2024 21:10 UTC

86 points

8 comments13 min readLW link

(thezvi.wordpress.com)

Non-Disparagement Canaries for OpenAI

aysja and Adam Scholl

30 May 2024 19:20 UTC

287 points

51 comments2 min readLW link

Clarifying METR’s Auditing Role

Beth Barnes30 May 2024 18:41 UTC

108 points

1 comment2 min readLW link

A civilization ran by amateurs

Olli Järviniemi30 May 2024 17:57 UTC

61 points

7 comments6 min readLW link

One week left to apply for the Roots of Progress Blog-Building Intensive

jasoncrawford30 May 2024 16:55 UTC

8 points

0 comments3 min readLW link

(rootsofprogress.org)

Getting started with AI Alignment research: how to reproduce an experiment from research paper

Alexander23030 May 2024 14:51 UTC

3 points

0 comments3 min readLW link

AI #66: Oh to Be Less Online

Zvi30 May 2024 14:20 UTC

37 points

6 comments56 min readLW link

(thezvi.wordpress.com)

The 27 papers

WitheringWeights30 May 2024 8:46 UTC

18 points

2 comments1 min readLW link

Help me to become “less wrong”

milanrosko30 May 2024 8:29 UTC

10 points

7 comments2 min readLW link

The Market Singularity: A New Perspective

azsantosk30 May 2024 7:05 UTC

1 point

0 comments15 min readLW link

Awakening

lsusr30 May 2024 7:03 UTC

119 points

79 comments9 min readLW link

Value Claims (In Particular) Are Usually Bullshit

johnswentworth30 May 2024 6:26 UTC

143 points

18 comments2 min readLW link

The Pearly Gates

lsusr30 May 2024 4:01 UTC

111 points

6 comments3 min readLW link

AXRP Episode 32 - Understanding Agency with Jan Kulveit

DanielFilan30 May 2024 3:50 UTC

20 points

0 comments53 min readLW link

US Presidential Election: Tractability, Importance, and Urgency

kuhanj29 May 2024 23:52 UTC

42 points

2 comments3 min readLW link

San Francisco ACX Meetup “First Saturday”

Nate Sternberg29 May 2024 23:42 UTC

2 points

1 comment1 min readLW link

Thoughts on SB-1047

ryan_greenblatt29 May 2024 23:26 UTC

59 points

1 comment11 min readLW link

How I designed my own writing system, VJScript

vkethana29 May 2024 23:18 UTC

2 points

1 comment1 min readLW link

(www.vkethana.com)

AI and integrity

Nathan Young29 May 2024 20:45 UTC

10 points

0 comments2 min readLW link

(nathanpmyoung.substack.com)

MIRI 2024 Communications Strategy

Gretta Duleba29 May 2024 19:33 UTC

320 points

202 comments7 min readLW link

2024 Summer AI Safety Intro Fellowship and Socials in Boston

KevinWei29 May 2024 18:27 UTC

8 points

0 comments1 min readLW link

Apollo Research 1-year update

Marius Hobbhahn, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni, Jérémy Scheurer, Nicholas Goldowsky-Dill, StefanHex, jake_mendel, AlexMeinke and rusheb

29 May 2024 17:44 UTC

93 points

0 comments7 min readLW link

Response to nostalgebraist: proudly waving my moral-antirealist battle flag

Steven Byrnes29 May 2024 16:48 UTC

102 points

29 comments11 min readLW link

Looking beyond Everett in multiversal views of LLMs

kromem29 May 2024 12:35 UTC

10 points

0 comments8 min readLW link

[Question] Inviting discussion of “Beat AI: A contest using philosophical concepts”

David James29 May 2024 11:55 UTC

2 points

1 comment1 min readLW link

AI companies’ commitments

Zach Stein-Perlman29 May 2024 11:00 UTC

36 points

0 comments1 min readLW link

One way violinists fail

Solenoid_Entity29 May 2024 4:08 UTC

33 points

5 comments3 min readLW link

Hardshipification

Jonathan Moregård28 May 2024 20:02 UTC

84 points

17 comments2 min readLW link

(honestliving.substack.com)

When Are Circular Definitions A Problem?

johnswentworth28 May 2024 20:00 UTC

68 points

15 comments3 min readLW link

Notes on Gracefulness

David Gross28 May 2024 18:40 UTC

19 points

2 comments25 min readLW link

[Question] What’s a better term now that “AGI” is too vague?

Seth Herd28 May 2024 18:02 UTC

15 points

9 comments2 min readLW link

Reward hacking behavior can generalize across tasks

Kei, Isaac Dunn, Henry Sleight, Miles Turpin, evhub, Carson Denison and Ethan Perez

28 May 2024 16:33 UTC

78 points

5 comments21 min readLW link

Quick Advice on Writing Essays

Niko_McCarty28 May 2024 15:02 UTC

10 points

0 comments3 min readLW link

(www.nikomccarty.com)

[Linkpost] The Expressive Capacity of State Space Models: A Formal Language Perspective

Bogdan Ionut Cirstea28 May 2024 13:49 UTC

4 points

3 comments1 min readLW link

(arxiv.org)

OpenAI: Fallout

Zvi28 May 2024 13:20 UTC

204 points

25 comments36 min readLW link

(thezvi.wordpress.com)

2024 State of the AI Regulatory Landscape

Deric Cheng and Elliot Mckernon

28 May 2024 11:59 UTC

30 points

0 comments2 min readLW link

(www.convergenceanalysis.org)

Finding Backward Chaining Circuits in Transformers Trained on Tree Search

abhayesian, Jannik Brinkmann and Victor Levoso

28 May 2024 5:29 UTC

50 points

1 comment9 min readLW link

(arxiv.org)