Philosophers wrestling with evil, as a social media feed

David Gross, 3 Jun 2024 22:25 UTC
48 points
2 comments, 16 min read

ACI#8: Value as a Function of Possible Worlds

Akira Pyinya, 3 Jun 2024 21:49 UTC
6 points
2 comments, 7 min read

in defense of Linus Pauling

bhauth, 3 Jun 2024 21:27 UTC
49 points
8 comments, 2 min read
(www.bhauth.com)

Finding the estimate of the value of a state in RL agents

3 Jun 2024 20:26 UTC
7 points
4 comments, 4 min read

Searching Magic Cards

jefftk, 3 Jun 2024 17:40 UTC
9 points
2 comments, 1 min read
(www.jefftk.com)

The Standard Analogy

Zack_M_Davis, 3 Jun 2024 17:15 UTC
118 points
28 comments, 12 min read

[Question] How was Less Online for you?

Gordon Seidoh Worley, 3 Jun 2024 17:10 UTC
22 points
4 comments, 1 min read

AI catastrophes and rogue deployments

Buck, 3 Jun 2024 17:04 UTC
119 points
16 comments, 8 min read

Companies’ safety plans neglect risks from scheming AI

Zach Stein-Perlman, 3 Jun 2024 15:00 UTC
73 points
4 comments, 6 min read

ACX Meetup

svfritz, 3 Jun 2024 13:02 UTC
1 point
0 comments, 1 min read

Comments on Anthropic’s Scaling Monosemanticity

Robert_AIZI, 3 Jun 2024 12:15 UTC
97 points
8 comments, 7 min read

Politics is the mind-killer, but maybe we should talk about it anyway

Chris_Leong, 3 Jun 2024 6:37 UTC
14 points
33 comments, 3 min read

[Question] How do you shut down an escaped model?

quetzal_rainbow, 2 Jun 2024 19:51 UTC
15 points
8 comments, 1 min read

How to Better Report Sparse Autoencoder Performance

J Bostock, 2 Jun 2024 19:34 UTC
20 points
4 comments, 3 min read

[Question] List of arguments for Bayesianism

Aryeh Englander, 2 Jun 2024 19:06 UTC
9 points
3 comments, 1 min read

Origins of the Lab Mouse

Niko_McCarty, 2 Jun 2024 15:40 UTC
16 points
0 comments, 20 min read
(press.asimov.com)

Why write down the basics of logic if they are so evident?

Crazy philosopher, 2 Jun 2024 12:02 UTC
3 points
9 comments, 1 min read

How it All Went Down: The Puzzle Hunt that took us way, way Less Online

A*, 2 Jun 2024 8:01 UTC
134 points
5 comments, 5 min read

Simulations and Altruism

FateGrinder, 2 Jun 2024 2:45 UTC
−7 points
2 comments, 25 min read

Scanning your Brain with 100,000,000,000 wires?

Johannes C. Mayer, 1 Jun 2024 18:37 UTC
6 points
6 comments, 2 min read

[Question] Turning latexed notes into blog posts

notfnofn, 1 Jun 2024 18:03 UTC
5 points
2 comments, 1 min read

How do you know you are right when debating? Calculate your AmIRight score.

MrThink, 1 Jun 2024 15:55 UTC
2 points
5 comments, 2 min read

Links for May

Kaj_Sotala, 1 Jun 2024 10:20 UTC
20 points
16 comments, 18 min read
(kajsotala.fi)

[Question] What do coherence arguments actually prove about agentic behavior?

sunwillrise, 1 Jun 2024 9:37 UTC
123 points
35 comments, 6 min read

AI Safety: A Climb To Armageddon?

kmenou, 1 Jun 2024 6:02 UTC
8 points
3 comments, 1 min read
(arxiv.org)

When does external behaviour imply internal structure?

Tyler Tracy, 31 May 2024 16:41 UTC
6 points
5 comments, 7 min read

[Question] We might be dropping the ball on Autonomous Replication and Adaptation.

31 May 2024 13:49 UTC
61 points
30 comments, 4 min read

Tax Cuts and Innovation

Maxwell Tabarrok, 31 May 2024 12:58 UTC
3 points
0 comments, 6 min read
(www.maximum-progress.com)

The Gemini 1.5 Report

Zvi, 31 May 2024 12:20 UTC
18 points
0 comments, 17 min read
(thezvi.wordpress.com)

Less Anti-Dakka

Mateusz Bagiński, 31 May 2024 9:07 UTC
23 points
5 comments, 3 min read

Web-surfing tips for strange times

eukaryote, 31 May 2024 7:10 UTC
48 points
19 comments, 9 min read
(eukaryotewritesblog.substack.com)

There Should Be More Alignment-Driven Startups

31 May 2024 2:05 UTC
60 points
14 comments, 11 min read

[Question] How likely is it that AI will torture us until the end of time?

Damilo, 31 May 2024 1:26 UTC
4 points
24 comments, 2 min read

Twin Peaks: under the air

KatjaGrace, 31 May 2024 1:20 UTC
25 points
2 comments, 2 min read
(worldspiritsockpuppet.com)

Is suffering like shit?

KatjaGrace, 31 May 2024 1:20 UTC
32 points
5 comments, 1 min read
(worldspiritsockpuppet.com)

Foresight Vision Weekend Europe 2024

Allison Duettmann, 31 May 2024 0:07 UTC
3 points
0 comments, 1 min read

[Question] How have analogous Industries solved Interested > Trained > Employed bottlenecks?

yanni kyriacos, 30 May 2024 23:59 UTC
4 points
1 comment, 1 min read

Duckbill Masks Better?

jefftk, 30 May 2024 23:40 UTC
20 points
3 comments, 1 min read
(www.jefftk.com)

OpenAI: Helen Toner Speaks

Zvi, 30 May 2024 21:10 UTC
86 points
8 comments, 13 min read
(thezvi.wordpress.com)

Non-Disparagement Canaries for OpenAI

30 May 2024 19:20 UTC
287 points
51 comments, 2 min read

Clarifying METR’s Auditing Role

Beth Barnes, 30 May 2024 18:41 UTC
108 points
1 comment, 2 min read

A civilization ran by amateurs

Olli Järviniemi, 30 May 2024 17:57 UTC
61 points
7 comments, 6 min read

One week left to apply for the Roots of Progress Blog-Building Intensive

jasoncrawford, 30 May 2024 16:55 UTC
8 points
0 comments, 3 min read
(rootsofprogress.org)

Getting started with AI Alignment research: how to reproduce an experiment from research paper

Alexander230, 30 May 2024 14:51 UTC
3 points
0 comments, 3 min read

AI #66: Oh to Be Less Online

Zvi, 30 May 2024 14:20 UTC
37 points
6 comments, 56 min read
(thezvi.wordpress.com)

The 27 papers

WitheringWeights, 30 May 2024 8:46 UTC
18 points
2 comments, 1 min read

Help me to become “less wrong”

milanrosko, 30 May 2024 8:29 UTC
10 points
7 comments, 2 min read

The Market Singularity: A New Perspective

azsantosk, 30 May 2024 7:05 UTC
1 point
0 comments, 15 min read

Awakening

lsusr, 30 May 2024 7:03 UTC
119 points
79 comments, 9 min read

Value Claims (In Particular) Are Usually Bullshit

johnswentworth, 30 May 2024 6:26 UTC
143 points
18 comments, 2 min read