Lifel­og­ging for Align­ment & Immortality

Dev.Errata17 Aug 2024 23:42 UTC
13 points
3 comments7 min readLW link

Play­ing Minecraft with a Superintelligence

Johannes C. Mayer17 Aug 2024 22:47 UTC
3 points
0 comments2 min readLW link

Does “Ul­ti­mate Neart­er­mism” via Eter­nal In­fla­tion dom­i­nate Longter­mism in ex­pec­ta­tion?

Jordan Arel17 Aug 2024 22:28 UTC
6 points
1 comment3 min readLW link

The causal back­bone conjecture

tailcalled17 Aug 2024 18:50 UTC
26 points
0 comments2 min readLW link

I didn’t have to avoid you; I was just insecure

Chipmonk17 Aug 2024 16:41 UTC
37 points
7 comments2 min readLW link
(chrislakin.blog)

Please sup­port this blog (with money)

Elizabeth17 Aug 2024 15:30 UTC
112 points
3 comments6 min readLW link
(acesounderglass.com)

Re­lease: Op­ti­mal Weave (P1): A Pro­to­type Co­hab­itive Game

mako yass17 Aug 2024 14:08 UTC
82 points
21 comments7 min readLW link

New blog: Ex­pe­di­tion to the Far Lands

Connor Leahy17 Aug 2024 11:07 UTC
28 points
3 comments1 min readLW link
(www.ettf.land)

Ra­tion­al­ists are miss­ing a core piece for agent-like struc­ture (en­ergy vs in­for­ma­tion over­load)

tailcalled17 Aug 2024 9:57 UTC
59 points
9 comments4 min readLW link

Cal­en­dar fea­ture ge­om­e­try in GPT-2 layer 8 resi­d­ual stream SAEs

17 Aug 2024 1:16 UTC
53 points
0 comments5 min readLW link

[un­listed] Benefi­cial ap­pli­ca­tions for cur­rent-level AI in hu­man in­for­ma­tion sys­tems? More likely than you’d think!

mako yass16 Aug 2024 20:49 UTC
11 points
0 comments1 min readLW link

[Question] How un­usual is the fact that there is no AI monopoly?

Viliam16 Aug 2024 20:21 UTC
32 points
15 comments1 min readLW link

The Tech In­dus­try is the Biggest Blocker to Mean­ingful AI Safety Regulations

garrison16 Aug 2024 19:37 UTC
22 points
1 comment1 min readLW link
(garrisonlovely.substack.com)

Prin­ci­pled Satis­fic­ing To Avoid Goodhart

JenniferRM16 Aug 2024 19:05 UTC
45 points
2 comments8 min readLW link

Rewil­d­ing the Gut VS the Au­toim­mune Epidemic

GGD16 Aug 2024 18:00 UTC
51 points
0 comments3 min readLW link

The Bar for Con­tribut­ing to AI Safety is Lower than You Think

Chris_Leong16 Aug 2024 15:20 UTC
20 points
1 comment2 min readLW link

In­ves­ti­gat­ing the Chart of the Cen­tury: Why is food so ex­pen­sive?

Maxwell Tabarrok16 Aug 2024 13:21 UTC
122 points
26 comments3 min readLW link
(www.maximum-progress.com)

Mu­sic in the AI World

Martin Sustrik16 Aug 2024 4:20 UTC
31 points
8 comments1 min readLW link
(250bpm.substack.com)

AGI’s Op­pos­ing Force

SimonBaars16 Aug 2024 4:18 UTC
9 points
2 comments1 min readLW link

[Question] Money Pump Ar­gu­ments as­sume Me­moryless Agents. Isn’t this Un­re­al­is­tic?

Dalcy16 Aug 2024 4:16 UTC
23 points
6 comments1 min readLW link

Recommended

blake808616 Aug 2024 1:49 UTC
2 points
0 comments2 min readLW link
(blakehouseholder.substack.com)

Demis Hass­abis — Google Deep­Mind: The Podcast

Zach Stein-Perlman16 Aug 2024 0:00 UTC
55 points
8 comments3 min readLW link
(www.youtube.com)

Danger, AI Scien­tist, Danger

Zvi15 Aug 2024 22:40 UTC
107 points
9 comments7 min readLW link
(thezvi.wordpress.com)

[Linkpost] ‘The AI Scien­tist: Towards Fully Au­to­mated Open-Ended Scien­tific Dis­cov­ery’

Bogdan Ionut Cirstea15 Aug 2024 21:32 UTC
20 points
1 comment1 min readLW link
(arxiv.org)

On power and its amplification

Ted Sanders15 Aug 2024 20:13 UTC
−1 points
0 comments1 min readLW link

The Over­looked Ne­ces­sity of Com­plete Se­man­tic Rep­re­sen­ta­tion in AI Safety and Alignment

williamsae15 Aug 2024 19:42 UTC
−1 points
0 comments3 min readLW link

My ar­ti­cle in The Na­tion — Cal­ifor­nia’s AI Safety Bill Is a Mask-Off Mo­ment for the Industry

garrison15 Aug 2024 19:25 UTC
35 points
0 comments1 min readLW link
(www.thenation.com)

Cri­tique of ‘Many Peo­ple Fear A.I. They Shouldn’t’ by David Brooks.

Axel Ahlqvist15 Aug 2024 18:38 UTC
12 points
8 comments3 min readLW link

Pri­mary Per­cep­tive Systems

ChristianKl15 Aug 2024 11:26 UTC
14 points
2 comments3 min readLW link

Se­quence overview: Welfare and moral weights

MichaelStJules15 Aug 2024 4:22 UTC
7 points
0 comments1 min readLW link

Fund­ing for pro­grams and events on global catas­trophic risk, effec­tive al­tru­ism, and other topics

14 Aug 2024 23:59 UTC
9 points
0 comments2 min readLW link

Fund­ing for work that builds ca­pac­ity to ad­dress risks from trans­for­ma­tive AI

14 Aug 2024 23:52 UTC
16 points
0 comments5 min readLW link

GPT-2 Some­times Fails at IOI

Ronak_Mehta14 Aug 2024 23:24 UTC
13 points
0 comments2 min readLW link
(ronakrm.github.io)

Toward a Hu­man Hy­brid Lan­guage for En­hanced Hu­man-Ma­chine Com­mu­ni­ca­tion: Ad­dress­ing the AI Align­ment Problem

Andndn Dheudnd14 Aug 2024 22:19 UTC
−6 points
2 comments4 min readLW link

Ad­verse Selec­tion by Life-Sav­ing Char­i­ties

vaishnav9214 Aug 2024 20:46 UTC
41 points
16 comments5 min readLW link
(www.everythingisatrolley.com)

The great Enigma in the sky: The uni­verse as an en­cryp­tion machine

Alex_Shleizer14 Aug 2024 13:21 UTC
4 points
1 comment8 min readLW link

An anti-in­duc­tive sequence

Viliam14 Aug 2024 12:28 UTC
36 points
10 comments3 min readLW link

Rabin’s Paradox

Charlie Steiner14 Aug 2024 5:40 UTC
18 points
40 comments3 min readLW link

An­nounc­ing the $200k EA Com­mu­nity Choice

Austin Chen14 Aug 2024 0:39 UTC
58 points
8 comments1 min readLW link
(manifund.substack.com)

De­bate: Is it eth­i­cal to work at AI ca­pa­bil­ities com­pa­nies?

14 Aug 2024 0:18 UTC
36 points
21 comments11 min readLW link

Fields that I refer­ence when think­ing about AI takeover prevention

Buck13 Aug 2024 23:08 UTC
143 points
16 comments10 min readLW link
(redwoodresearch.substack.com)

Ten counter-ar­gu­ments that AI is (not) an ex­is­ten­tial risk (for now)

Ariel Kwiatkowski13 Aug 2024 22:35 UTC
19 points
5 comments8 min readLW link

Align­ment from equivariance

hamishtodd113 Aug 2024 21:09 UTC
3 points
1 comment5 min readLW link

[LDSL#6] When is quan­tifi­ca­tion needed, and when is it hard?

tailcalled13 Aug 2024 20:39 UTC
31 points
0 comments2 min readLW link

A com­pu­ta­tional com­plex­ity ar­gu­ment for many worlds

jessicata13 Aug 2024 19:35 UTC
32 points
15 comments5 min readLW link
(unstableontology.com)

The Con­scious­ness Co­nun­drum: Why We Can’t Dis­miss Ma­chine Sentience

SystematicApproach13 Aug 2024 18:01 UTC
−21 points
1 comment3 min readLW link

Ten ar­gu­ments that AI is an ex­is­ten­tial risk

13 Aug 2024 17:00 UTC
110 points
41 comments7 min readLW link
(blog.aiimpacts.org)

Eu­gen­ics And Re­pro­duc­tion Li­censes FAQs: For the Com­mon Good

Zero Contradictions13 Aug 2024 16:34 UTC
−8 points
14 comments4 min readLW link
(zerocontradictions.net)

Su­per­in­tel­li­gent AI is pos­si­ble in the 2020s

HunterJay13 Aug 2024 6:03 UTC
41 points
3 comments12 min readLW link

De­bate: Get a col­lege de­gree?

12 Aug 2024 22:23 UTC
42 points
14 comments21 min readLW link