Catas­trophic Risks from AI #6: Dis­cus­sion and FAQ

27 Jun 2023 23:23 UTC
24 points
1 comment13 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #5: Rogue AIs

27 Jun 2023 22:06 UTC
15 points
0 comments22 min readLW link
(arxiv.org)

AISN #12: Policy Pro­pos­als from NTIA’s Re­quest for Com­ment and Re­con­sid­er­ing In­stru­men­tal Convergence

Dan H27 Jun 2023 17:20 UTC
6 points
0 comments1 min readLW link

The Weight of the Fu­ture (Why The Apoca­lypse Can Be A Relief)

Sable27 Jun 2023 17:18 UTC
18 points
14 comments3 min readLW link
(affablyevil.substack.com)

Align­ing AI by op­ti­miz­ing for “wis­dom”

27 Jun 2023 15:20 UTC
27 points
8 comments12 min readLW link

Free­dom un­der Nat­u­ral­is­tic Dualism

Arturo Macias27 Jun 2023 14:34 UTC
1 point
36 comments1 min readLW link
(www.jneurophilosophy.com)

Munk AI de­bate: con­fu­sions and pos­si­ble cruxes

Steven Byrnes27 Jun 2023 14:18 UTC
244 points
21 comments8 min readLW link

Ate­liers: Motivation

Stephen Fowler27 Jun 2023 13:07 UTC
7 points
0 comments2 min readLW link

Self-Blinded Caf­feine RCT

niplav27 Jun 2023 12:38 UTC
44 points
9 comments8 min readLW link

An overview of the points system

Iknownothing27 Jun 2023 9:09 UTC
3 points
4 comments1 min readLW link
(ai-plans.com)

AISC team re­port: Soft-op­ti­miza­tion, Bayes and Goodhart

27 Jun 2023 6:05 UTC
37 points
2 comments15 min readLW link

Epistemic spot check­ing one claim in The Precipice

Isaac King27 Jun 2023 1:03 UTC
33 points
3 comments1 min readLW link

nu­clear costs are inflation

bhauth26 Jun 2023 22:30 UTC
8 points
42 comments5 min readLW link
(www.bhauth.com)

Man in the Arena

Richard_Ngo26 Jun 2023 21:57 UTC
62 points
6 comments8 min readLW link

Catas­trophic Risks from AI #4: Or­ga­ni­za­tional Risks

26 Jun 2023 19:36 UTC
23 points
0 comments21 min readLW link
(arxiv.org)

The fraught voy­age of al­igned novelty

TsviBT26 Jun 2023 19:10 UTC
13 points
0 comments17 min readLW link

[Question] De­cep­tive AI vs. shift­ing in­stru­men­tal incentives

Aryeh Englander26 Jun 2023 18:09 UTC
7 points
2 comments3 min readLW link

On the Cost of Thriv­ing Index

Zvi26 Jun 2023 15:30 UTC
33 points
6 comments9 min readLW link
(thezvi.wordpress.com)

“Safety Cul­ture for AI” is im­por­tant, but isn’t go­ing to be easy

Davidmanheim26 Jun 2023 12:52 UTC
47 points
2 comments2 min readLW link
(forum.effectivealtruism.org)

Direct Prefer­ence Op­ti­miza­tion in One Minute

lukemarks26 Jun 2023 11:52 UTC
22 points
3 comments2 min readLW link

Self-ex­per­i­ment: A sup­ra­phys­iolog­i­cal dosage of testos­terone.

shapeshifter26 Jun 2023 10:26 UTC
8 points
3 comments1 min readLW link

Con­fused Attractiveness

Vlad Loweren26 Jun 2023 9:33 UTC
8 points
5 comments6 min readLW link

60+ Pos­si­ble Futures

Bart Bussmann26 Jun 2023 9:16 UTC
93 points
18 comments11 min readLW link

Bounded sur­prise exam paradox

cousin_it26 Jun 2023 8:37 UTC
29 points
5 comments2 min readLW link

Model, Care, Execution

26 Jun 2023 4:05 UTC
111 points
10 comments12 min readLW link1 review
(bayesshammai.substack.com)

The Fall of Ra­tion­al­ity—The Se­nate of Admins

Ace Delgado26 Jun 2023 1:49 UTC
−10 points
0 comments4 min readLW link

Another med­i­cal miracle

Dentin25 Jun 2023 20:43 UTC
191 points
48 comments3 min readLW link

Did Ben­gio and Teg­mark lose a de­bate about AI x-risk against LeCun and Mitchell?

Karl von Wendt25 Jun 2023 16:59 UTC
106 points
53 comments7 min readLW link

AI-Plans.com—a con­tributable compendium

Iknownothing25 Jun 2023 14:40 UTC
39 points
7 comments4 min readLW link
(ai-plans.com)

Map of maps of in­ter­est­ing fields

MaxG25 Jun 2023 14:02 UTC
24 points
0 comments1 min readLW link
(glozematrix.substack.com)

Why am I Me?

dadadarren25 Jun 2023 12:07 UTC
45 points
46 comments3 min readLW link

Will the grow­ing deer prion epi­demic spread to hu­mans? Why not?

eukaryote25 Jun 2023 4:31 UTC
170 points
33 comments13 min readLW link
(eukaryotewritesblog.com)

Crys­tal Heal­ing — or the Ori­gins of Ex­pected Utility Maximizers

25 Jun 2023 3:18 UTC
54 points
11 comments5 min readLW link

What’s in it for AI?

archeon25 Jun 2023 1:17 UTC
−20 points
0 comments1 min readLW link

Les­sons Learned: Prop­erly Publi­ciz­ing a Re­gional Meetup Event (also, last call to ap­ply!)

Willa25 Jun 2023 0:58 UTC
9 points
2 comments4 min readLW link

San Fran­cisco ACX Meetup “First Satur­day” July 1, 1 pm

guenael24 Jun 2023 22:40 UTC
2 points
0 comments1 min readLW link

Cor­rectly Cal­ibrated Trust

habryka24 Jun 2023 19:48 UTC
36 points
3 comments11 min readLW link
(forum.effectivealtruism.org)

Demo­cratic AI Con­sti­tu­tion: Round-Robin De­bate and Synthesis

scottviteri24 Jun 2023 19:31 UTC
10 points
4 comments5 min readLW link
(scottviteri.com)

DSLT 4. Phase Tran­si­tions in Neu­ral Networks

Liam Carroll24 Jun 2023 17:22 UTC
30 points
3 comments16 min readLW link

[Question] Donate Now vs Donate Later—Rel­a­tive Value of Dona­tions to AI Alignment

AlignmentOptimizer24 Jun 2023 17:20 UTC
4 points
4 comments1 min readLW link

ACX/​EA Meetup Bremen

RasmusHB24 Jun 2023 16:23 UTC
3 points
0 comments1 min readLW link

How to pre­vent Re-Trauma­ti­za­tion on Med­i­ta­tion Retreats

EternallyBlissful24 Jun 2023 14:16 UTC
20 points
1 comment5 min readLW link

[Question] Can you pre­vent nega­tive long-term effects of bad trips with sleep de­pri­va­tion?

EternallyBlissful24 Jun 2023 14:05 UTC
15 points
5 comments1 min readLW link

We ran a read­ing group on The Scout Mindset

24 Jun 2023 10:10 UTC
7 points
0 comments2 min readLW link

Cri­sis Boot Camp: les­sons learned and im­pli­ca­tions for EA

Nicole Ross24 Jun 2023 6:28 UTC
26 points
0 comments1 min readLW link

I just watched don’t look up.

ATheCoder23 Jun 2023 21:22 UTC
0 points
5 comments2 min readLW link

Au­to­matic Rate Limit­ing on LessWrong

Raemon23 Jun 2023 20:19 UTC
77 points
34 comments7 min readLW link

Catas­trophic Risks from AI #3: AI Race

23 Jun 2023 19:21 UTC
18 points
9 comments29 min readLW link
(arxiv.org)

Write the Worst Post on LessWrong!

Johannes C. Mayer23 Jun 2023 19:17 UTC
−10 points
5 comments4 min readLW link

Slay­ing the Hy­dra: to­ward a new game board for AI

Prometheus23 Jun 2023 17:04 UTC
0 points
5 comments6 min readLW link