Against Active Shooter Drills

Zvi Jun 16, 2022, 1:40 PM
91 points
30 comments 7 min read LW link
(thezvi.wordpress.com)

Announcing the Alignment of Complex Systems Research Group

Jun 4, 2022, 4:10 AM
91 points
20 comments 5 min read LW link

The “mind-body vicious cycle” model of RSI & back pain

Steven Byrnes Jun 9, 2022, 12:30 PM
91 points
32 comments 12 min read LW link

I’m trying out “asteroid mindset”

Alex_Altair Jun 3, 2022, 1:35 PM
90 points
5 comments 4 min read LW link

In defense of flailing, with foreword by Bill Burr

lc Jun 17, 2022, 4:40 PM
88 points
6 comments 4 min read LW link

Causal confusion as an argument against the scaling hypothesis

Jun 20, 2022, 10:54 AM
86 points
30 comments 15 min read LW link

I applied for a MIRI job in 2020. Here’s what happened next.

ViktoriaMalyasova Jun 15, 2022, 7:37 PM
86 points
17 comments 7 min read LW link

Transcript of a Twitter Discussion on EA from June 2022

Zvi Jun 6, 2022, 1:50 PM
85 points
4 comments 1 min read LW link
(thezvi.wordpress.com)

Air Conditioner Test Results & Discussion

johnswentworth Jun 22, 2022, 10:26 PM
82 points
42 comments 6 min read LW link

Air Conditioner Repair

Zvi Jun 27, 2022, 12:40 PM
81 points
34 comments 4 min read LW link
(thezvi.wordpress.com)

Reinventing the wheel

jasoncrawford Jun 4, 2022, 10:39 PM
78 points
13 comments 2 min read LW link
(rootsofprogress.org)

AI Training Should Allow Opt-Out

alyssavance Jun 23, 2022, 1:33 AM
76 points
13 comments 6 min read LW link

A Quick List of Some Problems in AI Alignment As A Field

Nicholas / Heather Kross Jun 21, 2022, 11:23 PM
75 points
12 comments 6 min read LW link
(www.thinkingmuchbetter.com)

Worked Examples of Shapley Values

lalaithion Jun 24, 2022, 5:13 PM
75 points
11 comments 8 min read LW link

Some reflections on the LW community after several months of active engagement

M. Y. Zuo Jun 25, 2022, 5:04 PM
72 points
40 comments 4 min read LW link

Feature request: voting buttons at the bottom?

Oliver Sourbut Jun 24, 2022, 2:41 PM
70 points
12 comments 1 min read LW link

Book Review: Talent

Zvi Jun 3, 2022, 8:10 PM
70 points
19 comments 79 min read LW link
(thezvi.wordpress.com)

Resources I send to AI researchers about AI safety

Vael Gates Jun 14, 2022, 2:24 AM
69 points
12 comments 1 min read LW link

Eliciting Latent Knowledge (ELK) - Distillation/Summary

Marius Hobbhahn Jun 8, 2022, 1:18 PM
69 points
2 comments 21 min read LW link

How to pursue a career in technical AI alignment

Charlie Rogers-Smith Jun 4, 2022, 9:11 PM
69 points
1 comment 39 min read LW link

Epistemological Vigilance for Alignment

adamShimi Jun 6, 2022, 12:27 AM
66 points
11 comments 10 min read LW link

[Question] Has anyone actually tried to convince Terry Tao or other top mathematicians to work on alignment?

P. Jun 8, 2022, 10:26 PM
64 points
51 comments 4 min read LW link

Seven ways to become unstoppably agentic

Evie Cottrell Jun 26, 2022, 5:39 PM
64 points
16 comments 8 min read LW link

Half-baked AI Safety ideas thread

Aryeh Englander Jun 23, 2022, 4:11 PM
64 points
63 comments 1 min read LW link

“Brain enthusiasts” in AI Safety

Jun 18, 2022, 9:59 AM
63 points
5 comments 10 min read LW link
(universalprior.substack.com)

Ten experiments in modularity, which we’d like you to run!

Jun 16, 2022, 9:17 AM
62 points
3 comments 9 min read LW link

[Question] What’s the contingency plan if we get AGI tomorrow?

Yitz Jun 23, 2022, 3:10 AM
61 points
23 comments 1 min read LW link

Open Problems in AI X-Risk [PAIS #5]

Jun 10, 2022, 2:08 AM
61 points
6 comments 36 min read LW link

How Do Selection Theorems Relate To Interpretability?

johnswentworth Jun 9, 2022, 7:39 PM
60 points
14 comments 3 min read LW link

A short conceptual explainer of Immanuel Kant’s Critique of Pure Reason

jessicata Jun 3, 2022, 1:06 AM
57 points
12 comments 16 min read LW link
(unstableontology.com)

Covid 6/2/22: Declining to Respond

Zvi Jun 2, 2022, 1:50 PM
55 points
10 comments 7 min read LW link
(thezvi.wordpress.com)

Kurzgesagt – The Last Human (Youtube)

habryka Jun 29, 2022, 3:28 AM
54 points
7 comments 1 min read LW link
(www.youtube.com)

How fast can we perform a forward pass?

jsteinhardt Jun 10, 2022, 11:30 PM
53 points
9 comments 15 min read LW link
(bounded-regret.ghost.io)

Paradigms of AI alignment: components and enablers

Vika Jun 2, 2022, 6:19 AM
53 points
4 comments 8 min read LW link

How To: A Workshop (or anything)

Duncan Sabien (Deactivated) Jun 12, 2022, 8:00 AM
53 points
13 comments 37 min read LW link 1 review

[Link] OpenAI: Learning to Play Minecraft with Video PreTraining (VPT)

Aryeh Englander Jun 23, 2022, 4:29 PM
53 points
3 comments 1 min read LW link

What’s it like to have sex with Duncan?

Duncan Sabien (Deactivated) Jun 17, 2022, 2:32 AM
52 points
19 comments 17 min read LW link

Latent Adversarial Training

Adam Jermyn Jun 29, 2022, 8:04 PM
52 points
13 comments 5 min read LW link

The horror of what must, yet cannot, be true

Kaj_Sotala Jun 2, 2022, 10:20 AM
52 points
18 comments 2 min read LW link
(kajsotala.fi)

Perils of optimizing in social contexts

owencb Jun 16, 2022, 5:40 PM
50 points
1 comment 2 min read LW link

Our mental building blocks are more different than I thought

Marius Hobbhahn Jun 15, 2022, 11:07 AM
50 points
11 comments 14 min read LW link

Poorly-Aimed Death Rays

Thane Ruthenis Jun 11, 2022, 6:29 PM
48 points
5 comments 4 min read LW link

Child Contracting

jefftk Jun 26, 2022, 2:30 AM
48 points
2 comments 1 min read LW link
(www.jefftk.com)

Pitching an Alignment Softball

mu_(negative) Jun 7, 2022, 4:10 AM
47 points
13 comments 10 min read LW link

Why so little AI risk on rationalist-adjacent blogs?

Grant Demaree Jun 13, 2022, 6:31 AM
46 points
23 comments 8 min read LW link

[Link] Childcare: what the science says

Gunnar_Zarncke Jun 24, 2022, 9:45 PM
46 points
4 comments 1 min read LW link
(criticalscience.medium.com)

Dagger of Detect Evil

lsusr Jun 21, 2022, 6:23 AM
45 points
22 comments 3 min read LW link

Summary of “AGI Ruin: A List of Lethalities”

Stephen McAleese Jun 10, 2022, 10:35 PM
45 points
2 comments 8 min read LW link

Continuity Assumptions

Jan_Kulveit Jun 13, 2022, 9:31 PM
44 points
13 comments 4 min read LW link

FYI: I’m working on a book about the threat of AGI/ASI for a general audience. I hope it will be of value to the cause and the community

Darren McKee Jun 15, 2022, 6:08 PM
43 points
15 comments 2 min read LW link