Becoming Black Boxish

vitaliya · Sep 25, 2022, 11:35 PM
16 points
0 comments · 2 min read · LW link

Announcing Balsa Research

Zvi · Sep 25, 2022, 10:50 PM
235 points
64 comments · 2 min read · LW link · 1 review
(thezvi.wordpress.com)

[Question] How to learn: Struggle VS Lookup-Table?

Nicholas / Heather Kross · Sep 25, 2022, 9:58 PM
15 points
2 comments · 2 min read · LW link

An Unexpected GPT-3 Decision in a Simple Gamble

casualphysicsenjoyer · Sep 25, 2022, 4:46 PM
8 points
4 comments · 1 min read · LW link

“Agency” needs nuance

Evie Cottrell · Sep 25, 2022, 7:40 AM
23 points
1 comment · 14 min read · LW link

Acceptance and Commitment Therapy (ACT) 101

Evie Cottrell · Sep 25, 2022, 7:25 AM
7 points
2 comments · 8 min read · LW link

Bathroom Construction Cost Comparison

jefftk · Sep 25, 2022, 2:30 AM
11 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Prioritizing the Arts in response to AI automation

Casey · Sep 25, 2022, 2:25 AM
18 points
11 comments · 2 min read · LW link

UI/UX From the Dark Ages

Shmi · Sep 25, 2022, 1:53 AM
25 points
15 comments · 2 min read · LW link

P(misalignment x-risk|AGI) is small #[Future Fund worldview prize]

Dibbu Dibbu · Sep 24, 2022, 11:54 PM
−18 points
0 comments · 4 min read · LW link

[Question] Papers to start getting into NLP-focused alignment research

Feraidoon · Sep 24, 2022, 11:53 PM
6 points
0 comments · 1 min read · LW link

Whose Fault?

Markovia · Sep 24, 2022, 11:53 PM
1 point
0 comments · LW link

Brain-over-body biases, and the embodied value problem in AI alignment

geoffreymiller · Sep 24, 2022, 10:24 PM
10 points
6 comments · 25 min read · LW link

Opt out from the Funni

Coafos · Sep 24, 2022, 10:07 PM
8 points
1 comment · 2 min read · LW link

AI coöperation is more possible than you think

423175 · Sep 24, 2022, 9:26 PM
7 points
0 comments · 2 min read · LW link

“Cotton Gin” AI Risk

423175 · Sep 24, 2022, 9:26 PM
7 points
3 comments · 2 min read · LW link

Two reasons we might be closer to solving alignment than it seems

Sep 24, 2022, 8:00 PM
57 points
9 comments · 4 min read · LW link

Orexin and the quest for more waking hours

ChristianKl · Sep 24, 2022, 7:54 PM
131 points
39 comments · 5 min read · LW link

AI Safety Discord community (requesting help!)

Casey · Sep 24, 2022, 5:35 PM
8 points
0 comments · 2 min read · LW link

[Question] I’m planning to start creating more write-ups summarizing my thoughts on various issues, mostly related to AI existential safety. What do you want to hear my nuanced takes on?

David Scott Krueger (formerly: capybaralet) · Sep 24, 2022, 12:38 PM
9 points
10 comments · 1 min read · LW link

Attempts at Forwarding Speed Priors

Sep 24, 2022, 5:49 AM
30 points
2 comments · 18 min read · LW link

Announcing $5,000 bounty for (responsibly) ending malaria

lc · Sep 24, 2022, 4:28 AM
116 points
40 comments · 4 min read · LW link

[Question] Why Do AI researchers Rate the Probability of Doom So Low?

Aorou · Sep 24, 2022, 2:33 AM
7 points
6 comments · 3 min read · LW link

Set List Approaches

jefftk · Sep 24, 2022, 2:30 AM
9 points
0 comments · 12 min read · LW link
(www.jefftk.com)

A ranked link of LessWrong tags/concepts

peterslattery · Sep 24, 2022, 12:11 AM
16 points
4 comments · 1 min read · LW link

[Question] Posts with clickable sections of images?

NoBadCake · Sep 23, 2022, 11:19 PM
1 point
5 comments · 1 min read · LW link

Under what circumstances have governments cancelled AI-type systems?

David Gross · Sep 23, 2022, 9:11 PM
7 points
1 comment · 1 min read · LW link
(www.carnegieuktrust.org.uk)

There are no rules

unoptimal · Sep 23, 2022, 8:47 PM
38 points
2 comments · 5 min read · LW link

Interpreting Neural Networks through the Polytope Lens

Sep 23, 2022, 5:58 PM
144 points
29 comments · 33 min read · LW link

The heterogeneity of human value types: Implications for AI alignment

geoffreymiller · Sep 23, 2022, 5:03 PM
10 points
2 comments · 10 min read · LW link

How to use DMT without going insane: On navigating epistemic uncertainty in the DMT memeplex

cube_flipper · Sep 23, 2022, 4:32 PM
7 points
4 comments · 8 min read · LW link
(smoothbrains.net)

Shahar Avin On How To Regulate Advanced AI Systems

Michaël Trazzi · Sep 23, 2022, 3:46 PM
31 points
0 comments · 4 min read · LW link
(theinsideview.ai)

Interlude: But Who Optimizes The Optimizer?

Paul Bricman · Sep 23, 2022, 3:30 PM
15 points
0 comments · 10 min read · LW link

Why do so many things break in a 2 element set?

Alok Singh · Sep 23, 2022, 6:30 AM
6 points
3 comments · 1 min read · LW link
(alok.github.io)

Intelligence as a Platform

Robert Kennedy · Sep 23, 2022, 5:51 AM
10 points
5 comments · 3 min read · LW link

Public-facing Censorship Is Safety Theater, Causing Reputational Damage

Yitz · Sep 23, 2022, 5:08 AM
149 points
42 comments · 6 min read · LW link

A game of mattering

KatjaGrace · Sep 23, 2022, 2:30 AM
64 points
7 comments · 5 min read · LW link
(worldspiritsockpuppet.com)

Making Prunes

jefftk · Sep 23, 2022, 2:13 AM
10 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Funding is All You Need: Getting into Grad School by Hacking the NSF GRFP Fellowship

hapanin · Sep 22, 2022, 9:39 PM
106 points
9 comments · 12 min read · LW link

[Question] What Do AI Safety Pitches Not Get About Your Field?

Aris · Sep 22, 2022, 9:27 PM
28 points
3 comments · 1 min read · LW link

“Free Will” in a Computational Universe

DragonGod · Sep 22, 2022, 9:25 PM
5 points
6 comments · 14 min read · LW link

Initial Thoughts on Dissolving “Couldness”

DragonGod · Sep 22, 2022, 9:23 PM
6 points
1 comment · 3 min read · LW link

Let’s Compare Notes

Shoshannah Tekofsky · Sep 22, 2022, 8:47 PM
17 points
3 comments · 6 min read · LW link

Methodological Therapy: An Agenda For Tackling Research Bottlenecks

Sep 22, 2022, 6:41 PM
54 points
6 comments · 9 min read · LW link

Berkeley group house, spots open

Jack R · Sep 22, 2022, 5:13 PM
4 points
1 comment · 1 min read · LW link

Fake qualities of mind

Kaj_Sotala · Sep 22, 2022, 4:40 PM
59 points
2 comments · 2 min read · LW link
(kajsotala.fi)

Dath Ilan’s Views on Stopgap Corrigibility

David Udell · Sep 22, 2022, 4:16 PM
78 points
19 comments · 13 min read · LW link
(www.glowfic.com)

Ukraine Post #12

Zvi · Sep 22, 2022, 2:40 PM
104 points
3 comments · 16 min read · LW link
(thezvi.wordpress.com)

Covid 9/22/22: The Joe Biden Sings

Zvi · Sep 22, 2022, 2:40 PM
15 points
17 comments · 24 min read · LW link
(thezvi.wordpress.com)

AI Risk Intro 2: Solving The Problem

Sep 22, 2022, 1:55 PM
22 points
0 comments · 27 min read · LW link