Some recent survey papers on (mostly near-term) AI safety, security, and assurance

Aryeh Englander, 13 Jan 2021 21:50 UTC
13 points
0 comments, 3 min read, LW link

What is the currency of the future? 5 suggestions.

MrThink, 13 Jan 2021 21:10 UTC
18 points
11 comments, 2 min read, LW link

[Question] How to identify “pivotal chapters” in HtRaB?

habitmelon, 13 Jan 2021 20:46 UTC
1 point
0 comments, 1 min read, LW link

Notes on Gratitude

David Gross, 13 Jan 2021 20:37 UTC
11 points
0 comments, 19 min read, LW link

An Exploratory Toy AI Takeoff Model

niplav, 13 Jan 2021 18:13 UTC
10 points
6 comments, 12 min read, LW link

[AN #133]: Building machines that can cooperate (with humans, institutions, or other machines)

Rohin Shah, 13 Jan 2021 18:10 UTC
14 points
0 comments, 9 min read, LW link
(mailchi.mp)

[Question] Any examples of people analyzing/critiquing scientific studies or papers?

warrenjordan, 13 Jan 2021 18:00 UTC
3 points
6 comments, 1 min read, LW link

What’s good about haikus?

KatjaGrace, 13 Jan 2021 8:10 UTC
14 points
3 comments, 1 min read, LW link
(worldspiritsockpuppet.com)

The impact merge

Joe Carlsmith, 13 Jan 2021 7:26 UTC
48 points
11 comments, 4 min read, LW link

“If” is in the map

Chris_Leong, 13 Jan 2021 3:09 UTC
8 points
8 comments, 1 min read, LW link

Voting Phase for 2019 Review

Raemon, 13 Jan 2021 1:33 UTC
53 points
18 comments, 3 min read, LW link

#2: Neurocryopreservation vs whole-body preservation

13 Jan 2021 1:18 UTC
59 points
26 comments, 12 min read, LW link

[Question] How much harder is it to revive a neuro-only cryonics patient?

Mati_Roy, 12 Jan 2021 23:24 UTC
20 points
3 comments, 2 min read, LW link

Zen and Rationality: Karma

Gordon Seidoh Worley, 12 Jan 2021 20:56 UTC
11 points
1 comment, 1 min read, LW link

AI Alignment Using Reverse Simulation

Sven Nilsen, 12 Jan 2021 20:48 UTC
0 points
0 comments, 1 min read, LW link

The True Face of the Enemy

Space L Clottey, 12 Jan 2021 10:03 UTC
35 points
77 comments, 10 min read, LW link

A vastly faster vaccine rollout

KatjaGrace, 12 Jan 2021 7:40 UTC
86 points
37 comments, 5 min read, LW link
(worldspiritsockpuppet.com)

Physicist’s dissociation

Mateusz Mazurkiewicz, 12 Jan 2021 4:40 UTC
5 points
0 comments, 1 min read, LW link

Review of ‘Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More’

TurnTrout, 12 Jan 2021 3:57 UTC
40 points
1 comment, 2 min read, LW link

[Question] Base rate of RCT from developing countries finding unexpected effects

MichaelLowe, 12 Jan 2021 1:57 UTC
18 points
2 comments, 1 min read, LW link

[Question] What skills or habits have lasting value through time?

Duff, 12 Jan 2021 1:54 UTC
7 points
12 comments, 1 min read, LW link

D&D.Sci II: The Sorceror’s Personal Shopper

abstractapplic, 12 Jan 2021 1:38 UTC
60 points
29 comments, 2 min read, LW link

Group house norms really do seem toxic to many people.

sapphire, 11 Jan 2021 23:42 UTC
66 points
31 comments, 3 min read, LW link

Transparency and AGI safety

jylin04, 11 Jan 2021 18:51 UTC
54 points
12 comments, 30 min read, LW link

In Defense of Twitter’s Decision to Ban Trump

ragintumbleweed, 11 Jan 2021 18:08 UTC
4 points
21 comments, 5 min read, LW link

The time I got really into poker

KatjaGrace, 11 Jan 2021 13:00 UTC
23 points
4 comments, 2 min read, LW link
(worldspiritsockpuppet.com)

Shouldn’t it matter to the victim?

Joe Carlsmith, 11 Jan 2021 7:16 UTC
5 points
2 comments, 15 min read, LW link

Avoid Unnecessarily Political Examples

Raemon, 11 Jan 2021 5:41 UTC
106 points
42 comments, 3 min read, LW link

Mini thoughts on mintheism

CraigMichael, 11 Jan 2021 5:14 UTC
14 points
3 comments, 7 min read, LW link

Efficiency Wages: A Double-Edged Sword

Aaron Bergman, 11 Jan 2021 4:43 UTC
6 points
7 comments, 5 min read, LW link
(aaronbergman.substack.com)

Prediction: The Defense Department Will Blame Trump for the Slow Response on Jan. 7, 2021

Tim Liptrot, 11 Jan 2021 1:02 UTC
2 points
5 comments, 2 min read, LW link

Does GenZ have a shorter attention span?

Srijan Singh, 10 Jan 2021 21:38 UTC
2 points
0 comments, 2 min read, LW link

Prediction can be Outer Aligned at Optimum

Lukas Finnveden, 10 Jan 2021 18:48 UTC
15 points
12 comments, 11 min read, LW link

Review of Soft Takeoff Can Still Lead to DSA

Daniel Kokotajlo, 10 Jan 2021 18:10 UTC
79 points
15 comments, 6 min read, LW link

December 2020 gwern.net links

gwern, 10 Jan 2021 17:21 UTC
30 points
0 comments, 1 min read, LW link
(www.gwern.net)

2020: Forecasting in Review.

NunoSempere, 10 Jan 2021 16:06 UTC
28 points
0 comments, 10 min read, LW link

Fittingness: Rational success in concept formation

Polytopos, 10 Jan 2021 15:58 UTC
6 points
9 comments, 6 min read, LW link

A workshop on Life Influences

Elo, 10 Jan 2021 11:23 UTC
8 points
0 comments, 4 min read, LW link

Will we witness the compassion of a nation?

kithpendragon, 10 Jan 2021 11:10 UTC
−9 points
27 comments, 3 min read, LW link

Overconfidence

lsusr, 10 Jan 2021 10:36 UTC
26 points
13 comments, 3 min read, LW link

Distinguishing goals from chores

Amir Bolous, 10 Jan 2021 7:45 UTC
5 points
1 comment, 4 min read, LW link

[Question] How should you go about valuing your time?

Adam Zerner, 10 Jan 2021 6:54 UTC
17 points
5 comments, 1 min read, LW link

A different dictionary

KatjaGrace, 10 Jan 2021 6:30 UTC
17 points
1 comment, 2 min read, LW link
(worldspiritsockpuppet.com)

Rhythm 0 and the Absolution of Responsibility

Precious Oluwatobi Emmanuel, 10 Jan 2021 1:51 UTC
−5 points
0 comments, 4 min read, LW link

[Question] What to do if you can’t form any habits whatsoever?

masasin, 10 Jan 2021 1:17 UTC
18 points
24 comments, 1 min read, LW link

Imitative Generalisation (AKA ‘Learning the Prior’)

Beth Barnes, 10 Jan 2021 0:30 UTC
107 points
15 comments, 11 min read, LW link, 1 review

Babble Thread

Adam Zerner, 9 Jan 2021 21:52 UTC
10 points
7 comments, 1 min read, LW link

[U.S. specific] PPP: free money for self-employed & orgs (time-sensitive)

Steven Byrnes, 9 Jan 2021 19:53 UTC
35 points
1 comment, 2 min read, LW link

I think to find out what I write

Bob Baker, 9 Jan 2021 18:55 UTC
0 points
0 comments, 1 min read, LW link

The Case for a Journal of AI Alignment

adamShimi, 9 Jan 2021 18:13 UTC
45 points
32 comments, 4 min read, LW link