Ly­ing to Save Humanity

cebsuvxNov 14, 2022, 11:04 PM
−1 points
4 comments1 min readLW link

Mo­ral con­ta­gion heuristic

MvolzNov 14, 2022, 9:17 PM
14 points
3 comments2 min readLW link

Will we run out of ML data? Ev­i­dence from pro­ject­ing dataset size trends

Pablo VillalobosNov 14, 2022, 4:42 PM
75 points
12 comments2 min readLW link
(epochai.org)

I (with the help of a few more peo­ple) am plan­ning to cre­ate an in­tro­duc­tion to AI Safety that a smart teenager can un­der­stand. What am I miss­ing?

TapataktNov 14, 2022, 4:12 PM
3 points
5 comments1 min readLW link

Two New New­comb Variants

eva_Nov 14, 2022, 2:01 PM
26 points
22 comments3 min readLW link

Im­prov­ing Emer­gency Ve­hi­cle Utilization

jefftkNov 14, 2022, 2:00 PM
15 points
10 comments1 min readLW link
(www.jefftk.com)

X-risk Miti­ga­tion Does Ac­tu­ally Re­quire Longter­mism

DragonGodNov 14, 2022, 12:54 PM
6 points
1 comment1 min readLW link

[Question] Why don’t we have self driv­ing cars yet?

Linda LinseforsNov 14, 2022, 12:19 PM
22 points
16 comments1 min readLW link

Ei­gen­val­ues for Dis­tance from The Bud­dhist Pre­cepts And The Ten Commandments

benjamin.j.campbellNov 14, 2022, 5:50 AM
−3 points
2 comments1 min readLW link

AI Safety Micro­grant Round

Chris_LeongNov 14, 2022, 4:25 AM
22 points
1 comment1 min readLW link

Es­ti­mat­ing the prob­a­bil­ity that FTX Fu­ture Fund grant money gets clawed back

spencergNov 14, 2022, 3:33 AM
28 points
6 comments1 min readLW link

Ra­tional over­con­fi­dence in the tens of billions: re­cent example

banevNov 13, 2022, 10:48 PM
−20 points
3 comments2 min readLW link

In Defence of Tem­po­ral Dis­count­ing in Longter­mist Ethics

DragonGodNov 13, 2022, 9:54 PM
25 points
4 comments1 min readLW link

An­nounc­ing Non­lin­ear Emer­gency Funding

KatWoodsNov 13, 2022, 7:02 PM
54 points
0 comments1 min readLW link

The Align­ment Com­mu­nity Is Cul­turally Broken

sudoNov 13, 2022, 6:53 PM
136 points
68 comments2 min readLW link

The Fu­til­ity of Sta­tus and Signalling

Ape in the coatNov 13, 2022, 5:14 PM
19 points
4 comments3 min readLW link

A short cri­tique of Vanessa Kosoy’s PreDCA

Martín SotoNov 13, 2022, 4:00 PM
28 points
8 comments4 min readLW link

What’s the Alter­na­tive to In­de­pen­dence?

jefftkNov 13, 2022, 3:30 PM
50 points
3 comments1 min readLW link
(www.jefftk.com)

De­ci­sion mak­ing un­der model am­bi­guity, moral un­cer­tainty, and other agents with free will?

Jobst HeitzigNov 13, 2022, 12:50 PM
4 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

The sky is not blue (par­don the ob­vi­ous­ness)

banevNov 13, 2022, 10:49 AM
−13 points
6 comments1 min readLW link

Char­ac­ter­iz­ing In­trin­sic Com­po­si­tion­al­ity in Trans­form­ers with Tree Projections

Ulisse MiniNov 13, 2022, 9:46 AM
12 points
2 comments1 min readLW link
(arxiv.org)

Not­ing an un­sub­stan­ti­ated com­mu­nal be­lief about the FTX disaster

YitzNov 13, 2022, 5:37 AM
50 points
52 comments1 min readLW link

Sols­tice 2022 Roundup

dspeyerNov 12, 2022, 9:26 PM
34 points
12 comments1 min readLW link

Women and Effec­tive Altruism

P. G. Keerthana GopalakrishnanNov 12, 2022, 8:57 PM
−30 points
15 comments2 min readLW link
(keerthanapg.com)

A Poem for S.B.F.

AnthonyRepettoNov 12, 2022, 8:41 PM
−30 points
21 comments1 min readLW link

Mus­ings on the ap­pro­pri­ate tar­gets for standards

tailcalledNov 12, 2022, 8:19 PM
11 points
13 comments1 min readLW link

Ways to buy time

Nov 12, 2022, 7:31 PM
34 points
23 comments12 min readLW link

[Question] How do new­com­ers delve deeper into the com­mu­nity?

Lord DreadwarNov 12, 2022, 7:00 PM
7 points
2 comments1 min readLW link

User-Con­trol­led Al­gorith­mic Feeds

jefftkNov 12, 2022, 3:20 PM
35 points
7 comments2 min readLW link
(www.jefftk.com)

Vanessa Kosoy’s PreDCA, distilled

Martín SotoNov 12, 2022, 11:38 AM
17 points
19 comments5 min readLW link

Poster Ses­sion on AI Safety

Neil CrawfordNov 12, 2022, 3:50 AM
7 points
7 comments1 min readLW link

Is AI Gain-of-Func­tion re­search a thing?

MadHatterNov 12, 2022, 2:33 AM
9 points
2 comments2 min readLW link

Why don’t or­ga­ni­za­tions have a CREAMO?

ShmiNov 12, 2022, 2:19 AM
0 points
8 comments1 min readLW link

“Ru­de­ness”, a use­ful co­or­di­na­tion mechanic

RaemonNov 11, 2022, 10:27 PM
49 points
20 comments2 min readLW link

In­ter­nal­iz­ing the dam­age of bad-act­ing part­ners cre­ates in­cen­tives for due diligence

tailcalledNov 11, 2022, 8:57 PM
17 points
7 comments1 min readLW link

Spec­u­la­tion on Cur­rent Op­por­tu­ni­ties for Unusu­ally High Im­pact in Global Health

johnswentworthNov 11, 2022, 8:47 PM
114 points
31 comments4 min readLW link

[Question] Is acausal ex­tor­tion pos­si­ble?

sisyphusNov 11, 2022, 7:48 PM
−20 points
34 comments3 min readLW link

Cathar­sis in Bb

jefftkNov 11, 2022, 5:40 PM
6 points
0 comments1 min readLW link
(www.jefftk.com)

In­stru­men­tal con­ver­gence is what makes gen­eral in­tel­li­gence possible

tailcalledNov 11, 2022, 4:38 PM
105 points
11 comments4 min readLW link

Weekly Roundup #5

ZviNov 11, 2022, 4:20 PM
33 points
0 comments6 min readLW link
(thezvi.wordpress.com)

Charg­ing for the Dharma

jchanNov 11, 2022, 2:02 PM
32 points
18 comments5 min readLW link

[Question] EA (& AI Safety) has over­es­ti­mated its pro­jected fund­ing — which de­ci­sions must be re­vised?

Cleo NardoNov 11, 2022, 1:50 PM
22 points
7 comments1 min readLW link
(forum.effectivealtruism.org)

Where the log­i­cal fal­lacy is not (Gen­er­al­iza­tion From Fic­tional Ev­i­dence)

banevNov 11, 2022, 10:41 AM
−12 points
14 comments1 min readLW link

Why I’m Work­ing On Model Ag­nos­tic Interpretability

Jessica RumbelowNov 11, 2022, 9:24 AM
27 points
9 comments2 min readLW link

How likely are ma­lign pri­ors over ob­jec­tives? [aborted WIP]

David JohnstonNov 11, 2022, 5:36 AM
−1 points
0 comments8 min readLW link

Do Time­less De­ci­sion The­o­rists re­ject all black­mail from other Time­less De­ci­sion The­o­rists?

myrenNov 11, 2022, 12:38 AM
7 points
8 comments3 min readLW link

We must be very clear: fraud in the ser­vice of effec­tive al­tru­ism is unacceptable

evhubNov 10, 2022, 11:31 PM
42 points
56 comments1 min readLW link

[simu­la­tion] 4chan user claiming to be the at­tor­ney hired by Google’s sen­tient chat­bot LaMDA shares wild de­tails of encounter

janusNov 10, 2022, 9:39 PM
19 points
1 comment13 min readLW link
(generative.ink)

di­v­ine carrot

Alok SinghNov 10, 2022, 8:50 PM
18 points
2 comments1 min readLW link
(alok.github.io)

Me­tac­u­lus An­nounces The Million Pre­dic­tions Hackathon

ChristianWilliamsNov 10, 2022, 8:00 PM
7 points
0 comments1 min readLW link