A Gym Gridworld Environment for the Treacherous Turn

Michaël Trazzi · 28 Jul 2018 21:27 UTC
74 points
9 comments · 3 min read
(github.com)

Algo trading is a central example of AI risk

Vanessa Kosoy · 28 Jul 2018 20:31 UTC
27 points
5 comments · 1 min read

Strategies for Personal Growth

Raemon · 28 Jul 2018 18:27 UTC
146 points
27 comments · 4 min read

Saving the world in 80 days: Epilogue

Logan Riggs · 28 Jul 2018 17:04 UTC
51 points
14 comments · 2 min read

Decisions are not about changing the world, they are about learning what world you live in

Shmi · 28 Jul 2018 8:41 UTC
39 points
71 comments · 11 min read

Anthropics: A Short Note on the Fission Riddle

Chris_Leong · 28 Jul 2018 4:14 UTC
11 points
16 comments · 2 min read

LW Update 2018-07-27 – Sharing Drafts

Raemon · 28 Jul 2018 2:54 UTC
31 points
7 comments · 1 min read

Some intuition on why consciousness seems subjective

SoerenMind · 27 Jul 2018 22:37 UTC
20 points
10 comments · 7 min read

Counterfactual outcome state transition parameters

Anders_H · 27 Jul 2018 21:13 UTC
37 points
1 comment · 6 min read

Model-building and scapegoating

Benquo · 27 Jul 2018 16:02 UTC
21 points
4 comments · 2 min read
(benjaminrosshoffman.com)

Robustness to fundamental uncertainty in AGI alignment

Gordon Seidoh Worley · 27 Jul 2018 0:41 UTC
7 points
1 comment · 1 min read
(arxiv.org)

The Experimental Apparatus

Elo · 26 Jul 2018 22:16 UTC
9 points
2 comments · 1 min read

Culture, interpretive labor, and tidying one’s room

Benquo · 26 Jul 2018 20:59 UTC
25 points
0 comments · 2 min read
(benjaminrosshoffman.com)

Prediction Markets: When Do They Work?

Zvi · 26 Jul 2018 12:30 UTC
166 points
17 comments · 10 min read
(thezvi.wordpress.com)

rattumb debate: Are cognitive biases a good thing ?

Paperclip Minimizer · 26 Jul 2018 7:38 UTC
3 points
0 comments · 1 min read
(antinegationism.tumblr.com)

Would you benefit from audio versions of posts?

Raemon · 26 Jul 2018 4:53 UTC
22 points
21 comments · 1 min read

Don’t Get Distracted by the Boilerplate

johnswentworth · 26 Jul 2018 2:15 UTC
87 points
19 comments · 2 min read

Fading Novelty

lifelonglearner · 25 Jul 2018 21:36 UTC
26 points
2 comments · 6 min read

Narrow AI Nanny: Reaching Strategic Advantage via Narrow AI to Prevent Creation of the Dangerous Superintelligence

avturchin · 25 Jul 2018 17:12 UTC
12 points
7 comments · 21 min read

Opinion Article Against Measuring Impact

ole.koksvik · 25 Jul 2018 7:37 UTC
6 points
2 comments · 1 min read
(www.theguardian.com)

The Evil Genie Puzzle

Chris_Leong · 25 Jul 2018 6:12 UTC
19 points
44 comments · 1 min read

Computational efficiency reasons not to model VNM-rational preference relations with utility functions

AlexMennen · 25 Jul 2018 2:11 UTC
16 points
5 comments · 3 min read

To cut your loses or push forward in court?

CommuterQuery · 24 Jul 2018 18:26 UTC
−8 points
1 comment · 1 min read

ISO: Name of Problem

johnswentworth · 24 Jul 2018 17:15 UTC
28 points
15 comments · 1 min read

Berlin LessWrong Meetup

ChristianKl · 24 Jul 2018 16:49 UTC
12 points
0 comments · 1 min read

LessWrong Meetup for Hamming Circle’s

ChristianKl · 24 Jul 2018 16:48 UTC
10 points
0 comments · 1 min read

[deleted]

Paperclip Minimizer · 24 Jul 2018 16:06 UTC
7 points
10 comments · 1 min read

Top Left Mood

Jacob Falkovich · 24 Jul 2018 14:35 UTC
17 points
2 comments · 1 min read
(putanumonit.com)

The problem of other minds

Elo · 24 Jul 2018 1:04 UTC
5 points
9 comments · 1 min read

The risk of an American Civil War is remote

Samo Burja · 23 Jul 2018 18:00 UTC
38 points
0 comments · 8 min read
(medium.com)

Alignment Newsletter #16: 07/23/18

Rohin Shah · 23 Jul 2018 16:20 UTC
42 points
0 comments · 12 min read
(mailchi.mp)

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander230 · 23 Jul 2018 12:06 UTC
4 points
0 comments · 1 min read

Fundamentals of Formalisation Level 6: Turing Machines and the Halting Problem

philip_b · 23 Jul 2018 9:46 UTC
9 points
0 comments · 1 min read

Let’s Discuss Functional Decision Theory

Chris_Leong · 23 Jul 2018 7:24 UTC
29 points
18 comments · 1 min read

Replace yourself before you stop organizing your community.

Raemon · 22 Jul 2018 20:57 UTC
65 points
16 comments · 4 min read

Who Wants The Job?

Zvi · 22 Jul 2018 14:00 UTC
24 points
29 comments · 2 min read
(thezvi.wordpress.com)

Simplicio and Sophisticus

Zvi · 22 Jul 2018 13:30 UTC
42 points
1 comment · 4 min read
(thezvi.wordpress.com)

Exorcizing the Speed Prior?

abramdemski · 22 Jul 2018 6:45 UTC
14 points
6 comments · 3 min read

12 Virtues of Rationality posters/icons

habryka · 22 Jul 2018 5:19 UTC
59 points
8 comments · 1 min read

Bayesian Reasoning with Unsong Theodicy means we shouldn’t destroy the universe

pku · 22 Jul 2018 1:25 UTC
7 points
1 comment · 1 min read

Stable Pointers to Value III: Recursive Quantilization

abramdemski · 21 Jul 2018 8:06 UTC
20 points
4 comments · 4 min read

Conceptual problems with utility functions, second attempt at explaining

Dacyn · 21 Jul 2018 2:08 UTC
16 points
5 comments · 2 min read

Can few-shot learning teach AI right from wrong?

Charlie Steiner · 20 Jul 2018 7:45 UTC
13 points
3 comments · 6 min read

The Psychology Of Resolute Agents

Chris_Leong · 20 Jul 2018 5:42 UTC
10 points
20 comments · 5 min read

Probability is Real, and Value is Complex

abramdemski · 20 Jul 2018 5:24 UTC
79 points
20 comments · 6 min read

Solving the AI Race Finalists

Gordon Seidoh Worley · 19 Jul 2018 21:04 UTC
24 points
0 comments · 1 min read
(medium.com)

“Artificial Intelligence” (new entry at Stanford Encyclopedia of Philosophy)

fortyeridania · 19 Jul 2018 9:48 UTC
5 points
8 comments · 1 min read
(plato.stanford.edu)

Discussion: Raising the Sanity Waterline

Chriswaterguy · 19 Jul 2018 2:12 UTC
2 points
0 comments · 1 min read

LW Update 2018-07-18 – AlignmentForum Bug Fixes

Raemon · 19 Jul 2018 2:10 UTC
13 points
0 comments · 1 min read

Generalized Kelly betting

Linda Linsefors · 19 Jul 2018 1:38 UTC
15 points
5 comments · 2 min read