Two Ne­glected Prob­lems in Hu­man-AI Safety

Wei Dai16 Dec 2018 22:13 UTC
102 points
25 comments2 min readLW link

Bab­ble, Learn­ing, and the Typ­i­cal Mind Fallacy

NaiveTortoise16 Dec 2018 16:51 UTC
6 points
0 comments1 min readLW link
(an1lam.github.io)

[Question] What are some con­crete prob­lems about log­i­cal coun­ter­fac­tu­als?

Chris_Leong16 Dec 2018 10:20 UTC
25 points
4 comments1 min readLW link

The E-Coli Test for AI Alignment

johnswentworth16 Dec 2018 8:10 UTC
70 points
24 comments1 min readLW link

on wellunderstoodness

Quinn16 Dec 2018 7:22 UTC
9 points
2 comments4 min readLW link

Sabine “Bee” Hossen­felder (and Robin Han­son) on How to fix Academia with Pre­dic­tion Markets

Shmi16 Dec 2018 6:37 UTC
12 points
0 comments1 min readLW link
(backreaction.blogspot.com)

New edi­tion of “Ra­tion­al­ity: From AI to Zom­bies”

Rob Bensinger15 Dec 2018 21:33 UTC
84 points
27 comments2 min readLW link

Gw­ern about cen­taurs: there is no chance that any use­ful man+ma­chine com­bi­na­tion will work to­gether for more than 10 years, as hu­mans soon will be only a liability

avturchin15 Dec 2018 21:32 UTC
34 points
4 comments1 min readLW link
(www.reddit.com)

Ar­gue Poli­tics* With Your Best Friends

sarahconstantin15 Dec 2018 19:00 UTC
75 points
6 comments6 min readLW link
(srconstantin.wordpress.com)

In­ter­pret­ing ge­netic testing

jefftk15 Dec 2018 15:56 UTC
24 points
1 comment2 min readLW link

[Question] What is ab­strac­tion?

Adam Zerner15 Dec 2018 8:36 UTC
25 points
11 comments4 min readLW link

In­tro­duc­ing the Longevity Re­search Institute

sarahconstantin14 Dec 2018 20:20 UTC
79 points
11 comments1 min readLW link
(srconstantin.wordpress.com)

Player vs. Char­ac­ter: A Two-Level Model of Ethics

sarahconstantin14 Dec 2018 19:40 UTC
94 points
27 comments7 min readLW link3 reviews
(srconstantin.wordpress.com)

[Question] How to re­set my pass­word?

hirvinen14 Dec 2018 16:18 UTC
3 points
1 comment1 min readLW link

[Question] What pod­casts does the com­mu­nity listen to?

hristovassilev14 Dec 2018 15:40 UTC
13 points
6 comments1 min readLW link

Med­i­ta­tions on Momentum

Richard Meadows14 Dec 2018 10:53 UTC
107 points
32 comments10 min readLW link

[Question] Can I use Less Wrong brand­ing in youtube videos?

Bae's Theorem14 Dec 2018 7:10 UTC
3 points
5 comments1 min readLW link

Three AI Safety Re­lated Ideas

Wei Dai13 Dec 2018 21:32 UTC
69 points
38 comments2 min readLW link

An Ex­ten­sive Cat­e­gori­sa­tion of In­finite Paradoxes

Chris_Leong13 Dec 2018 18:36 UTC
−14 points
48 comments13 min readLW link

The Bat and Ball Prob­lem Revisited

drossbucket13 Dec 2018 7:16 UTC
69 points
30 comments15 min readLW link2 reviews

Multi-agent pre­dic­tive minds and AI alignment

Jan_Kulveit12 Dec 2018 23:48 UTC
63 points
18 comments10 min readLW link

[Question] What went wrong in this in­ter­ac­tion?

t3tsubo12 Dec 2018 19:59 UTC
1 point
8 comments1 min readLW link

In­ter­net Search Tips: how I use Google/​Google Scholar/​Lib­gen

gwern12 Dec 2018 14:50 UTC
51 points
0 comments1 min readLW link
(www.gwern.net)

Za­greb Ra­tion­al­ity Meetup

Roko Jelavić12 Dec 2018 13:08 UTC
1 point
0 comments1 min readLW link

Should ethi­cists be in­side or out­side a pro­fes­sion?

Eliezer Yudkowsky12 Dec 2018 1:40 UTC
97 points
7 comments9 min readLW link

Align­ment Newslet­ter #36

Rohin Shah12 Dec 2018 1:10 UTC
21 points
0 comments11 min readLW link
(mailchi.mp)

A hun­dred Shakespeares

Stuart_Armstrong11 Dec 2018 23:11 UTC
29 points
5 comments2 min readLW link

Norms of Mem­ber­ship for Vol­un­tary Groups

sarahconstantin11 Dec 2018 22:10 UTC
192 points
10 comments7 min readLW link
(srconstantin.wordpress.com)

Quan­tum im­mor­tal­ity: Is de­cline of mea­sure com­pen­sated by merg­ing timelines?

avturchin11 Dec 2018 19:39 UTC
9 points
8 comments2 min readLW link

Bounded ra­tio­nal­ity abounds in mod­els, not ex­plic­itly defined

Stuart_Armstrong11 Dec 2018 19:34 UTC
14 points
9 comments1 min readLW link

Figur­ing out what Alice wants: non-hu­man Alice

Stuart_Armstrong11 Dec 2018 19:31 UTC
16 points
17 comments2 min readLW link

As­sum­ing we’ve solved X, could we do Y...

Stuart_Armstrong11 Dec 2018 18:13 UTC
31 points
16 comments2 min readLW link

[Question] Who’s wel­come to our LessWrong mee­tups?

ChristianKl10 Dec 2018 13:31 UTC
19 points
5 comments1 min readLW link

[Question] How Old is Smal­lpox?

Raemon10 Dec 2018 10:50 UTC
44 points
5 comments2 min readLW link

LessWrong Tel Aviv: Civ­i­liza­tional Collapse

JoshuaFox10 Dec 2018 7:26 UTC
7 points
0 comments1 min readLW link

Bos­ton Sec­u­lar Solstice

jefftk10 Dec 2018 1:59 UTC
10 points
0 comments1 min readLW link

[Question] Why should EA care about ra­tio­nal­ity (and vice-versa)?

Gordon Seidoh Worley9 Dec 2018 22:03 UTC
14 points
13 comments1 min readLW link

Measly Med­i­ta­tion Measurements

justinpombrio9 Dec 2018 20:54 UTC
62 points
19 comments1 min readLW link

Re­view: Slay the Spire

Zvi9 Dec 2018 20:40 UTC
22 points
1 comment7 min readLW link
(thezvi.wordpress.com)

[Question] In­stead of us­ing the high-level lan­guages, pro­gram­mers will start us­ing pro­gram­ming of more high-level or hu­man lan­guage level pro­gram­ming?

manhobby9 Dec 2018 18:54 UTC
−24 points
7 comments1 min readLW link

Kin­der­garten in NYC: Much More than You Wanted to Know

Laura B9 Dec 2018 15:36 UTC
36 points
1 comment11 min readLW link

New Rat­fic: Nyssa in the Realm of Possibility

Alicorn9 Dec 2018 5:00 UTC
40 points
0 comments1 min readLW link

[Question] Near-term Worse than Ex­is­ten­tial Threats

Alephywr9 Dec 2018 3:10 UTC
−15 points
3 comments1 min readLW link

[Question] What pre­cisely do we mean by AI al­ign­ment?

Gordon Seidoh Worley9 Dec 2018 2:23 UTC
29 points
8 comments1 min readLW link

LW Up­date 2018-12-06 – All Posts Page, Ques­tions Page, Posts Item rework

Raemon8 Dec 2018 21:30 UTC
18 points
1 comment1 min readLW link

GraphQL tu­to­rial for LessWrong and Effec­tive Altru­ism Forum

riceissa8 Dec 2018 19:51 UTC
88 points
5 comments5 min readLW link

[Question] What is “So­cial Real­ity?”

Raemon8 Dec 2018 17:41 UTC
38 points
17 comments1 min readLW link

Pre­dic­tion Mar­kets Are About Be­ing Right

Zvi8 Dec 2018 14:00 UTC
83 points
7 comments7 min readLW link
(thezvi.wordpress.com)

[Question] Why should I care about ra­tio­nal­ity?

TurnTrout8 Dec 2018 3:49 UTC
24 points
5 comments1 min readLW link

Book re­view: Ar­tifi­cial In­tel­li­gence Safety and Security

PeterMcCluskey8 Dec 2018 3:47 UTC
27 points
3 comments8 min readLW link
(www.bayesianinvestor.com)