For­mally Stat­ing the AI Align­ment Problem

Gordon Seidoh Worley19 Feb 2018 19:06 UTC
14 points
0 comments13 min readLW link
(mapandterritory.org)

User­names in RSS feeds

benwr19 Feb 2018 2:22 UTC
1 point
0 comments1 min readLW link

An al­ter­na­tive way to browse LessWrong 2.0

saturn19 Feb 2018 2:10 UTC
17 points
6 comments1 min readLW link

An al­ter­na­tive way to browse LessWrong 2.0

clone of saturn19 Feb 2018 1:52 UTC
48 points
58 comments1 min readLW link

Whose rea­son­ing can you rely on when your own is faulty?

weft18 Feb 2018 22:41 UTC
29 points
1 comment2 min readLW link

A Sim­ple Motto

katerinjo18 Feb 2018 18:12 UTC
11 points
18 comments1 min readLW link

[Meta] New mod­er­a­tion tools and mod­er­a­tion guidelines

habryka18 Feb 2018 3:22 UTC
42 points
74 comments2 min readLW link

Is skil­led hunt­ing un­eth­i­cal?

JamesFaville17 Feb 2018 18:48 UTC
6 points
18 comments12 min readLW link

Mis­sives from China

alkjash17 Feb 2018 12:30 UTC
12 points
4 comments1 min readLW link
(radimentary.wordpress.com)

In Defence of Con­flict Theory

Richard_Ngo17 Feb 2018 3:33 UTC
34 points
10 comments7 min readLW link
(thinkingcomplete.blogspot.co.uk)

Re­plac­ing ex­pen­sive costly signals

KatjaGrace17 Feb 2018 0:50 UTC
30 points
13 comments1 min readLW link
(meteuphoric.wordpress.com)

Circling

Unreal16 Feb 2018 23:26 UTC
73 points
275 comments9 min readLW link3 reviews

Bayes Rule Applied

Gordon Seidoh Worley16 Feb 2018 18:30 UTC
4 points
0 comments1 min readLW link
(towardsdatascience.com)

Clar­ify­ing the Post­mod­ernism De­bate With Skep­ti­cal Modernism

Chris_Leong16 Feb 2018 9:40 UTC
17 points
12 comments3 min readLW link

Con­fi­dence Confusion

alkjash16 Feb 2018 2:00 UTC
6 points
15 comments2 min readLW link
(radimentary.wordpress.com)

Toward a New Tech­ni­cal Ex­pla­na­tion of Tech­ni­cal Explanation

abramdemski16 Feb 2018 0:44 UTC
86 points
36 comments18 min readLW link1 review

Two Types of Updatelessness

abramdemski15 Feb 2018 20:19 UTC
23 points
17 comments1 min readLW link

Tune Your Cog­ni­tive Strategies

SquirrelInHell15 Feb 2018 17:54 UTC
40 points
6 comments1 min readLW link
(bewelltuned.com)

The law of effect, ran­dom­iza­tion and New­comb’s problem

Caspar Oesterheld15 Feb 2018 15:31 UTC
7 points
1 comment1 min readLW link
(casparoesterheld.com)

Sub­du­ing Moloch

Teja Prabhu14 Feb 2018 23:34 UTC
6 points
15 comments2 min readLW link

Ac­tive vs Pas­sive Distraction

squidious14 Feb 2018 22:14 UTC
26 points
2 comments3 min readLW link

More pre­cise re­gret bound for DRL

Vanessa Kosoy14 Feb 2018 11:58 UTC
1 point
0 comments9 min readLW link

Catas­tro­phe Miti­ga­tion Us­ing DRL (Ap­pen­dices)

Vanessa Kosoy14 Feb 2018 11:57 UTC
0 points
0 comments6 min readLW link

The Prin­ci­pled In­tel­li­gence Hypothesis

KatjaGrace14 Feb 2018 1:00 UTC
34 points
15 comments4 min readLW link
(meteuphoric.wordpress.com)

Hufflepuff Cyn­i­cism on Crocker’s Rule

abramdemski14 Feb 2018 0:52 UTC
16 points
11 comments2 min readLW link

Ra­tion­al­ist Lent

Qiaochu_Yuan13 Feb 2018 23:55 UTC
41 points
62 comments1 min readLW link

Figures!

Куля Ботаніки13 Feb 2018 21:43 UTC
3 points
0 comments1 min readLW link

Spam­ming Micro-In­ten­tions to Gen­er­ate Willpower

moridinamael13 Feb 2018 20:16 UTC
44 points
23 comments3 min readLW link

Hufflepuff Cynicism

abramdemski13 Feb 2018 2:15 UTC
25 points
17 comments6 min readLW link

A Proper Scor­ing Rule for Con­fi­dence Intervals

Scott Garrabrant13 Feb 2018 1:45 UTC
63 points
47 comments1 min readLW link

Open thread, Fe­bru­ary 2018

EnterUsername12 Feb 2018 21:10 UTC
11 points
22 comments1 min readLW link

Ra­tion­al­ity Feed: Last Month’s Best Posts

sapphire12 Feb 2018 13:18 UTC
23 points
1 comment3 min readLW link

Some con­cep­tual high­lights from “Disjunc­tive Sce­nar­ios of Catas­trophic AI Risk”

Kaj_Sotala12 Feb 2018 12:30 UTC
45 points
4 comments6 min readLW link
(kajsotala.fi)

“Just Suffer Un­til It Passes”

lionhearted (Sebastian Marshall)12 Feb 2018 4:01 UTC
89 points
26 comments1 min readLW link

Dis­cord Server Emoji: A Com­mu­nity Dialect

Adrian Smith11 Feb 2018 0:47 UTC
5 points
2 comments5 min readLW link

The Sig­nal and the Corrective

MalcolmOcean11 Feb 2018 0:28 UTC
21 points
3 comments1 min readLW link
(everythingstudies.com)

Eter­nal, and Hearth­stone Econ­omy ver­sus Magic Economy

Zvi10 Feb 2018 20:20 UTC
10 points
7 comments6 min readLW link
(thezvi.wordpress.com)

The end of pub­lic trans­porta­tion. The fu­ture of pub­lic trans­porta­tion.

mako yass9 Feb 2018 21:51 UTC
3 points
33 comments2 min readLW link

Sta­tus: Map and Territory

alkjash9 Feb 2018 19:10 UTC
19 points
3 comments1 min readLW link
(radimentary.wordpress.com)

Antiantinatalism

Jacob Falkovich9 Feb 2018 16:49 UTC
6 points
4 comments5 min readLW link

A Safer Or­a­cle Setup?

Ofer9 Feb 2018 12:16 UTC
5 points
4 comments4 min readLW link

“Backchain­ing” in Strategy

Davis_Kingsley9 Feb 2018 12:01 UTC
23 points
17 comments1 min readLW link

Fun­da­men­tal At­tri­bu­tion Er­ror and The Other

Adrian Smith9 Feb 2018 6:48 UTC
5 points
5 comments4 min readLW link

Stable Poin­t­ers to Value II: En­vi­ron­men­tal Goals

abramdemski9 Feb 2018 6:03 UTC
19 points
3 comments4 min readLW link

Knowl­edge is Freedom

Scott Garrabrant9 Feb 2018 5:24 UTC
32 points
16 comments6 min readLW link

Pop­u­lar re­li­gions sug­gest ex­trap­o­lated vo­li­tion is non-ex­is­tence and wireheading

denisbider9 Feb 2018 0:06 UTC
10 points
14 comments1 min readLW link

Science like a chef

Adam Zerner8 Feb 2018 21:23 UTC
32 points
17 comments3 min readLW link

Write a Thou­sand Roads to Rome

Screwtape8 Feb 2018 18:09 UTC
112 points
17 comments4 min readLW link

Men­tal TAPs

Logan Riggs8 Feb 2018 17:26 UTC
13 points
2 comments1 min readLW link

Two kinds of Agency

Elo8 Feb 2018 6:28 UTC
10 points
9 comments3 min readLW link