How the MtG Color Wheel Explains AI Safety

Scott Garrabrant · 15 Feb 2019 23:42 UTC
65 points
4 comments · 6 min read · LW link

Some disjunctive reasons for urgency on AI risk

Wei Dai · 15 Feb 2019 20:43 UTC
36 points
24 comments · 1 min read · LW link

So you want to be a wizard

NaiveTortoise · 15 Feb 2019 15:43 UTC
16 points
0 comments · 1 min read · LW link
(jvns.ca)

Cooperation is for Winners

Jacob Falkovich · 15 Feb 2019 14:58 UTC
21 points
6 comments · 4 min read · LW link

Quantifying anthropic effects on the Fermi paradox

Lukas Finnveden · 15 Feb 2019 10:51 UTC
29 points
5 comments · 27 min read · LW link

[Question] How does OpenAI’s language model affect our AI timeline estimates?

jimrandomh · 15 Feb 2019 3:11 UTC
50 points
7 comments · 1 min read · LW link

Has The Function To Sort Posts By Votes Stopped Working?

Capybasilisk · 14 Feb 2019 19:14 UTC
1 point
3 comments · 1 min read · LW link

[Question] Who owns OpenAI’s new language model?

ioannes · 14 Feb 2019 17:51 UTC
16 points
9 comments · 1 min read · LW link

The Prediction Pyramid: Why Fundamental Work is Needed for Prediction Work

ozziegooen · 14 Feb 2019 16:21 UTC
43 points
15 comments · 3 min read · LW link

Short story: An AGI’s Repugnant Physics Experiment

ozziegooen · 14 Feb 2019 14:46 UTC
9 points
5 comments · 1 min read · LW link

New York Restaurants I Love: Breakfast

Zvi · 14 Feb 2019 13:10 UTC
10 points
3 comments · 8 min read · LW link
(thezvi.wordpress.com)

[Question] Are there documentaries on rationality?

Yoav Ravid · 14 Feb 2019 11:34 UTC
12 points
5 comments · 1 min read · LW link

Alignment Newsletter #45

Rohin Shah · 14 Feb 2019 2:10 UTC
25 points
2 comments · 8 min read · LW link
(mailchi.mp)

Three Kinds of Research Documents: Exploration, Explanation, Academic

ozziegooen · 13 Feb 2019 21:25 UTC
22 points
18 comments · 3 min read · LW link

Humans interpreting humans

Stuart_Armstrong · 13 Feb 2019 19:03 UTC
12 points
1 comment · 2 min read · LW link

Anchoring vs Taste: a model

Stuart_Armstrong · 13 Feb 2019 19:03 UTC
10 points
0 comments · 2 min read · LW link

[Question] Individual profit-sharing?

ioannes · 13 Feb 2019 17:58 UTC
10 points
8 comments · 1 min read · LW link

The RAIN Framework for Informational Effectiveness

ozziegooen · 13 Feb 2019 12:54 UTC
37 points
16 comments · 6 min read · LW link

On Long and Insightful Posts

Qria · 13 Feb 2019 3:52 UTC
19 points
3 comments · 1 min read · LW link

Layers of Expertise and the Curse of Curiosity

Gyrodiot · 12 Feb 2019 23:41 UTC
19 points
1 comment · 6 min read · LW link

Nuances with ascription universality

evhub · 12 Feb 2019 23:38 UTC
20 points
1 comment · 2 min read · LW link

Learning preferences by looking at the world

Rohin Shah · 12 Feb 2019 22:25 UTC
43 points
10 comments · 7 min read · LW link
(bair.berkeley.edu)

Functional silence: communication that minimizes change of receiver’s beliefs

chaosmage · 12 Feb 2019 21:32 UTC
27 points
5 comments · 2 min read · LW link

Arguments for moral indefinability

Richard_Ngo · 12 Feb 2019 10:40 UTC
50 points
10 comments · 7 min read · LW link
(thinkingcomplete.blogspot.com)

Art: A Rationalist’s Take?

schrodingart · 12 Feb 2019 5:07 UTC
2 points
4 comments · 6 min read · LW link

Language, the Key to Everything

chris82179 · 12 Feb 2019 5:06 UTC
−2 points
2 comments · 4 min read · LW link

Triangle SSC Meetup-February

willbobaggins · 12 Feb 2019 3:07 UTC
1 point
0 comments · 1 min read · LW link

Would I think for ten thousand years?

Stuart_Armstrong · 11 Feb 2019 19:37 UTC
25 points
13 comments · 1 min read · LW link

“Normative assumptions” need not be complex

Stuart_Armstrong · 11 Feb 2019 19:03 UTC
11 points
0 comments · 2 min read · LW link

Emotional Climate Change—an inconvenient idea

marcus_gabler · 11 Feb 2019 17:55 UTC
−30 points
8 comments · 2 min read · LW link

Coherent behaviour in the real world is an incoherent concept

Richard_Ngo · 11 Feb 2019 17:00 UTC
51 points
17 comments · 9 min read · LW link

[Question] Why do you reject negative utilitarianism?

Teo Ajantaival · 11 Feb 2019 15:38 UTC
32 points
27 comments · 1 min read · LW link

[Question] How important is it that LW has an unlimited supply of karma?

jacobjacob · 11 Feb 2019 1:41 UTC
27 points
9 comments · 2 min read · LW link

Minimize Use of Standard Internet Food Delivery

Zvi · 10 Feb 2019 19:50 UTC
−18 points
28 comments · 2 min read · LW link
(thezvi.wordpress.com)

Propositional Logic, Syntactic Implication

Donald Hobson · 10 Feb 2019 18:12 UTC
5 points
1 comment · 1 min read · LW link

Fighting the allure of depressive realism

aaq · 10 Feb 2019 16:46 UTC
19 points
2 comments · 3 min read · LW link

Structured Concurrency Cross-language Forum

Martin Sustrik · 10 Feb 2019 9:20 UTC
12 points
0 comments · 1 min read · LW link
(250bpm.com)

Probability space has 2 metrics

Donald Hobson · 10 Feb 2019 0:28 UTC
88 points
11 comments · 1 min read · LW link

Some Thoughts on Metaphilosophy

Wei Dai · 10 Feb 2019 0:28 UTC
76 points
30 comments · 4 min read · LW link

The Argument from Philosophical Difficulty

Wei Dai · 10 Feb 2019 0:28 UTC
59 points
31 comments · 1 min read · LW link

Dojo on stress

Elo · 9 Feb 2019 22:49 UTC
13 points
0 comments · 4 min read · LW link

[Question] When should we expect the education bubble to pop? How can we short it?

jacobjacob · 9 Feb 2019 21:39 UTC
35 points
12 comments · 1 min read · LW link

The Cake is a Lie, Part 2.

IncomprehensibleMane · 9 Feb 2019 20:07 UTC
−27 points
7 comments · 9 min read · LW link

The Case for a Bigger Audience

John_Maxwell · 9 Feb 2019 7:22 UTC
68 points
58 comments · 2 min read · LW link

[Question] Can someone design this Google Sheets bug list template for me?

Bae's Theorem · 9 Feb 2019 6:55 UTC
4 points
4 comments · 1 min read · LW link

Reinforcement Learning in the Iterated Amplification Framework

William_S · 9 Feb 2019 0:56 UTC
25 points
12 comments · 4 min read · LW link

HCH is not just Mechanical Turk

William_S · 9 Feb 2019 0:46 UTC
42 points
6 comments · 3 min read · LW link

Friendly SSC and LW meetup

Sean Aubin · 9 Feb 2019 0:20 UTC
1 point
0 comments · 1 min read · LW link

The Hamming Question

Raemon · 8 Feb 2019 19:34 UTC
59 points
38 comments · 3 min read · LW link

Make an appointment with your saner self

MalcolmOcean · 8 Feb 2019 5:05 UTC
28 points
0 comments · 4 min read · LW link