Build­ing up to an In­ter­nal Fam­ily Sys­tems model

Kaj_Sotala26 Jan 2019 12:25 UTC
281 points
86 comments28 min readLW link2 reviews

The 3 Books Tech­nique for Learn­ing a New Skilll

Matt Goldenberg9 Jan 2019 12:45 UTC
206 points
48 comments2 min readLW link

Book Sum­mary: Con­scious­ness and the Brain

Kaj_Sotala16 Jan 2019 14:43 UTC
170 points
20 comments26 min readLW link1 review

Some Thoughts on My Psy­chi­a­try Practice

Laura B16 Jan 2019 23:16 UTC
154 points
43 comments4 min readLW link2 reviews

From Per­sonal to Pri­son Gangs: En­forc­ing Proso­cial Behavior

johnswentworth24 Jan 2019 18:07 UTC
153 points
26 comments5 min readLW link2 reviews

Disen­tan­gling ar­gu­ments for the im­por­tance of AI safety

Richard_Ngo21 Jan 2019 12:41 UTC
133 points
23 comments8 min readLW link

Se­quence in­tro­duc­tion: non-agent and mul­ti­a­gent mod­els of mind

Kaj_Sotala7 Jan 2019 14:12 UTC
123 points
16 comments7 min readLW link1 review

Refram­ing Su­per­in­tel­li­gence: Com­pre­hen­sive AI Ser­vices as Gen­eral Intelligence

Rohin Shah8 Jan 2019 7:12 UTC
122 points
77 comments5 min readLW link2 reviews
(www.fhi.ox.ac.uk)

S-Curves for Trend Forecasting

Matt Goldenberg23 Jan 2019 18:17 UTC
113 points
23 comments7 min readLW link4 reviews

Book Re­view: The Struc­ture Of Scien­tific Revolutions

Scott Alexander9 Jan 2019 7:10 UTC
104 points
30 comments19 min readLW link1 review
(slatestarcodex.com)

[Question] Why is so much dis­cus­sion hap­pen­ing in pri­vate Google Docs?

Wei Dai12 Jan 2019 2:19 UTC
101 points
22 comments1 min readLW link

[Question] What are the open prob­lems in Hu­man Ra­tion­al­ity?

Raemon13 Jan 2019 4:46 UTC
99 points
93 comments1 min readLW link3 reviews

Less Com­pe­ti­tion, More Mer­i­toc­racy?

Zvi20 Jan 2019 2:00 UTC
85 points
19 comments20 min readLW link3 reviews
(thezvi.wordpress.com)

An­nounce­ment: AI al­ign­ment prize round 4 winners

cousin_it20 Jan 2019 14:46 UTC
74 points
41 comments1 min readLW link

Com­ments on CAIS

Richard_Ngo12 Jan 2019 15:20 UTC
71 points
14 comments7 min readLW link

Strat­egy is the De­con­fu­sion of Action

ryan_b2 Jan 2019 20:56 UTC
69 points
4 comments6 min readLW link

AI safety with­out goal-di­rected behavior

Rohin Shah7 Jan 2019 7:48 UTC
68 points
15 comments4 min readLW link

Com­bat vs Nur­ture & Meta-Contrarianism

abramdemski10 Jan 2019 23:17 UTC
66 points
12 comments4 min readLW link

“AlphaS­tar: Mas­ter­ing the Real-Time Strat­egy Game StarCraft II”, Deep­Mind [won 10 of 11 games against hu­man pros]

gwern24 Jan 2019 20:49 UTC
62 points
52 comments1 min readLW link
(deepmind.com)

[Question] Does anti-malaria char­ity de­stroy the lo­cal anti-malaria in­dus­try?

Viliam5 Jan 2019 19:04 UTC
61 points
16 comments1 min readLW link

Will hu­mans build goal-di­rected agents?

Rohin Shah5 Jan 2019 1:33 UTC
61 points
43 comments5 min readLW link

The Re­la­tion­ship Between Hier­ar­chy and Wealth

sarahconstantin23 Jan 2019 2:00 UTC
59 points
8 comments12 min readLW link
(srconstantin.wordpress.com)

Two More De­ci­sion The­ory Prob­lems for Humans

Wei Dai4 Jan 2019 9:00 UTC
56 points
14 comments2 min readLW link

Me­gapro­ject management

ryan_b11 Jan 2019 17:08 UTC
55 points
11 comments5 min readLW link1 review

Mas­culine Virtues

Jacob Falkovich30 Jan 2019 16:03 UTC
52 points
32 comments13 min readLW link

What AI Safety Re­searchers Have Writ­ten About the Na­ture of Hu­man Values

avturchin16 Jan 2019 13:59 UTC
52 points
3 comments15 min readLW link

Learn­ing-In­ten­tions vs Do­ing-In­ten­tions

Ruby1 Jan 2019 22:22 UTC
51 points
14 comments4 min readLW link

Non-Con­se­quen­tial­ist Co­op­er­a­tion?

abramdemski11 Jan 2019 9:15 UTC
50 points
15 comments7 min readLW link

Direc­tions and desider­ata for AI alignment

paulfchristiano13 Jan 2019 7:47 UTC
48 points
1 comment14 min readLW link

Book Tril­ogy Re­view: Re­mem­brance of Earth’s Past (The Three Body Prob­lem)

Zvi30 Jan 2019 1:10 UTC
48 points
15 comments40 min readLW link
(thezvi.wordpress.com)

Book Recom­men­da­tions: An Every­one Cul­ture and Mo­ral Mazes

sarahconstantin10 Jan 2019 21:40 UTC
45 points
13 comments3 min readLW link
(srconstantin.wordpress.com)

A Frame­work for In­ter­nal Debugging

Matt Goldenberg16 Jan 2019 16:04 UTC
44 points
3 comments5 min readLW link

Op­ti­miz­ing for Sto­ries (vs Op­ti­miz­ing Real­ity)

Ruby7 Jan 2019 8:03 UTC
43 points
11 comments7 min readLW link

And My Ax­iom! In­sights from ‘Com­putabil­ity and Logic’

TurnTrout16 Jan 2019 19:48 UTC
42 points
17 comments8 min readLW link

CDT=EDT=UDT

abramdemski13 Jan 2019 23:46 UTC
39 points
16 comments12 min readLW link

An­throp­ics is pretty normal

Stuart_Armstrong17 Jan 2019 13:26 UTC
39 points
9 comments8 min readLW link

Can there be an in­de­scrib­able hel­l­world?

Stuart_Armstrong29 Jan 2019 15:00 UTC
39 points
19 comments2 min readLW link

I want it my way!

nickhayes4 Jan 2019 18:08 UTC
39 points
2 comments9 min readLW link

Too Smart for My Own Good

isovector22 Jan 2019 17:51 UTC
38 points
4 comments3 min readLW link

Lit­tle­wood’s Law and the Global Media

gwern12 Jan 2019 17:46 UTC
37 points
3 comments1 min readLW link
(www.gwern.net)

[Question] What are ques­tions?

Pee Doom9 Jan 2019 7:37 UTC
35 points
17 comments2 min readLW link

Hu­man-AI Interaction

Rohin Shah15 Jan 2019 1:57 UTC
34 points
10 comments4 min readLW link

AlphaGo Zero and ca­pa­bil­ity amplification

paulfchristiano9 Jan 2019 0:40 UTC
33 points
23 comments2 min readLW link

[Question] What is a rea­son­able out­side view for the fate of so­cial move­ments?

jacobjacob4 Jan 2019 0:21 UTC
33 points
27 comments1 min readLW link

Start­ing to see 2 months later

Pausecafe23 Jan 2019 20:46 UTC
32 points
3 comments2 min readLW link

Disad­van­tages of Card Rebalancing

Zvi6 Jan 2019 23:30 UTC
32 points
5 comments18 min readLW link
(thezvi.wordpress.com)

Align­ment Newslet­ter #39

Rohin Shah1 Jan 2019 8:10 UTC
32 points
2 comments5 min readLW link
(mailchi.mp)

[Question] What ex­er­cises go best with 3 blue 1 brown’s Lin­ear Alge­bra videos?

Raemon1 Jan 2019 21:29 UTC
31 points
12 comments1 min readLW link

Fol­low­ing hu­man norms

Rohin Shah20 Jan 2019 23:59 UTC
30 points
10 comments5 min readLW link

Thoughts on re­ward en­g­ineer­ing

paulfchristiano24 Jan 2019 20:15 UTC
30 points
30 comments11 min readLW link