IRL 4/8: Maximum Entropy IRL and Bayesian IRL

RAISE · 25 Mar 2019 22:07 UTC
4 points
0 comments · 1 min read · LW link
(app.grasple.com)

If you’ve attended LW/SSC meetups, please take this survey!

mingyuan · 25 Mar 2019 21:48 UTC
8 points
2 comments · 1 min read · LW link

To perform best at work, look at Time & Energy account balance

SerenaTan19 · 25 Mar 2019 19:37 UTC
9 points
0 comments · 2 min read · LW link

Edinburgh SSC meetup

Hamish Peter Todd · 25 Mar 2019 16:49 UTC
1 point
0 comments · 1 min read · LW link

Subagents, akrasia, and coherence in humans

Kaj_Sotala · 25 Mar 2019 14:24 UTC
139 points
31 comments · 16 min read · LW link

The Amish, and Strategic Norms around Technology

Raemon · 24 Mar 2019 22:16 UTC
138 points
18 comments · 3 min read · LW link · 2 reviews

[Question] Did the recent blackmail discussion change your beliefs?

Dagon · 24 Mar 2019 16:06 UTC
36 points
7 comments · 1 min read · LW link

The Politics of Age (the Young vs. the Old)

Martin Sustrik · 24 Mar 2019 6:40 UTC
16 points
17 comments · 1 min read · LW link
(250bpm.com)

Why the AI Alignment Problem Might be Unsolvable?

Sailor Vulcan · 24 Mar 2019 4:10 UTC
4 points
15 comments · 7 min read · LW link

A Tale of Four Moralities

Sailor Vulcan · 24 Mar 2019 3:46 UTC
13 points
9 comments · 4 min read · LW link

800 scientist call out against statistical significance

Yoav Ravid · 23 Mar 2019 12:57 UTC
10 points
1 comment · 1 min read · LW link
(www.nature.com)

[Question] Willing to share some words that changed your beliefs/behavior?

Duncan Sabien (Deactivated) · 23 Mar 2019 2:08 UTC
28 points
4 comments · 1 min read · LW link

[Question] Can Bayes theorem represent infinite confusion?

Yoav Ravid · 22 Mar 2019 18:02 UTC
4 points
13 comments · 1 min read · LW link

The Game Theory of Blackmail

Linda Linsefors · 22 Mar 2019 17:44 UTC
25 points
17 comments · 4 min read · LW link

New Entry at the Stanford Encyclopedia of Philosophy on the Pragmatic Theory of Truth

Iwan Danilo · 22 Mar 2019 17:39 UTC
−3 points
1 comment · 1 min read · LW link
(plato.stanford.edu)

South Bay SSC Meetup

David Friedman · 22 Mar 2019 3:10 UTC
2 points
0 comments · 1 min read · LW link

Retrospective on a quantitative productivity logging attempt

femtogrammar · 22 Mar 2019 2:31 UTC
25 points
5 comments · 3 min read · LW link

Declarative Mathematics

johnswentworth · 21 Mar 2019 19:05 UTC
59 points
10 comments · 3 min read · LW link

The Main Sources of AI Risk?

21 Mar 2019 18:28 UTC
121 points
26 comments · 2 min read · LW link

[Link] IDA 9/14: The Scheme

RAISE · 21 Mar 2019 18:28 UTC
4 points
0 comments · 1 min read · LW link

[Question] What should we expect from GPT-3?

avturchin · 21 Mar 2019 14:28 UTC
22 points
2 comments · 1 min read · LW link

[Question] Tracking accuracy of personal forecasts

CheerfulWarrior · 20 Mar 2019 20:39 UTC
8 points
14 comments · 1 min read · LW link

Criticism catalyzes analytical thinking in groups

rayraegah · 20 Mar 2019 16:27 UTC
10 points
0 comments · 1 min read · LW link

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander230 · 20 Mar 2019 13:52 UTC
3 points
0 comments · 1 min read · LW link

Moscow LW meetup in “Nauchka” library

Alexander230 · 20 Mar 2019 13:49 UTC
3 points
0 comments · 1 min read · LW link

[Question] What’s wrong with these analogies for understanding Informed Oversight and IDA?

Wei Dai · 20 Mar 2019 9:11 UTC
35 points
3 comments · 1 min read · LW link

Alignment Newsletter #49

Rohin Shah · 20 Mar 2019 4:20 UTC
23 points
1 comment · 11 min read · LW link
(mailchi.mp)

Some thoughts after reading Artificial Intelligence: A Modern Approach

swift_spiral · 19 Mar 2019 23:39 UTC
38 points
4 comments · 2 min read · LW link

Rest Days vs Recovery Days

Unreal · 19 Mar 2019 22:37 UTC
215 points
36 comments · 6 min read · LW link · 1 review

Partial preferences and models

Stuart_Armstrong · 19 Mar 2019 16:29 UTC
12 points
9 comments · 2 min read · LW link

IRL 3/8: Mitigating degeneracy: feature matching

RAISE · 18 Mar 2019 20:15 UTC
6 points
0 comments · 1 min read · LW link
(app.grasple.com)

[Question] Is there a difference between uncertainty over your utility function and uncertainty over outcomes?

Chris_Leong · 18 Mar 2019 18:41 UTC
14 points
12 comments · 1 min read · LW link

Ideas for a fact checking widget

Yoav Ravid · 18 Mar 2019 14:25 UTC
9 points
4 comments · 1 min read · LW link

Implications of living within a Simulation

Tater · 18 Mar 2019 6:22 UTC
1 point
7 comments · 2 min read · LW link

What failure looks like

paulfchristiano · 17 Mar 2019 20:18 UTC
417 points
54 comments · 8 min read · LW link · 2 reviews

Cryopreservation of Valia Zeldin

avturchin · 17 Mar 2019 19:15 UTC
19 points
0 comments · 1 min read · LW link
(medium.com)

Insights from Munkres’ Topology

Rafael Harth · 17 Mar 2019 16:52 UTC
30 points
0 comments · 14 min read · LW link

Motivational Meeting Place

Vincent B · 17 Mar 2019 16:17 UTC
8 points
1 comment · 3 min read · LW link

[Question] Ask LW: Have you read Yudkowsky’s AI to Zombie book?

CaiwitzAzaria · 17 Mar 2019 13:31 UTC
10 points
20 comments · 1 min read · LW link

[Question] What societies have ever had legal or accepted blackmail?

clone of saturn · 17 Mar 2019 9:16 UTC
33 points
23 comments · 1 min read · LW link

[Question] How large is the fallout area of the biggest cobalt bomb we can build?

habryka · 17 Mar 2019 5:50 UTC
20 points
8 comments · 1 min read · LW link

A cognitive intervention for wrist pain

rmoehn · 17 Mar 2019 5:26 UTC
28 points
24 comments · 6 min read · LW link

Has “politics is the mind-killer” been a mind-killer?

SonnieBailey · 17 Mar 2019 3:05 UTC
31 points
26 comments · 3 min read · LW link

Comparison of decision theories (with a focus on logical-counterfactual decision theories)

riceissa · 16 Mar 2019 21:15 UTC
78 points
11 comments · 10 min read · LW link

Terrorism and Russell’s love of excitement

CaiwitzAzaria · 16 Mar 2019 6:53 UTC
−9 points
0 comments · 1 min read · LW link

Boeing 737 MAX MCAS as an agent corrigibility failure

Shmi · 16 Mar 2019 1:46 UTC
60 points
3 comments · 1 min read · LW link

Humans aren’t agents—what then for value learning?

Charlie Steiner · 15 Mar 2019 22:01 UTC
28 points
14 comments · 3 min read · LW link

Privacy

Zvi · 15 Mar 2019 20:20 UTC
79 points
78 comments · 6 min read · LW link
(thezvi.wordpress.com)

Active Curiosity vs Open Curiosity

Unreal · 15 Mar 2019 16:54 UTC
76 points
24 comments · 3 min read · LW link

IDA 5-8/14: Approval Directed Agents

RAISE · 14 Mar 2019 23:58 UTC
4 points
0 comments · 1 min read · LW link
(app.grasple.com)