IRL 4/8: Maximum Entropy IRL and Bayesian IRL

RAISE 25 Mar 2019 22:07 UTC

4 points

0 comments 1 min read LW link

(app.grasple.com)

If you’ve attended LW/SSC meetups, please take this survey!

mingyuan 25 Mar 2019 21:48 UTC

8 points

2 comments 1 min read LW link

To perform best at work, look at Time & Energy account balance

SerenaTan19 25 Mar 2019 19:37 UTC

9 points

0 comments 2 min read LW link

Edinburgh SSC meetup

Hamish Peter Todd 25 Mar 2019 16:49 UTC

1 point

0 comments 1 min read LW link

Subagents, akrasia, and coherence in humans

Kaj_Sotala 25 Mar 2019 14:24 UTC

139 points

31 comments 16 min read LW link

The Amish, and Strategic Norms around Technology

Raemon 24 Mar 2019 22:16 UTC

138 points

18 comments 3 min read LW link 2 reviews

[Question] Did the recent blackmail discussion change your beliefs?

Dagon 24 Mar 2019 16:06 UTC

36 points

7 comments 1 min read LW link

The Politics of Age (the Young vs. the Old)

Martin Sustrik 24 Mar 2019 6:40 UTC

16 points

17 comments 1 min read LW link

(250bpm.com)

Why the AI Alignment Problem Might be Unsolvable?

Sailor Vulcan 24 Mar 2019 4:10 UTC

4 points

15 comments 7 min read LW link

A Tale of Four Moralities

Sailor Vulcan 24 Mar 2019 3:46 UTC

13 points

9 comments 4 min read LW link

800 scientist call out against statistical significance

Yoav Ravid 23 Mar 2019 12:57 UTC

10 points

1 comment 1 min read LW link

(www.nature.com)

[Question] Willing to share some words that changed your beliefs/behavior?

Duncan Sabien (Deactivated) 23 Mar 2019 2:08 UTC

28 points

4 comments 1 min read LW link

[Question] Can Bayes theorem represent infinite confusion?

Yoav Ravid 22 Mar 2019 18:02 UTC

4 points

13 comments 1 min read LW link

The Game Theory of Blackmail

Linda Linsefors 22 Mar 2019 17:44 UTC

25 points

17 comments 4 min read LW link

New Entry at the Stanford Encyclopedia of Philosophy on the Pragmatic Theory of Truth

Iwan Danilo 22 Mar 2019 17:39 UTC

−3 points

1 comment 1 min read LW link

(plato.stanford.edu)

South Bay SSC Meetup

David Friedman 22 Mar 2019 3:10 UTC

2 points

0 comments 1 min read LW link

Retrospective on a quantitative productivity logging attempt

femtogrammar 22 Mar 2019 2:31 UTC

25 points

5 comments 3 min read LW link

Declarative Mathematics

johnswentworth 21 Mar 2019 19:05 UTC

59 points

10 comments 3 min read LW link

The Main Sources of AI Risk?

Daniel Kokotajlo and Wei Dai

21 Mar 2019 18:28 UTC

121 points

26 comments 2 min read LW link

[Link] IDA 9/14: The Scheme

RAISE 21 Mar 2019 18:28 UTC

4 points

0 comments 1 min read LW link

[Question] What should we expect from GPT-3?

avturchin 21 Mar 2019 14:28 UTC

22 points

2 comments 1 min read LW link

[Question] Tracking accuracy of personal forecasts

CheerfulWarrior 20 Mar 2019 20:39 UTC

8 points

14 comments 1 min read LW link

Criticism catalyzes analytical thinking in groups

rayraegah 20 Mar 2019 16:27 UTC

10 points

0 comments 1 min read LW link

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander230 20 Mar 2019 13:52 UTC

3 points

0 comments 1 min read LW link

Moscow LW meetup in “Nauchka” library

Alexander230 20 Mar 2019 13:49 UTC

3 points

0 comments 1 min read LW link

[Question] What’s wrong with these analogies for understanding Informed Oversight and IDA?

Wei Dai 20 Mar 2019 9:11 UTC

35 points

3 comments 1 min read LW link

Alignment Newsletter #49

Rohin Shah 20 Mar 2019 4:20 UTC

23 points

1 comment 11 min read LW link

(mailchi.mp)

Some thoughts after reading Artificial Intelligence: A Modern Approach

swift_spiral 19 Mar 2019 23:39 UTC

38 points

4 comments 2 min read LW link

Rest Days vs Recovery Days

Unreal 19 Mar 2019 22:37 UTC

215 points

36 comments 6 min read LW link 1 review

Partial preferences and models

Stuart_Armstrong 19 Mar 2019 16:29 UTC

12 points

9 comments 2 min read LW link

IRL 3/8: Mitigating degeneracy: feature matching

RAISE 18 Mar 2019 20:15 UTC

6 points

0 comments 1 min read LW link

(app.grasple.com)

[Question] Is there a difference between uncertainty over your utility function and uncertainty over outcomes?

Chris_Leong 18 Mar 2019 18:41 UTC

14 points

12 comments 1 min read LW link

Ideas for a fact checking widget

Yoav Ravid 18 Mar 2019 14:25 UTC

9 points

4 comments 1 min read LW link

Implications of living within a Simulation

Tater 18 Mar 2019 6:22 UTC

1 point

7 comments 2 min read LW link

What failure looks like

paulfchristiano 17 Mar 2019 20:18 UTC

417 points

54 comments 8 min read LW link 2 reviews

Cryopreservation of Valia Zeldin

avturchin 17 Mar 2019 19:15 UTC

19 points

0 comments 1 min read LW link

(medium.com)

Insights from Munkres’ Topology

Rafael Harth 17 Mar 2019 16:52 UTC

30 points

0 comments 14 min read LW link

Motivational Meeting Place

Vincent B 17 Mar 2019 16:17 UTC

8 points

1 comment 3 min read LW link

[Question] Ask LW: Have you read Yudkowsky’s AI to Zombie book?

CaiwitzAzaria 17 Mar 2019 13:31 UTC

10 points

20 comments 1 min read LW link

[Question] What societies have ever had legal or accepted blackmail?

clone of saturn 17 Mar 2019 9:16 UTC

33 points

23 comments 1 min read LW link

[Question] How large is the fallout area of the biggest cobalt bomb we can build?

habryka 17 Mar 2019 5:50 UTC

20 points

8 comments 1 min read LW link

A cognitive intervention for wrist pain

rmoehn 17 Mar 2019 5:26 UTC

28 points

24 comments 6 min read LW link

Has “politics is the mind-killer” been a mind-killer?

SonnieBailey 17 Mar 2019 3:05 UTC

31 points

26 comments 3 min read LW link

Comparison of decision theories (with a focus on logical-counterfactual decision theories)

riceissa 16 Mar 2019 21:15 UTC

78 points

11 comments 10 min read LW link

Terrorism and Russell’s love of excitement

CaiwitzAzaria 16 Mar 2019 6:53 UTC

−9 points

0 comments 1 min read LW link

Boeing 737 MAX MCAS as an agent corrigibility failure

Shmi 16 Mar 2019 1:46 UTC

60 points

3 comments 1 min read LW link

Humans aren’t agents—what then for value learning?

Charlie Steiner 15 Mar 2019 22:01 UTC

28 points

14 comments 3 min read LW link

Privacy

Zvi 15 Mar 2019 20:20 UTC

79 points

78 comments 6 min read LW link

(thezvi.wordpress.com)

Active Curiosity vs Open Curiosity

Unreal 15 Mar 2019 16:54 UTC

76 points

24 comments 3 min read LW link

IDA 5-8/14: Approval Directed Agents

RAISE 14 Mar 2019 23:58 UTC

4 points

0 comments 1 min read LW link

(app.grasple.com)