All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 201720182019 2020 2021 2022 2023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30 31

Melatonin: Much More Than You Wanted To Know

Scott AlexanderJul 11, 2018, 5:40 PM

123 points

16 comments15 min readLW link

(slatestarcodex.com)

Monk Treehouse: some problems defining simulation

dranorterJul 11, 2018, 7:35 AM

6 points

1 comment5 min readLW link

Mathematical Mindset

komponistoJul 11, 2018, 3:03 AM

54 points

5 comments2 min readLW link

Decision-theoretic problems and Theories; An (Incomplete) comparative list

somervtaJul 11, 2018, 2:59 AM

36 points

0 comments1 min readLW link

(docs.google.com)

Agents That Learn From Human Behavior Can’t Learn Human Values That Humans Haven’t Learned Yet

steven0461Jul 11, 2018, 2:59 AM

28 points

11 comments1 min readLW link

On the Role of Counterfactuals in Learning

Max KanwalJul 11, 2018, 2:45 AM

11 points

2 comments3 min readLW link

Clarifying Consequentialists in the Solomonoff Prior

Vlad MikulikJul 11, 2018, 2:35 AM

20 points

16 comments6 min readLW link

Complete Class: Consequentialist Foundations

abramdemskiJul 11, 2018, 1:57 AM

58 points

37 comments13 min readLW link

Conditions under which misaligned subagents can (not) arise in classifiers

anon1Jul 11, 2018, 1:52 AM

12 points

2 comments2 min readLW link

No, I won’t go there, it feels like you’re trying to Pascal-mug me

RupertJul 11, 2018, 1:37 AM

9 points

0 comments2 min readLW link

Conceptual problems with utility functions

DacynJul 11, 2018, 1:29 AM

22 points

12 comments2 min readLW link

Dependent Type Theory and Zero-Shot Reasoning

evhubJul 11, 2018, 1:16 AM

27 points

3 comments5 min readLW link

A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment

AlexMennenJul 11, 2018, 1:03 AM

40 points

1 comment1 min readLW link

Bounding Goodhart’s Law

eric_langloisJul 11, 2018, 12:46 AM

43 points

2 comments5 min readLW link

Mechanistic Transparency for Machine Learning

DanielFilanJul 11, 2018, 12:34 AM

55 points

9 comments4 min readLW link

An environment for studying counterfactuals

NisanJul 11, 2018, 12:14 AM

15 points

6 comments3 min readLW link

A universal score for optimizers

levinJul 10, 2018, 11:52 PM

15 points

8 comments3 min readLW link

Bayesian Probability is for things that are Space-like Separated from You

Scott GarrabrantJul 10, 2018, 11:47 PM

87 points

22 comments2 min readLW link

Alignment problems for economists

Chris van MerwijkJul 10, 2018, 11:43 PM

5 points

2 comments2 min readLW link

Non-resolve as Resolve

Linda LinseforsJul 10, 2018, 11:31 PM

15 points

1 comment2 min readLW link

A framework for thinking about wireheading

theotherotheralexJul 10, 2018, 11:14 PM

15 points

4 comments1 min readLW link

Logical Uncertainty and Functional Decision Theory

swordsintoploughsharesJul 10, 2018, 11:08 PM

15 points

4 comments2 min readLW link

Repeated (and improved) Sleeping Beauty problem

Linda LinseforsJul 10, 2018, 10:32 PM

12 points

5 comments2 min readLW link

Probability is fake, frequency is real

Linda LinseforsJul 10, 2018, 10:32 PM

12 points

7 comments1 min readLW link

Conditioning, Counterfactuals, Exploration, and Gears

DiffractorJul 10, 2018, 10:11 PM

28 points

1 comment5 min readLW link

Two agents can have the same source code and optimise different utility functions

Joar SkalseJul 10, 2018, 9:51 PM

11 points

11 comments1 min readLW link

The Intentional Agency Experiment

Alexander Gietelink OldenzielJul 10, 2018, 8:32 PM

13 points

5 comments3 min readLW link

Announcing AlignmentForum.org Beta

RaemonJul 10, 2018, 8:19 PM

68 points

35 comments2 min readLW link

Choosing to Choose?

Daniel HerrmannJul 10, 2018, 8:15 PM

10 points

7 comments5 min readLW link

Study on what makes people approve or condemn mind upload technology; references LW

Kaj_SotalaJul 10, 2018, 5:14 PM

22 points

0 comments2 min readLW link

(www.nature.com)

How to parent more predictably

jefftkJul 10, 2018, 3:18 PM

78 points

1 comment4 min readLW link

Open Thread July 2018

nullJul 10, 2018, 2:51 PM

10 points

9 comments1 min readLW link

Three anchorings: number, attitude, and taste

Stuart_ArmstrongJul 10, 2018, 2:21 PM

14 points

4 comments2 min readLW link

The Dilemma of Worse Than Death Scenarios

arkaeikJul 10, 2018, 9:18 AM

14 points

18 comments4 min readLW link

Newcomb’s Problem In One Paragraph

Chris_LeongJul 10, 2018, 7:10 AM

7 points

0 comments1 min readLW link

Letting Go III: Unilateral or GTFO

johnswentworthJul 10, 2018, 6:26 AM

21 points

3 comments2 min readLW link

Sydney Rationality Dojo—December

NextJul 10, 2018, 4:22 AM

1 point

0 comments1 min readLW link

Sydney Rationality Dojo—November

NextJul 10, 2018, 4:20 AM

1 point

0 comments1 min readLW link

Sydney Rationality Dojo—October

NextJul 10, 2018, 4:19 AM

1 point

0 comments1 min readLW link

Sydney Rationality Dojo—September

NextJul 10, 2018, 4:12 AM

1 point

0 comments1 min readLW link

Sydney Rationality Dojo—August

NextJul 10, 2018, 4:04 AM

1 point

0 comments1 min readLW link

Context Windows: A Model of Unproductive Disagreement

Zachary JacobiJul 10, 2018, 1:40 AM

4 points

2 comments5 min readLW link

Fundamentals of Formalisation Level 5: Formal Proof

philip_bJul 9, 2018, 8:55 PM

13 points

0 comments1 min readLW link

RAISE is looking for full-time content developers

nullJul 9, 2018, 5:01 PM

22 points

5 comments1 min readLW link

Alignment Newsletter #14

Rohin Shah9 Jul 2018 16:20 UTC

14 points

0 comments9 min readLW link

(mailchi.mp)

Math: Textbooks and the DTP pipeline

Andrew Quinn9 Jul 2018 15:09 UTC

12 points

3 comments2 min readLW link

The Craft And The Codex

Paperclip Minimizer9 Jul 2018 10:50 UTC

12 points

7 comments LW link

(slatestarcodex.com)

The Fermi Paradox: What did Sandberg, Drexler and Ord Really Dissolve?

Shmi8 Jul 2018 21:18 UTC

47 points

28 comments5 min readLW link

An Exercise in Applied Rationality: A New Apartment

Sable8 Jul 2018 21:18 UTC

8 points

9 comments1 min readLW link

Estimating the consequences of device detection tech

Jsevillamol8 Jul 2018 18:25 UTC

27 points

4 comments7 min readLW link