All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 201720182019 2020 2021 2022 2023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30 31

Look Under the Light Post

Gordon Seidoh WorleyJul 16, 2018, 10:19 PM

22 points

8 comments4 min readLW link

Alignment Newsletter #15: 07/16/18

Rohin ShahJul 16, 2018, 4:10 PM

42 points

0 comments15 min readLW link

(mailchi.mp)

Compact vs. Wide Models

VaniverJul 16, 2018, 4:09 AM

31 points

5 comments3 min readLW link

Probabilistic decision-making as an anxiety-reduction technique

RationallyDenseJul 16, 2018, 3:51 AM

8 points

4 comments1 min readLW link

Buridan’s ass in coordination games

jessicataJul 16, 2018, 2:51 AM

52 points

26 comments10 min readLW link

Research Debt

ElizabethJul 15, 2018, 7:36 PM

25 points

2 comments LW link

(distill.pub)

An optimistic explanation of the outrage epidemic

chaosmageJul 15, 2018, 2:35 PM

18 points

5 comments3 min readLW link

Announcement: AI alignment prize round 3 winners and next round

cousin_itJul 15, 2018, 7:40 AM

93 points

7 comments1 min readLW link

Meetup Cookbook

maiaJul 14, 2018, 10:26 PM

74 points

7 comments1 min readLW link

(tigrennatenn.neocities.org)

Expected Pain Parameters

AlicornJul 14, 2018, 7:30 PM

87 points

12 comments2 min readLW link

Boltzmann Brains and Within-model vs. Between-models Probability

Charlie SteinerJul 14, 2018, 9:52 AM

15 points

12 comments3 min readLW link

[1607.08289] “Mammalian Value Systems” (as a starting point for human value system model created by IRL agent)

avturchinJul 14, 2018, 9:46 AM

9 points

9 comments LW link

(arxiv.org)

Generating vs Recognizing

lifelonglearnerJul 14, 2018, 5:10 AM

15 points

3 comments4 min readLW link

LW Update 2018-7-14 – Styling Rework, CommentsItem, Performance

RaemonJul 14, 2018, 1:13 AM

30 points

0 comments1 min readLW link

Secondary Stressors and Tactile Ambition

lionhearted (Sebastian Marshall)Jul 13, 2018, 12:26 AM

16 points

16 comments4 min readLW link

A Sarno-Hanson Synthesis

moridinamaelJul 12, 2018, 4:13 PM

52 points

15 comments4 min readLW link

Probability is a model, frequency is an observation: Why both halfers and thirders are correct in the Sleeping Beauty problem.

ShmiJul 12, 2018, 6:52 AM

26 points

34 comments2 min readLW link

What does the stock market tell us about AI timelines?

Tobias_BaumannJul 12, 2018, 6:05 AM

6 points

5 comments LW link

(s-risks.org)

An Agent is a Worldline in Tegmark V

komponistoJul 12, 2018, 5:12 AM

24 points

12 comments2 min readLW link

Washington, D.C.: What If

RobinZJul 12, 2018, 4:30 AM

9 points

0 comments1 min readLW link

Are pre-specified utility functions about the real world possible in principle?

mloganJul 11, 2018, 6:46 PM

24 points

7 comments4 min readLW link

Melatonin: Much More Than You Wanted To Know

Scott AlexanderJul 11, 2018, 5:40 PM

122 points

16 comments15 min readLW link

(slatestarcodex.com)

Monk Treehouse: some problems defining simulation

dranorterJul 11, 2018, 7:35 AM

6 points

1 comment5 min readLW link

Mathematical Mindset

komponistoJul 11, 2018, 3:03 AM

54 points

5 comments2 min readLW link

Decision-theoretic problems and Theories; An (Incomplete) comparative list

somervtaJul 11, 2018, 2:59 AM

36 points

0 comments1 min readLW link

(docs.google.com)

Agents That Learn From Human Behavior Can’t Learn Human Values That Humans Haven’t Learned Yet

steven0461Jul 11, 2018, 2:59 AM

28 points

11 comments1 min readLW link

On the Role of Counterfactuals in Learning

Max KanwalJul 11, 2018, 2:45 AM

11 points

2 comments3 min readLW link

Clarifying Consequentialists in the Solomonoff Prior

Vlad MikulikJul 11, 2018, 2:35 AM

20 points

16 comments6 min readLW link

Complete Class: Consequentialist Foundations

abramdemskiJul 11, 2018, 1:57 AM

58 points

37 comments13 min readLW link

Conditions under which misaligned subagents can (not) arise in classifiers

anon1Jul 11, 2018, 1:52 AM

12 points

2 comments2 min readLW link

No, I won’t go there, it feels like you’re trying to Pascal-mug me

RupertJul 11, 2018, 1:37 AM

9 points

0 comments2 min readLW link

Conceptual problems with utility functions

DacynJul 11, 2018, 1:29 AM

22 points

12 comments2 min readLW link

Dependent Type Theory and Zero-Shot Reasoning

evhubJul 11, 2018, 1:16 AM

27 points

3 comments5 min readLW link

A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment

AlexMennenJul 11, 2018, 1:03 AM

40 points

1 comment1 min readLW link

Bounding Goodhart’s Law

eric_langloisJul 11, 2018, 12:46 AM

43 points

2 comments5 min readLW link

Mechanistic Transparency for Machine Learning

DanielFilanJul 11, 2018, 12:34 AM

55 points

9 comments4 min readLW link

An environment for studying counterfactuals

NisanJul 11, 2018, 12:14 AM

15 points

6 comments3 min readLW link

A universal score for optimizers

levinJul 10, 2018, 11:52 PM

15 points

8 comments3 min readLW link

Bayesian Probability is for things that are Space-like Separated from You

Scott GarrabrantJul 10, 2018, 11:47 PM

86 points

22 comments2 min readLW link

Alignment problems for economists

Chris van MerwijkJul 10, 2018, 11:43 PM

5 points

2 comments2 min readLW link

Non-resolve as Resolve

Linda LinseforsJul 10, 2018, 11:31 PM

15 points

1 comment2 min readLW link

A framework for thinking about wireheading

theotherotheralexJul 10, 2018, 11:14 PM

15 points

4 comments1 min readLW link

Logical Uncertainty and Functional Decision Theory

swordsintoploughsharesJul 10, 2018, 11:08 PM

15 points

4 comments2 min readLW link

Repeated (and improved) Sleeping Beauty problem

Linda LinseforsJul 10, 2018, 10:32 PM

12 points

5 comments2 min readLW link

Probability is fake, frequency is real

Linda LinseforsJul 10, 2018, 10:32 PM

12 points

7 comments1 min readLW link

Conditioning, Counterfactuals, Exploration, and Gears

DiffractorJul 10, 2018, 10:11 PM

28 points

1 comment5 min readLW link

Two agents can have the same source code and optimise different utility functions

Joar SkalseJul 10, 2018, 9:51 PM

11 points

11 comments1 min readLW link

The Intentional Agency Experiment

Alexander Gietelink OldenzielJul 10, 2018, 8:32 PM

13 points

5 comments3 min readLW link

Announcing AlignmentForum.org Beta

RaemonJul 10, 2018, 8:19 PM

68 points

35 comments2 min readLW link

Choosing to Choose?

Daniel HerrmannJul 10, 2018, 8:15 PM

10 points

7 comments5 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer