All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 201820192020 2021 2022 2023 2024 2025

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30 31

A misconception about immigration

limerottAug 19, 2019, 10:37 PM

1 point

9 comments4 min readLW link

(limerott.com)

[Question] Do We Change Our Minds Less Often Than We Think?

RaemonAug 19, 2019, 9:37 PM

20 points

5 comments1 min readLW link

Classifying specification problems as variants of Goodhart’s Law

VikaAug 19, 2019, 8:40 PM

72 points

5 comments5 min readLW link 1 review

Unstriving

Jacob FalkovichAug 19, 2019, 2:31 PM

38 points

7 comments6 min readLW link

Goodhart’s Curse and Limitations on AI Alignment

Gordon Seidoh WorleyAug 19, 2019, 7:57 AM

25 points

18 comments10 min readLW link

Raph Koster on Virtual Worlds vs Games (notes)

RaemonAug 18, 2019, 7:01 PM

26 points

8 comments2 min readLW link

“Can We Survive Technology” by von Neumann

Ben PaceAug 18, 2019, 6:58 PM

32 points

2 comments1 min readLW link

(geosci.uchicago.edu)

Prokaryote Multiverse. An argument that potential simulators do not have significantly more complex physics than ours

mako yassAug 18, 2019, 4:22 AM

0 points

5 comments2 min readLW link

Neural Nets in Python 1

lifelonglearnerAug 18, 2019, 2:48 AM

10 points

3 comments8 min readLW link

Inspection Paradox as a Driver of Group Separation

ShmiAug 17, 2019, 9:47 PM

29 points

0 comments1 min readLW link

South Bay Meetup

David FriedmanAug 17, 2019, 7:56 PM

1 point

0 comments1 min readLW link

Problems in AI Alignment that philosophers could potentially contribute to

Wei DaiAug 17, 2019, 5:38 PM

79 points

14 comments2 min readLW link

[Question] How can you use music to boost learning?

Matthew BarnettAug 17, 2019, 6:59 AM

11 points

1 comment1 min readLW link

A Primer on Matrix Calculus, Part 3: The Chain Rule

Matthew BarnettAug 17, 2019, 1:50 AM

12 points

4 comments6 min readLW link

Nashville SSC September Meetup

friedelcraftinessAug 16, 2019, 3:16 PM

1 point

0 comments1 min readLW link

Beliefs Are For True Things

Davis_KingsleyAug 15, 2019, 11:23 PM

8 points

5 comments3 min readLW link

[Question] What experiments would demonstrate “upper limits of augmented working memory?”

RaemonAug 15, 2019, 10:09 PM

33 points

6 comments2 min readLW link

Clarifying some key hypotheses in AI alignment

Ben Cottier and Rohin Shah

Aug 15, 2019, 9:29 PM

79 points

12 comments9 min readLW link

Tessercube — OpenPGP Made Mobile

Suji YanAug 15, 2019, 9:34 AM

4 points

0 comments1 min readLW link

A Primer on Matrix Calculus, Part 2: Jacobians and other fun

Matthew BarnettAug 15, 2019, 1:13 AM

22 points

7 comments7 min readLW link

Partial summary of debate with Benquo and Jessicata [pt 1]

RaemonAug 14, 2019, 8:02 PM

89 points

63 comments22 min readLW link 3 reviews

“Designing agent incentives to avoid reward tampering”, DeepMind

gwernAug 14, 2019, 4:57 PM

28 points

15 comments1 min readLW link

(medium.com)

Subagents, trauma and rationality

Kaj_SotalaAug 14, 2019, 1:14 PM

111 points

4 comments19 min readLW link

Predicted AI alignment event/meeting calendar

rmoehnAug 14, 2019, 7:14 AM

29 points

14 comments1 min readLW link

Natural laws should be explicit constraints on strategy space

ryan_bAug 13, 2019, 8:22 PM

8 points

6 comments1 min readLW link

Distance Functions are Hard

Grue_SlinkyAug 13, 2019, 5:33 PM

31 points

19 comments6 min readLW link

Book Review: Secular Cycles

Scott AlexanderAug 13, 2019, 4:10 AM

62 points

10 comments16 min readLW link 1 review

(slatestarcodex.com)

A Primer on Matrix Calculus, Part 1: Basic review

Matthew BarnettAug 12, 2019, 11:44 PM

25 points

4 comments7 min readLW link

[Question] What explanatory power does Kahneman’s System 2 possess?

Richard_NgoAug 12, 2019, 3:23 PM

31 points

2 comments1 min readLW link

Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4)

DavidmanheimAug 12, 2019, 8:07 AM

15 points

3 comments4 min readLW link

Adjectives from the Future: The Dangers of Result-based Descriptions

Pradeep_KumarAug 11, 2019, 7:19 PM

19 points

8 comments11 min readLW link

[Question] Could we solve this email mess if we all moved to paid emails?

jacobjacobAug 11, 2019, 4:31 PM

29 points

50 comments4 min readLW link

AI Safety Reading Group

Søren ElverlinAug 11, 2019, 9:01 AM

16 points

8 comments1 min readLW link

[Question] Does human choice have to be transitive in order to be rational/consistent?

jmhAug 11, 2019, 1:49 AM

9 points

6 comments1 min readLW link

Diana Fleischman and Geoffrey Miller—Audience Q&A

Jacob FalkovichAug 10, 2019, 10:37 PM

38 points

6 comments9 min readLW link

Intransitive Preferences You Can’t Pump

zulupineappleAug 9, 2019, 11:10 PM

0 points

2 comments1 min readLW link

Categorial preferences and utility functions

DavidHolmesAug 9, 2019, 9:36 PM

10 points

6 comments5 min readLW link

[Question] What is the state of the ego depletion field?

Eli TyreAug 9, 2019, 8:30 PM

27 points

10 comments1 min readLW link

Why Gradients Vanish and Explode

Matthew BarnettAug 9, 2019, 2:54 AM

25 points

9 comments3 min readLW link

AI Forecasting Dictionary (Forecasting infrastructure, part 1)

jacobjacob and bgold

Aug 8, 2019, 4:10 PM

50 points

0 comments5 min readLW link

[Question] Why do humans not have built-in neural i/o channels?

Richard_NgoAug 8, 2019, 1:09 PM

25 points

23 comments1 min readLW link

Which of these five AI alignment research projects ideas are no good?

rmoehnAug 8, 2019, 7:17 AM

25 points

13 comments1 min readLW link

Calibrating With Cards

lifelonglearnerAug 8, 2019, 6:44 AM

32 points

3 comments3 min readLW link

[Question] Is there a source/market for LW-related t-shirts?

jooyous8 Aug 2019 4:30 UTC

8 points

3 comments1 min readLW link

Verification and Transparency

DanielFilan8 Aug 2019 1:50 UTC

35 points

6 comments2 min readLW link

(danielfilan.com)

Toy model piece #2: Combining short and long range partial preferences

Stuart_Armstrong8 Aug 2019 0:11 UTC

14 points

0 comments4 min readLW link

Four Ways An Impact Measure Could Help Alignment

Matthew Barnett8 Aug 2019 0:10 UTC

21 points

1 comment9 min readLW link

Nashville August SSC Meetup

friedelcraftiness7 Aug 2019 20:11 UTC

1 point

0 comments1 min readLW link

In defense of Oracle (“Tool”) AI research

Steven Byrnes7 Aug 2019 19:14 UTC

22 points

11 comments4 min readLW link

Help forecast study replication in this social science prediction market

rosiecam7 Aug 2019 18:18 UTC

29 points

3 comments1 min readLW link