All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 201820192020 2021 2022 2023 2024

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 141516 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Partial summary of debate with Benquo and Jessicata [pt 1]

Raemon14 Aug 2019 20:02 UTC

89 points

63 comments22 min readLW link 3 reviews

“Designing agent incentives to avoid reward tampering”, DeepMind

gwern14 Aug 2019 16:57 UTC

28 points

15 comments1 min readLW link

(medium.com)

Subagents, trauma and rationality

Kaj_Sotala14 Aug 2019 13:14 UTC

111 points

4 comments19 min readLW link

Predicted AI alignment event/meeting calendar

rmoehn14 Aug 2019 7:14 UTC

29 points

14 comments1 min readLW link

Natural laws should be explicit constraints on strategy space

ryan_b13 Aug 2019 20:22 UTC

8 points

6 comments1 min readLW link

Distance Functions are Hard

Grue_Slinky13 Aug 2019 17:33 UTC

31 points

19 comments6 min readLW link

Book Review: Secular Cycles

Scott Alexander13 Aug 2019 4:10 UTC

62 points

10 comments16 min readLW link 1 review

(slatestarcodex.com)

A Primer on Matrix Calculus, Part 1: Basic review

Matthew Barnett12 Aug 2019 23:44 UTC

25 points

4 comments7 min readLW link

[Question] What explanatory power does Kahneman’s System 2 possess?

Richard_Ngo12 Aug 2019 15:23 UTC

31 points

2 comments1 min readLW link

Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4)

Davidmanheim12 Aug 2019 8:07 UTC

15 points

3 comments4 min readLW link

Adjectives from the Future: The Dangers of Result-based Descriptions

Pradeep_Kumar11 Aug 2019 19:19 UTC

19 points

8 comments11 min readLW link

[Question] Could we solve this email mess if we all moved to paid emails?

jacobjacob11 Aug 2019 16:31 UTC

29 points

50 comments4 min readLW link

AI Safety Reading Group

Søren Elverlin11 Aug 2019 9:01 UTC

16 points

8 comments1 min readLW link

[Question] Does human choice have to be transitive in order to be rational/consistent?

jmh11 Aug 2019 1:49 UTC

9 points

6 comments1 min readLW link

Diana Fleischman and Geoffrey Miller—Audience Q&A

Jacob Falkovich10 Aug 2019 22:37 UTC

38 points

6 comments9 min readLW link

Intransitive Preferences You Can’t Pump

zulupineapple9 Aug 2019 23:10 UTC

0 points

2 comments1 min readLW link

Categorial preferences and utility functions

DavidHolmes9 Aug 2019 21:36 UTC

10 points

6 comments5 min readLW link

[Question] What is the state of the ego depletion field?

Eli Tyre9 Aug 2019 20:30 UTC

27 points

10 comments1 min readLW link

Why Gradients Vanish and Explode

Matthew Barnett9 Aug 2019 2:54 UTC

25 points

9 comments3 min readLW link

AI Forecasting Dictionary (Forecasting infrastructure, part 1)

jacobjacob and bgold

8 Aug 2019 16:10 UTC

50 points

0 comments5 min readLW link

[Question] Why do humans not have built-in neural i/o channels?

Richard_Ngo8 Aug 2019 13:09 UTC

25 points

23 comments1 min readLW link

Which of these five AI alignment research projects ideas are no good?

rmoehn8 Aug 2019 7:17 UTC

25 points

13 comments1 min readLW link

Calibrating With Cards

lifelonglearner8 Aug 2019 6:44 UTC

32 points

3 comments3 min readLW link

[Question] Is there a source/market for LW-related t-shirts?

jooyous8 Aug 2019 4:30 UTC

8 points

3 comments1 min readLW link

Verification and Transparency

DanielFilan8 Aug 2019 1:50 UTC

35 points

6 comments2 min readLW link

(danielfilan.com)

Toy model piece #2: Combining short and long range partial preferences

Stuart_Armstrong8 Aug 2019 0:11 UTC

14 points

0 comments4 min readLW link

Four Ways An Impact Measure Could Help Alignment

Matthew Barnett8 Aug 2019 0:10 UTC

21 points

1 comment9 min readLW link

Nashville August SSC Meetup

friedelcraftiness7 Aug 2019 20:11 UTC

1 point

0 comments1 min readLW link

In defense of Oracle (“Tool”) AI research

Steven Byrnes7 Aug 2019 19:14 UTC

22 points

11 comments4 min readLW link

Help forecast study replication in this social science prediction market

rosiecam7 Aug 2019 18:18 UTC

29 points

3 comments1 min readLW link

[Question] Edit Nickname

Luigi Lotti7 Aug 2019 17:42 UTC

5 points

1 comment1 min readLW link

Self-Supervised Learning and AGI Safety

Steven Byrnes7 Aug 2019 14:21 UTC

29 points

9 comments12 min readLW link

Emotions are not beliefs

Chris_Leong7 Aug 2019 6:27 UTC

25 points

2 comments2 min readLW link

Understanding Recent Impact Measures

Matthew Barnett7 Aug 2019 4:57 UTC

16 points

6 comments7 min readLW link

[Site Update] Behind the scenes data-layer and caching improvements

habryka7 Aug 2019 0:49 UTC

23 points

3 comments1 min readLW link

Project Proposal: Considerations for trading off capabilities and safety impacts of AI research

David Scott Krueger (formerly: capybaralet)6 Aug 2019 22:22 UTC

25 points

11 comments2 min readLW link

Subagents, neural Turing machines, thought selection, and blindspots

Kaj_Sotala6 Aug 2019 21:15 UTC

87 points

3 comments12 min readLW link

[Question] Percent reduction of gun-related deaths by color of gun.

Gunnar_Zarncke6 Aug 2019 20:28 UTC

8 points

11 comments1 min readLW link

New paper: Corrigibility with Utility Preservation

Koen.Holtman6 Aug 2019 19:04 UTC

44 points

11 comments2 min readLW link

Weak foundation of determinism analysis

aiiixiii6 Aug 2019 19:03 UTC

14 points

54 comments3 min readLW link

Trauma, Meditation, and a Cool Scar

Logan Riggs6 Aug 2019 16:17 UTC

102 points

17 comments5 min readLW link 1 review

[Question] Why is the nitrogen cycle so under-emphasized compared to climate change

ChristianKl6 Aug 2019 9:25 UTC

15 points

4 comments1 min readLW link

[Question] How would a person go about starting a geoengineering startup?

Pee Doom6 Aug 2019 7:34 UTC

11 points

5 comments1 min readLW link

Status 451 on Diagnosis: Russell Aphasia

Zack_M_Davis6 Aug 2019 4:43 UTC

48 points

1 comment1 min readLW link

(status451.com)

Searle’s Chinese Room and the Meaning of Meaning

Jimdrix_Hendri6 Aug 2019 4:09 UTC

0 points

4 comments2 min readLW link

[Question] What are the best resources for examining the evidence for anthropogenic climate change?

Matthew Barnett6 Aug 2019 2:53 UTC

10 points

8 comments1 min readLW link

A Survey of Early Impact Measures

Matthew Barnett6 Aug 2019 1:22 UTC

29 points

0 comments8 min readLW link

Preferences as an (instinctive) stance

Stuart_Armstrong6 Aug 2019 0:43 UTC

18 points

4 comments4 min readLW link

[Question] How to navigate through contradictory (health/fitness) advice?

Sherrinford5 Aug 2019 20:58 UTC

14 points

7 comments1 min readLW link

My recommendations for gratitude exercises

MaxCarpendale5 Aug 2019 19:04 UTC

40 points

3 comments5 min readLW link