All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 201820192020 2021 2022 2023 2024

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Test Cases for Impact Regularisation Methods

DanielFilan6 Feb 2019 21:50 UTC

72 points

5 comments13 min readLW link

(danielfilan.com)

A tentative solution to a certain mythological beast of a problem

Edward Knox6 Feb 2019 20:42 UTC

−11 points

9 comments1 min readLW link

AI Alignment is Alchemy.

Jeevan6 Feb 2019 20:32 UTC

−9 points

20 comments1 min readLW link

My use of the phrase “Super-Human Feedback”

David Scott Krueger (formerly: capybaralet)6 Feb 2019 19:11 UTC

13 points

0 comments1 min readLW link

Thoughts on Ben Garfinkel’s “How sure are we about this AI stuff?”

David Scott Krueger (formerly: capybaralet)6 Feb 2019 19:09 UTC

25 points

17 comments1 min readLW link

Show LW: (video) how to remember everything you learn

ArthurLidia6 Feb 2019 19:02 UTC

3 points

0 comments1 min readLW link

Does the EA community do “basic science” grants? How do I get one?

Jameson Quinn6 Feb 2019 18:10 UTC

7 points

6 comments1 min readLW link

Is the World Getting Better? A brief summary of recent debate

ErickBall6 Feb 2019 17:38 UTC

35 points

8 comments2 min readLW link

(capx.co)

Security amplification

paulfchristiano6 Feb 2019 17:28 UTC

21 points

2 comments13 min readLW link

Alignment Newsletter #44

Rohin Shah6 Feb 2019 8:30 UTC

18 points

0 comments9 min readLW link

(mailchi.mp)

South Bay Meetup March 2nd

David Friedman6 Feb 2019 6:48 UTC

1 point

0 comments1 min readLW link

[Question] If Rationality can be likened to a ‘Martial Art’, what would be the Forms?

Bae's Theorem6 Feb 2019 5:48 UTC

21 points

10 comments1 min readLW link

Complexity Penalties in Statistical Learning

michael_h6 Feb 2019 4:13 UTC

31 points

3 comments6 min readLW link

Automated Nomic Game 2

jefftk5 Feb 2019 22:11 UTC

19 points

2 comments2 min readLW link

Should we bait criminals using clones ?

Aël Chappuit5 Feb 2019 21:13 UTC

−23 points

3 comments1 min readLW link

Describing things: parsimony, fruitfulness, and adaptability

Mary Chernyshenko5 Feb 2019 20:59 UTC

1 point

0 comments1 min readLW link

Philosophy as low-energy approximation

Charlie Steiner5 Feb 2019 19:34 UTC

40 points

20 comments3 min readLW link

When to use quantilization

RyanCarey5 Feb 2019 17:17 UTC

65 points

5 comments4 min readLW link

(notes on) Policy Desiderata for Superintelligent AI: A Vector Field Approach

Ben Pace4 Feb 2019 22:08 UTC

43 points

5 comments7 min readLW link

SSC Paris Meetup, 09/02/18

fbreton4 Feb 2019 19:54 UTC

1 point

0 comments1 min readLW link

January 2019 gwern.net newsletter

gwern4 Feb 2019 15:53 UTC

15 points

0 comments1 min readLW link

(www.gwern.net)

My atheism story

Pausecafe4 Feb 2019 14:33 UTC

26 points

3 comments7 min readLW link

(Why) Does the Basilisk Argument fail?

Lookingforyourlogic3 Feb 2019 23:50 UTC

0 points

11 comments2 min readLW link

Constructing Goodhart

johnswentworth3 Feb 2019 21:59 UTC

29 points

10 comments3 min readLW link

Conclusion to the sequence on value learning

Rohin Shah3 Feb 2019 21:05 UTC

51 points

20 comments5 min readLW link

AI Safety Prerequisites Course: Revamp and New Lessons

philip_b3 Feb 2019 21:04 UTC

24 points

5 comments1 min readLW link

[Question] What are some of bizarre theories based on anthropic reasoning?

Dr. Jamchie3 Feb 2019 18:48 UTC

21 points

13 comments1 min readLW link

Rationality: What’s the point?

Hazard3 Feb 2019 16:34 UTC

12 points

11 comments1 min readLW link

Quantifying Human Suffering and “Everyday Suffering”

willfranks3 Feb 2019 13:07 UTC

7 points

3 comments1 min readLW link

[Question] How to stay concentrated for a long period of time?

infinickel3 Feb 2019 5:24 UTC

6 points

15 comments1 min readLW link

How to notice being mind-hacked

Shmi2 Feb 2019 23:13 UTC

18 points

22 comments2 min readLW link

Depression philosophizing

aaq2 Feb 2019 22:54 UTC

6 points

2 comments1 min readLW link

LessWrong DC: Metameetup

rusalkii2 Feb 2019 18:50 UTC

1 point

0 comments1 min readLW link

SSC Atlanta Meetup

Steve French2 Feb 2019 3:11 UTC

2 points

0 comments1 min readLW link

[Question] How does Gradient Descent Interact with Goodhart?

Scott Garrabrant2 Feb 2019 0:14 UTC

68 points

19 comments4 min readLW link

Philadelphia SSC Meetup

Majuscule1 Feb 2019 23:51 UTC

1 point

0 comments1 min readLW link

STRUCTURE: Reality and rational best practice

Hazard1 Feb 2019 23:51 UTC

5 points

2 comments1 min readLW link

An Attempt To Explain No-Self In Simple Terms

Justin Vriend1 Feb 2019 23:50 UTC

1 point

0 comments3 min readLW link

STRUCTURE: How the Social Affects your rationality

Hazard1 Feb 2019 23:35 UTC

0 points

0 comments1 min readLW link

STRUCTURE: A Crash Course in Your Brain

Hazard1 Feb 2019 23:17 UTC

6 points

4 comments1 min readLW link

February Nashville SSC Meetup

Dude McDude1 Feb 2019 22:36 UTC

1 point

0 comments1 min readLW link

[Question] What kind of information would serve as the best evidence for resolving the debate of whether a centrist or leftist Democratic nominee is likelier to take the White House in 2020?

Evan_Gaensbauer1 Feb 2019 18:40 UTC

10 points

10 comments3 min readLW link

Urgent & important: How (not) to do your to-do list

bfinn1 Feb 2019 17:44 UTC

51 points

20 comments13 min readLW link

Who wants to be a Millionaire?

Bucky1 Feb 2019 14:02 UTC

29 points

1 comment11 min readLW link

What is Wrong?

Inyuki1 Feb 2019 12:02 UTC

1 point

2 comments2 min readLW link

Drexler on AI Risk

PeterMcCluskey1 Feb 2019 5:11 UTC

35 points

10 comments9 min readLW link

(www.bayesianinvestor.com)

Boundaries—A map and territory experiment. [post-rationality]

Elo1 Feb 2019 2:08 UTC

−18 points

14 comments2 min readLW link

[Question] Why is this utilitarian calculus wrong? Or is it?

EconomicModel31 Jan 2019 23:57 UTC

15 points

21 comments1 min readLW link

Small hope for less bias and more practability

ArthurLidia31 Jan 2019 22:09 UTC

0 points

0 comments1 min readLW link

Reliability amplification

paulfchristiano31 Jan 2019 21:12 UTC

24 points

3 comments7 min readLW link