All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 201920202021 2022 2023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 272829 30 31

Research agenda for AI safety and a better civilization

agilecavemanJul 22, 2020, 6:35 AM

12 points

2 comments16 min readLW link

More Right

Adam ZernerJul 22, 2020, 3:36 AM

22 points

29 comments4 min readLW link

[not ongoing] Thoughts on Proportional voting methods

Jameson QuinnJul 22, 2020, 2:46 AM

32 points

53 comments46 min readLW link

[Preprint] The Computational Limits of Deep Learning

Gordon Seidoh WorleyJul 21, 2020, 9:25 PM

9 points

4 comments1 min readLW link

(arxiv.org)

Fresh Bread

ZviJul 21, 2020, 8:40 PM

22 points

1 comment2 min readLW link

(thezvi.wordpress.com)

Competition: Amplify Rohin’s Prediction on AGI researchers & Safety Concerns

stuhlmuellerJul 21, 2020, 8:06 PM

83 points

41 comments3 min readLW link

Alignment As A Bottleneck To Usefulness Of GPT-3

johnswentworthJul 21, 2020, 8:02 PM

111 points

57 comments3 min readLW link

How good is humanity at coordination?

BuckJul 21, 2020, 8:01 PM

82 points

44 comments3 min readLW link

$1000 bounty for OpenAI to show whether GPT3 was “deliberately” pretending to be stupider than it is

Bird ConceptJul 21, 2020, 6:42 PM

56 points

39 comments2 min readLW link

(twitter.com)

[Question] What are the limits of self-education?

nitropieJul 21, 2020, 6:01 PM

3 points

2 comments1 min readLW link

[Meta] anonymous merit or public status

[anonymous]Jul 21, 2020, 6:01 PM

6 points

4 comments1 min readLW link

AI Benefits Post 5: Outstanding Questions on Governing Benefits

CullenJul 21, 2020, 4:46 PM

4 points

0 comments4 min readLW link

The “AI Dungeons” Dragon Model is heavily path dependent (testing GPT-3 on ethics)

Rafael HarthJul 21, 2020, 12:14 PM

44 points

9 comments6 min readLW link

Uncalibrated quantum experiments act clasically

justinpombrioJul 21, 2020, 5:31 AM

18 points

12 comments8 min readLW link

The Rediscovery of Interiority in Machine Learning

DanBJul 21, 2020, 5:02 AM

5 points

4 comments LW link

(danburfoot.net)

Chains, Bottlenecks and Optimization

curiJul 21, 2020, 2:07 AM

14 points

12 comments4 min readLW link

“Can you keep this confidential? How do you know?”

RaemonJul 21, 2020, 12:33 AM

164 points

43 comments3 min readLW link 2 reviews

Parallels Between AI Safety by Debate and Evidence Law

CullenJul 20, 2020, 10:52 PM

10 points

1 comment2 min readLW link

(cullenokeefe.com)

Thiel on Progress and Stagnation

Richard_NgoJul 20, 2020, 8:27 PM

173 points

32 comments11 min readLW link

(docs.google.com)

Learning Values in Practice

Stuart_ArmstrongJul 20, 2020, 6:38 PM

24 points

0 comments5 min readLW link

Inefficient doesn’t mean indifferent, but it might mean wimpy.

DirectedEvolutionJul 20, 2020, 6:27 PM

14 points

3 comments5 min readLW link

[Question] To what extent is GPT-3 capable of reasoning?

TurnTroutJul 20, 2020, 5:10 PM

70 points

73 comments16 min readLW link

Selling real estate: should you overprice or underprice?

Steven ByrnesJul 20, 2020, 3:54 PM

19 points

5 comments10 min readLW link

[Question] “Do Nothing” utility function, 3½ years later?

niplavJul 20, 2020, 11:09 AM

5 points

3 comments1 min readLW link

Operationalizing Interpretability

lifelonglearnerJul 20, 2020, 5:22 AM

20 points

0 comments4 min readLW link

Use resilience, instead of imprecision, to communicate uncertainty

habryka20 Jul 2020 5:08 UTC

3 points

1 comment1 min readLW link

(forum.effectivealtruism.org)

What Would I Do? Self-prediction in Simple Algorithms

Scott Garrabrant20 Jul 2020 4:27 UTC

65 points

12 comments5 min readLW link

“Should Blackmail Be Legal” Hanson/Zvi Debate (Sun July 26th, 3pm PDT)

Ben Pace20 Jul 2020 4:06 UTC

36 points

13 comments1 min readLW link

The 8 Techniques to Tolerify the Dark World

adamShimi20 Jul 2020 0:58 UTC

2 points

5 comments2 min readLW link

Praise of some popular LW articles

DirectedEvolution20 Jul 2020 0:32 UTC

40 points

1 comment7 min readLW link

Types Of Online Meetups

Dan B19 Jul 2020 23:51 UTC

4 points

2 comments2 min readLW link

Musical Outgroups

eapache19 Jul 2020 22:55 UTC

9 points

1 comment4 min readLW link

Forum Assisted Discussion

Dan B19 Jul 2020 22:38 UTC

9 points

0 comments3 min readLW link

Pulse and Glide Cycling

jefftk19 Jul 2020 19:02 UTC

11 points

5 comments2 min readLW link

(www.jefftk.com)

[Question] Math. proof of the superiority of independent guesses?

Milton19 Jul 2020 2:38 UTC

−3 points

7 comments1 min readLW link

Criticism of some popular LW articles

DirectedEvolution19 Jul 2020 1:16 UTC

71 points

19 comments6 min readLW link

Swiss Political System: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC

173 points

39 comments24 min readLW link 2 reviews

[Question] Why is pseudo-alignment “worse” than other ways ML can fail to generalize?

nostalgebraist18 Jul 2020 22:54 UTC

45 points

9 comments2 min readLW link

Against Reopening Ottawa

eapache18 Jul 2020 20:08 UTC

6 points

2 comments5 min readLW link

Collection of GPT-3 results

Kaj_Sotala18 Jul 2020 20:04 UTC

89 points

24 comments1 min readLW link

(twitter.com)

[Question] Is there an easy way to turn a LW sequence into an epub?

ChristianKl18 Jul 2020 18:20 UTC

17 points

9 comments1 min readLW link

Calibrate words, not just probabilities

MikkW18 Jul 2020 5:56 UTC

11 points

3 comments2 min readLW link

[Question] Erving Goffman’s ‘paper’

Saffron18 Jul 2020 1:12 UTC

5 points

2 comments1 min readLW link

Lessons on AI Takeover from the conquistadors

Daniel Kokotajlo and Bird Concept

17 Jul 2020 22:35 UTC

61 points

31 comments6 min readLW link

[Question] Can an agent use interactive proofs to check the alignment of succesors?

PabloAMC17 Jul 2020 19:07 UTC

7 points

2 comments1 min readLW link

Anthropomorphizing Humans

johnswentworth17 Jul 2020 17:49 UTC

46 points

6 comments2 min readLW link

Telling more rational stories

DirectedEvolution17 Jul 2020 17:47 UTC

26 points

21 comments3 min readLW link

Solving Math Problems by Relay

Ben Goldhaber and Owain_Evans

17 Jul 2020 15:32 UTC

103 points

26 comments7 min readLW link

[Question] What are the best tools you have seen to keep track of knowledge around testable statements?

migueltorrescosta17 Jul 2020 15:02 UTC

2 points

1 comment1 min readLW link

Environments as a bottleneck in AGI development

Richard_Ngo17 Jul 2020 5:02 UTC

41 points

19 comments6 min readLW link