Par­allels Between AI Safety by De­bate and Ev­i­dence Law

Cullen20 Jul 2020 22:52 UTC
10 points
1 comment2 min readLW link
(cullenokeefe.com)

Thiel on Progress and Stagnation

Richard_Ngo20 Jul 2020 20:27 UTC
173 points
32 comments11 min readLW link
(docs.google.com)

Learn­ing Values in Practice

Stuart_Armstrong20 Jul 2020 18:38 UTC
24 points
0 comments5 min readLW link

Ineffi­cient doesn’t mean in­differ­ent, but it might mean wimpy.

DirectedEvolution20 Jul 2020 18:27 UTC
14 points
3 comments5 min readLW link

[Question] To what ex­tent is GPT-3 ca­pa­ble of rea­son­ing?

TurnTrout20 Jul 2020 17:10 UTC
70 points
73 comments16 min readLW link

Sel­ling real es­tate: should you over­price or un­der­price?

Steven Byrnes20 Jul 2020 15:54 UTC
19 points
5 comments10 min readLW link

[Question] “Do Noth­ing” util­ity func­tion, 3½ years later?

niplav20 Jul 2020 11:09 UTC
5 points
3 comments1 min readLW link

Oper­a­tional­iz­ing Interpretability

lifelonglearner20 Jul 2020 5:22 UTC
20 points
0 comments4 min readLW link

Use re­silience, in­stead of im­pre­ci­sion, to com­mu­ni­cate uncertainty

habryka20 Jul 2020 5:08 UTC
3 points
1 comment1 min readLW link
(forum.effectivealtruism.org)

What Would I Do? Self-pre­dic­tion in Sim­ple Algorithms

Scott Garrabrant20 Jul 2020 4:27 UTC
65 points
12 comments5 min readLW link

“Should Black­mail Be Le­gal” Han­son/​Zvi De­bate (Sun July 26th, 3pm PDT)

Ben Pace20 Jul 2020 4:06 UTC
36 points
13 comments1 min readLW link

The 8 Tech­niques to Tol­er­ify the Dark World

adamShimi20 Jul 2020 0:58 UTC
2 points
5 comments2 min readLW link

Praise of some pop­u­lar LW articles

DirectedEvolution20 Jul 2020 0:32 UTC
40 points
1 comment7 min readLW link

Types Of On­line Meetups

Dan B19 Jul 2020 23:51 UTC
4 points
2 comments2 min readLW link

Mu­si­cal Outgroups

eapache19 Jul 2020 22:55 UTC
9 points
1 comment4 min readLW link

Fo­rum As­sisted Discussion

Dan B19 Jul 2020 22:38 UTC
9 points
0 comments3 min readLW link

Pulse and Glide Cycling

jefftk19 Jul 2020 19:02 UTC
11 points
5 comments2 min readLW link
(www.jefftk.com)

[Question] Math. proof of the su­pe­ri­or­ity of in­de­pen­dent guesses?

Milton19 Jul 2020 2:38 UTC
−3 points
7 comments1 min readLW link

Crit­i­cism of some pop­u­lar LW articles

DirectedEvolution19 Jul 2020 1:16 UTC
71 points
19 comments6 min readLW link

Swiss Poli­ti­cal Sys­tem: More than You ever Wanted to Know (I.)

Martin Sustrik19 Jul 2020 1:11 UTC
172 points
39 comments24 min readLW link2 reviews

[Question] Why is pseudo-al­ign­ment “worse” than other ways ML can fail to gen­er­al­ize?

nostalgebraist18 Jul 2020 22:54 UTC
45 points
9 comments2 min readLW link

Against Reopen­ing Ottawa

eapache18 Jul 2020 20:08 UTC
6 points
2 comments5 min readLW link

Col­lec­tion of GPT-3 results

Kaj_Sotala18 Jul 2020 20:04 UTC
89 points
24 comments1 min readLW link
(twitter.com)

[Question] Is there an easy way to turn a LW se­quence into an epub?

ChristianKl18 Jul 2020 18:20 UTC
17 points
9 comments1 min readLW link

Cal­ibrate words, not just probabilities

MikkW18 Jul 2020 5:56 UTC
11 points
3 comments2 min readLW link

[Question] Erv­ing Goff­man’s ‘pa­per’

Saffron18 Jul 2020 1:12 UTC
5 points
2 comments1 min readLW link

Les­sons on AI Takeover from the conquistadors

17 Jul 2020 22:35 UTC
61 points
31 comments5 min readLW link

[Question] Can an agent use in­ter­ac­tive proofs to check the al­ign­ment of suc­ce­sors?

PabloAMC17 Jul 2020 19:07 UTC
7 points
2 comments1 min readLW link

An­thro­po­mor­phiz­ing Humans

johnswentworth17 Jul 2020 17:49 UTC
46 points
6 comments2 min readLW link

Tel­ling more ra­tio­nal stories

DirectedEvolution17 Jul 2020 17:47 UTC
26 points
21 comments3 min readLW link

Solv­ing Math Prob­lems by Relay

17 Jul 2020 15:32 UTC
103 points
26 comments7 min readLW link

[Question] What are the best tools you have seen to keep track of knowl­edge around testable state­ments?

migueltorrescosta17 Jul 2020 15:02 UTC
2 points
1 comment1 min readLW link

En­vi­ron­ments as a bot­tle­neck in AGI development

Richard_Ngo17 Jul 2020 5:02 UTC
41 points
19 comments6 min readLW link

My Dat­ing Plan ala Ge­offrey Miller

snog toddgrass17 Jul 2020 4:52 UTC
2 points
57 comments3 min readLW link

Meta-prefer­ences are weird

16 Jul 2020 23:03 UTC
13 points
2 comments5 min readLW link

Sun­day July 19, 1pm (PDT) — talks by Rae­mon, ricraz, mr-hire, Jame­son Quinn

16 Jul 2020 20:04 UTC
26 points
6 comments1 min readLW link

[Question] What should be the topic of my LW mini-talk this Sun­day (July 18th)?

Jameson Quinn16 Jul 2020 16:32 UTC
7 points
3 comments1 min readLW link

Covid 7/​16: Be­com­ing the Mask

Zvi16 Jul 2020 12:40 UTC
82 points
20 comments15 min readLW link
(thezvi.wordpress.com)

Why as­so­ci­a­tive op­er­a­tions?

Sunny from QAD16 Jul 2020 12:36 UTC
6 points
7 comments1 min readLW link
(questionsanddaylight.com)

[Question] How big of an is­sue are patent trolls to the av­er­age startup?

ChristianKl16 Jul 2020 11:31 UTC
12 points
4 comments1 min readLW link

[AN #107]: The con­ver­gent in­stru­men­tal sub­goals of goal-di­rected agents

Rohin Shah16 Jul 2020 6:47 UTC
13 points
1 comment8 min readLW link
(mailchi.mp)

[AN #108]: Why we should scru­ti­nize ar­gu­ments for AI risk

Rohin Shah16 Jul 2020 6:47 UTC
19 points
6 comments12 min readLW link
(mailchi.mp)

Align­ment pro­pos­als and com­plex­ity classes

evhub16 Jul 2020 0:27 UTC
40 points
26 comments13 min readLW link

[Question] How should AI de­bate be judged?

abramdemski15 Jul 2020 22:20 UTC
49 points
26 comments6 min readLW link

Au­to­mat­i­cally Turn­ing Off Com­puter at Night

Raemon15 Jul 2020 20:42 UTC
19 points
13 comments2 min readLW link

[Question] Public Figures Con­tract­ing COVID-19 as Pos­i­tive Event

mc15 Jul 2020 19:56 UTC
−11 points
2 comments1 min readLW link

Diver­gence causes iso­lated de­mands for rigor

George3d615 Jul 2020 18:59 UTC
14 points
4 comments7 min readLW link
(blog.cerebralab.com)

[Question] What do we now know about long-term con­se­quences of a COVID-19 in­fec­tion?

ChristianKl15 Jul 2020 14:42 UTC
14 points
1 comment1 min readLW link

New pa­per: AGI Agent Safety by Iter­a­tively Im­prov­ing the Utility Function

Koen.Holtman15 Jul 2020 14:05 UTC
21 points
2 comments6 min readLW link

Ed­u­ca­tion 2.0 — A brand new ed­u­ca­tion system

aryan15 Jul 2020 10:09 UTC
−8 points
3 comments6 min readLW link