[Question] Why is pseudo-al­ign­ment “worse” than other ways ML can fail to gen­er­al­ize?

nostalgebraistJul 18, 2020, 10:54 PM
45 points
9 comments2 min readLW link

Against Reopen­ing Ottawa

eapacheJul 18, 2020, 8:08 PM
6 points
2 comments5 min readLW link

Col­lec­tion of GPT-3 results

Kaj_SotalaJul 18, 2020, 8:04 PM
89 points
24 comments1 min readLW link
(twitter.com)

[Question] Is there an easy way to turn a LW se­quence into an epub?

ChristianKlJul 18, 2020, 6:20 PM
17 points
9 comments1 min readLW link

Cal­ibrate words, not just probabilities

MikkWJul 18, 2020, 5:56 AM
11 points
3 comments2 min readLW link

[Question] Erv­ing Goff­man’s ‘pa­per’

SaffronJul 18, 2020, 1:12 AM
5 points
2 comments1 min readLW link

Les­sons on AI Takeover from the conquistadors

Jul 17, 2020, 10:35 PM
61 points
31 comments6 min readLW link

[Question] Can an agent use in­ter­ac­tive proofs to check the al­ign­ment of suc­ce­sors?

PabloAMCJul 17, 2020, 7:07 PM
7 points
2 comments1 min readLW link

An­thro­po­mor­phiz­ing Humans

johnswentworthJul 17, 2020, 5:49 PM
46 points
6 comments2 min readLW link

Tel­ling more ra­tio­nal stories

DirectedEvolutionJul 17, 2020, 5:47 PM
26 points
21 comments3 min readLW link

Solv­ing Math Prob­lems by Relay

Jul 17, 2020, 3:32 PM
103 points
26 comments7 min readLW link

[Question] What are the best tools you have seen to keep track of knowl­edge around testable state­ments?

migueltorrescostaJul 17, 2020, 3:02 PM
2 points
1 comment1 min readLW link

En­vi­ron­ments as a bot­tle­neck in AGI development

Richard_NgoJul 17, 2020, 5:02 AM
41 points
19 comments6 min readLW link

My Dat­ing Plan ala Ge­offrey Miller

snog toddgrassJul 17, 2020, 4:52 AM
2 points
57 comments3 min readLW link

Meta-prefer­ences are weird

Jul 16, 2020, 11:03 PM
13 points
2 comments5 min readLW link

Sun­day July 19, 1pm (PDT) — talks by Rae­mon, ricraz, mr-hire, Jame­son Quinn

Jul 16, 2020, 8:04 PM
26 points
6 comments1 min readLW link

[Question] What should be the topic of my LW mini-talk this Sun­day (July 18th)?

Jameson QuinnJul 16, 2020, 4:32 PM
7 points
3 comments1 min readLW link

Covid 7/​16: Be­com­ing the Mask

ZviJul 16, 2020, 12:40 PM
82 points
20 comments15 min readLW link
(thezvi.wordpress.com)

Why as­so­ci­a­tive op­er­a­tions?

Sunny from QADJul 16, 2020, 12:36 PM
6 points
7 commentsLW link
(questionsanddaylight.com)

[Question] How big of an is­sue are patent trolls to the av­er­age startup?

ChristianKlJul 16, 2020, 11:31 AM
12 points
4 comments1 min readLW link

[AN #107]: The con­ver­gent in­stru­men­tal sub­goals of goal-di­rected agents

Rohin ShahJul 16, 2020, 6:47 AM
13 points
1 comment8 min readLW link
(mailchi.mp)

[AN #108]: Why we should scru­ti­nize ar­gu­ments for AI risk

Rohin ShahJul 16, 2020, 6:47 AM
19 points
6 comments12 min readLW link
(mailchi.mp)

Align­ment pro­pos­als and com­plex­ity classes

evhubJul 16, 2020, 12:27 AM
40 points
26 comments13 min readLW link

[Question] How should AI de­bate be judged?

abramdemskiJul 15, 2020, 10:20 PM
49 points
26 comments6 min readLW link

Au­to­mat­i­cally Turn­ing Off Com­puter at Night

RaemonJul 15, 2020, 8:42 PM
19 points
13 comments2 min readLW link

[Question] Public Figures Con­tract­ing COVID-19 as Pos­i­tive Event

mcJul 15, 2020, 7:56 PM
−11 points
2 comments1 min readLW link

Diver­gence causes iso­lated de­mands for rigor

George3d6Jul 15, 2020, 6:59 PM
14 points
4 comments7 min readLW link
(blog.cerebralab.com)

[Question] What do we now know about long-term con­se­quences of a COVID-19 in­fec­tion?

ChristianKlJul 15, 2020, 2:42 PM
14 points
1 comment1 min readLW link

New pa­per: AGI Agent Safety by Iter­a­tively Im­prov­ing the Utility Function

Koen.HoltmanJul 15, 2020, 2:05 PM
21 points
2 comments6 min readLW link

Ed­u­ca­tion 2.0 — A brand new ed­u­ca­tion system

aryanJul 15, 2020, 10:09 AM
−8 points
3 comments6 min readLW link

Clas­sifi­ca­tion of AI al­ign­ment re­search: de­con­fu­sion, “good enough” non-su­per­in­tel­li­gent AI al­ign­ment, su­per­in­tel­li­gent AI alignment

philip_bJul 14, 2020, 10:48 PM
35 points
25 comments3 min readLW link

Mazes and Duality

Jul 14, 2020, 7:54 PM
62 points
10 comments6 min readLW link

The Gold­bach con­jec­ture is prob­a­bly cor­rect; so was Fer­mat’s last theorem

Stuart_ArmstrongJul 14, 2020, 7:30 PM
82 points
28 comments4 min readLW link

Dremeling

CzynskiJul 14, 2020, 7:23 PM
64 points
8 comments1 min readLW link

Cal­ibra­tion Prac­tice: Retro­d­ic­tions on Metaculus

RaemonJul 14, 2020, 6:35 PM
34 points
2 comments1 min readLW link

AI Benefits Post 4: Out­stand­ing Ques­tions on Select­ing Benefits

CullenJul 14, 2020, 5:26 PM
4 points
4 comments5 min readLW link

Ra­tion­al­ity Vienna Meetup July 2020

Laszlo_TreszkaiJul 14, 2020, 1:06 PM
2 points
2 comments1 min readLW link

Limits of Cur­rent US Pre­dic­tion Mar­kets (Pre­dic­tIt Case Study)

aphyerJul 14, 2020, 7:24 AM
210 points
50 comments7 min readLW link

Al­gorith­mic In­tent: A Han­so­nian Gen­er­al­ized Anti-Zom­bie Principle

Zack_M_DavisJul 14, 2020, 6:03 AM
51 points
20 comments12 min readLW link

[Question] What are the mostly likely ways AGI will emerge?

Craig QuiterJul 14, 2020, 12:58 AM
3 points
7 comments1 min readLW link

[Question] How to per­suade peo­ple/​groups out of sunk cost fal­lacy?

vernamcipherJul 13, 2020, 10:51 PM
3 points
2 comments1 min readLW link

Life Through Quan­tum An­neal­ing: How a quan­tum com­put­ing tech­nique could shape existence

ChrisMJul 13, 2020, 10:48 PM
2 points
1 comment16 min readLW link

[Question] 3-P Group op­ti­mal for dis­cus­sion?

AiresJLJul 13, 2020, 10:23 PM
3 points
0 comments1 min readLW link

Book Re­view: Fooled by Randomness

SherrinfordJul 13, 2020, 9:02 PM
34 points
10 comments5 min readLW link

Roll for Sanity

eapacheJul 13, 2020, 4:39 PM
16 points
2 comments4 min readLW link

Null-box­ing New­comb’s Problem

YitzJul 13, 2020, 4:32 PM
33 points
9 comments4 min readLW link

2020 LessWrong De­mo­graph­ics Sur­vey Results

B JacobsJul 13, 2020, 1:53 PM
18 points
7 comments1 min readLW link

What You Are

Jarred FilmerJul 13, 2020, 11:35 AM
2 points
3 comments1 min readLW link

Up­date more slowly!

DavidmanheimJul 13, 2020, 7:10 AM
51 points
4 comments2 min readLW link

In praise of con­tribut­ing ex­am­ples, analo­gies and lingo

Adam ZernerJul 13, 2020, 6:43 AM
37 points
3 comments1 min readLW link