Notes from “Don’t Shoot the Dog”

juliawiseApr 2, 2021, 4:34 PM
255 points
12 comments12 min readLW link1 review

Another (outer) al­ign­ment failure story

paulfchristianoApr 7, 2021, 8:12 PM
248 points
38 comments12 min readLW link1 review

An­nounc­ing the Align­ment Re­search Center

paulfchristianoApr 26, 2021, 11:30 PM
178 points
6 comments1 min readLW link
(ai-alignment.com)

Pre­dic­tive Cod­ing has been Unified with Backpropagation

lsusrApr 2, 2021, 9:42 PM
176 points
51 comments2 min readLW link

Spe­cial­iz­ing in Prob­lems We Don’t Understand

johnswentworthApr 10, 2021, 10:40 PM
174 points
29 comments8 min readLW link1 review

I’m from a par­allel Earth with much higher co­or­di­na­tion: AMA

Apr 5, 2021, 10:09 PM
173 points
35 comments61 min readLW link1 review

Test­ing The Nat­u­ral Ab­strac­tion Hy­poth­e­sis: Pro­ject Intro

johnswentworthApr 6, 2021, 9:24 PM
168 points
41 comments6 min readLW link1 review

Why has nu­clear power been a flop?

jasoncrawfordApr 16, 2021, 4:49 PM
148 points
50 comments15 min readLW link2 reviews
(rootsofprogress.org)

The Case for Ex­treme Vac­cine Effectiveness

RubyApr 13, 2021, 9:08 PM
142 points
37 comments23 min readLW link

Opinions on In­ter­pretable Ma­chine Learn­ing and 70 Sum­maries of Re­cent Papers

Apr 9, 2021, 7:19 PM
141 points
17 comments102 min readLW link

“AI and Com­pute” trend isn’t pre­dic­tive of what is happening

alexlyzhovApr 2, 2021, 12:44 AM
133 points
16 comments1 min readLW link

Why We Launched LessWrong.SubStack

Ben PaceApr 1, 2021, 6:34 AM
132 points
44 comments4 min readLW link

Tales from Pre­dic­tion Markets

ikeApr 3, 2021, 11:38 PM
128 points
15 comments3 min readLW link1 review
(misinfounderload.substack.com)

AMA: Paul Chris­ti­ano, al­ign­ment researcher

paulfchristianoApr 28, 2021, 6:55 PM
117 points
197 comments1 min readLW link

A new acausal trad­ing plat­form: RobinShould

Matthew BarnettApr 1, 2021, 4:56 PM
116 points
5 comments1 min readLW link

Monastery and Throne

Jacob FalkovichApr 6, 2021, 7:00 PM
115 points
42 comments10 min readLW link

Jaan Tal­linn’s 2020 Philan­thropy Overview

jaanApr 27, 2021, 4:22 PM
113 points
4 comments1 min readLW link
(jaan.online)

The ir­rele­vance of test scores is greatly exaggerated

dynomightApr 15, 2021, 2:15 PM
111 points
13 comments1 min readLW link
(dynomight.net)

How to Play a Sup­port Role in Re­search Conversations

johnswentworthApr 23, 2021, 8:57 PM
105 points
4 comments5 min readLW link

Covid 4/​22: Cri­sis in India

ZviApr 22, 2021, 1:40 PM
100 points
25 comments12 min readLW link
(thezvi.wordpress.com)

[Let­ter] Ad­vice for High School #1

lsusrApr 20, 2021, 4:09 AM
93 points
28 comments4 min readLW link

High­lights from The Au­to­bi­og­ra­phy of An­drew Carnegie

jasoncrawfordApr 8, 2021, 10:03 PM
92 points
9 comments19 min readLW link1 review
(rootsofprogress.org)

“Tak­ing your en­vi­ron­ment as ob­ject” vs “Be­ing sub­ject to your en­vi­ron­ment”

Ben PaceApr 11, 2021, 10:47 PM
87 points
17 comments3 min readLW link

Draft re­port on ex­is­ten­tial risk from power-seek­ing AI

Joe CarlsmithApr 28, 2021, 9:41 PM
85 points
23 comments1 min readLW link

Peo­ple Will Listen

sapphireApr 11, 2021, 4:51 PM
85 points
36 comments4 min readLW link

Gra­da­tions of In­ner Align­ment Obstacles

abramdemskiApr 20, 2021, 10:18 PM
84 points
22 comments9 min readLW link

What are all these chil­dren do­ing in my ponds?

dominicqApr 3, 2021, 8:16 PM
84 points
15 comments3 min readLW link

Covid 4/​15: Are We Se­ri­ously Do­ing This Again

ZviApr 15, 2021, 1:00 PM
82 points
37 comments9 min readLW link
(thezvi.wordpress.com)

A Brief Re­view of Cur­rent and Near-Fu­ture Meth­ods of Ge­netic Engineering

GeneSmithApr 10, 2021, 7:16 PM
82 points
33 comments15 min readLW link

Cen­ter for Ap­plied Pos­tra­tional­ity: An Update

Pee DoomApr 1, 2021, 8:13 AM
78 points
1 comment3 min readLW link

Beliefs as emo­tional strategies

Kaj_SotalaApr 9, 2021, 2:28 PM
75 points
4 comments8 min readLW link

Iter­ated Trust Kickstarters

RaemonApr 20, 2021, 3:18 AM
74 points
19 comments10 min readLW link

Up­dat­ing the Lot­tery Ticket Hypothesis

johnswentworthApr 18, 2021, 9:45 PM
73 points
41 comments2 min readLW link

Hell is wasted on the evil

lsusrApr 15, 2021, 8:52 AM
73 points
20 comments3 min readLW link

Don’t Sell Your Soul

Jacob FalkovichApr 6, 2021, 7:02 PM
72 points
43 comments9 min readLW link

Want­ing to Suc­ceed on Every Met­ric Presented

Logan RiggsApr 12, 2021, 8:43 PM
72 points
25 comments3 min readLW link

The se­cret of Wikipe­dia’s success

Aaron BergmanApr 14, 2021, 10:18 PM
68 points
11 comments6 min readLW link
(aaronbergman.substack.com)

FAQ: Ad­vice for AI Align­ment Researchers

Rohin ShahApr 26, 2021, 6:59 PM
67 points
2 comments1 min readLW link
(rohinshah.com)

Agents Over Carte­sian World Models

Apr 27, 2021, 2:06 AM
67 points
4 comments27 min readLW link

Solv­ing the whole AGI con­trol prob­lem, ver­sion 0.0001

Steven ByrnesApr 8, 2021, 3:14 PM
63 points
7 comments26 min readLW link

Reflec­tive Bayesianism

abramdemskiApr 6, 2021, 7:48 PM
62 points
27 comments13 min readLW link

A New Cen­ter? [Poli­tics] [Wish­ful Think­ing]

abramdemskiApr 12, 2021, 3:19 PM
62 points
36 comments3 min readLW link

Against “Con­text-Free In­tegrity”

Ben PaceApr 14, 2021, 8:20 AM
62 points
28 comments5 min readLW link

Thiel on se­crets and indefiniteness

Rob BensingerApr 20, 2021, 9:59 PM
60 points
7 comments9 min readLW link

My take on Michael Littman on “The HCI of HAI”

Alex FlintApr 2, 2021, 7:51 PM
59 points
4 comments7 min readLW link

Affordances

abramdemskiApr 2, 2021, 8:53 PM
58 points
18 comments6 min readLW link

You Can Now Embed Flash­card Quizzes in Your LessWrong posts!

spencergApr 19, 2021, 1:44 PM
58 points
25 comments4 min readLW link

Ra­tion­al­ity Cardinality

jimrandomhApr 27, 2021, 10:27 PM
57 points
7 comments1 min readLW link

Young kids catch­ing COVID: how much to worry?

Steven ByrnesApr 20, 2021, 6:03 PM
55 points
22 comments6 min readLW link

Covid 4/​29: Vac­ci­na­tion Slowdown

ZviApr 29, 2021, 1:50 PM
55 points
19 comments15 min readLW link
(thezvi.wordpress.com)