Deferring

owencb · May 12, 2022, 11:56 PM
18 points
2 comments · 11 min read

RLHF

Ansh Radhakrishnan · May 12, 2022, 9:18 PM
18 points
5 comments · 5 min read

[Question] What to do when starting a business in an imminent-AGI world?

ryan_b · May 12, 2022, 9:07 PM
25 points
7 comments · 1 min read

Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios

Evan R. Murphy · May 12, 2022, 8:01 PM
58 points
0 comments · 59 min read

Introduction to the sequence: Interpretability Research for the Most Important Century

Evan R. Murphy · May 12, 2022, 7:59 PM
16 points
0 comments · 8 min read

A tentative dialogue with a Friendly-boxed-super-AGI on brain uploads

Ramiro P. · May 12, 2022, 7:40 PM
1 point
12 comments · 4 min read

The Last Paperclip

Logan Zoellner · May 12, 2022, 7:25 PM
63 points
15 comments · 18 min read

Deepmind’s Gato: Generalist Agent

Daniel Kokotajlo · May 12, 2022, 4:01 PM
165 points
62 comments · 1 min read

“A Generalist Agent”: New DeepMind Publication

1a3orn · May 12, 2022, 3:30 PM
79 points
43 comments · 1 min read

Covid 5/12/22: Other Priorities

Zvi · May 12, 2022, 1:30 PM
31 points
4 comments · 15 min read
(thezvi.wordpress.com)

[Question] How would public media outlets need to be governed to cover all political views?

ChristianKl · May 12, 2022, 12:55 PM
13 points
14 comments · 1 min read

[Question] What’s keeping concerned capabilities gain researchers from leaving the field?

sovran · May 12, 2022, 12:16 PM
19 points
4 comments · 1 min read

Positive outcomes under an unaligned AGI takeover

Yitz · May 12, 2022, 7:45 AM
19 points
10 comments · 3 min read

[Question] What are your recommendations for technical AI alignment podcasts?

Evan_Gaensbauer · May 11, 2022, 9:52 PM
5 points
4 comments · 1 min read

Gracefully correcting uncalibrated shame

AF2022 · May 11, 2022, 7:51 PM
−31 points
34 comments · 4 min read

[Intro to brain-like-AGI safety] 14. Controlled AGI

Steven Byrnes · May 11, 2022, 1:17 PM
45 points
25 comments · 20 min read

ProjectLawful.com: Eliezer’s latest story, past 1M words

Eliezer Yudkowsky · May 11, 2022, 6:18 AM
234 points
112 comments · 1 min read · 4 reviews

An Inside View of AI Alignment

Ansh Radhakrishnan · May 11, 2022, 2:16 AM
32 points
2 comments · 2 min read

Fighting in various places for a really long time

KatjaGrace · May 11, 2022, 1:50 AM
36 points
12 comments · 4 min read
(worldspiritsockpuppet.com)

Stuff I might do if I had covid

KatjaGrace · May 11, 2022, 12:00 AM
39 points
9 comments · 1 min read
(worldspiritsockpuppet.com)

Crises Don’t Need Your Software

GabrielExists · May 10, 2022, 9:06 PM
59 points
18 comments · 6 min read

Ceiling Fan Air Filter

jefftk · May 10, 2022, 2:20 PM
18 points
9 comments · 1 min read
(www.jefftk.com)

The limits of AI safety via debate

Marius Hobbhahn · May 10, 2022, 1:33 PM
35 points
8 comments · 10 min read

Examining Armstrong’s category of generalized models

Morgan_Rogers · May 10, 2022, 9:07 AM
14 points
0 comments · 7 min read

Dath Ilani Rule of Law

David Udell · May 10, 2022, 6:17 AM
24 points
25 comments · 4 min read

AI safety should be made more accessible using non text-based media

Massimog · May 10, 2022, 3:14 AM
2 points
4 comments · 4 min read

LessWrong Now Has Dark Mode

jimrandomh · May 10, 2022, 1:21 AM
135 points
31 comments · 1 min read

Conditions for mathematical equivalence of Stochastic Gradient Descent and Natural Selection

Oliver Sourbut · May 9, 2022, 9:38 PM
70 points
19 comments · 8 min read · 1 review
(www.oliversourbut.net)

AI Alignment YouTube Playlists

May 9, 2022, 9:33 PM
30 points
4 comments · 1 min read

When is AI safety research harmful?

NathanBarnard · May 9, 2022, 6:19 PM
2 points
0 comments · 8 min read

A Bird’s Eye View of the ML Field [Pragmatic AI Safety #2]

May 9, 2022, 5:18 PM
163 points
8 comments · 35 min read

Introduction to Pragmatic AI Safety [Pragmatic AI Safety #1]

May 9, 2022, 5:06 PM
80 points
3 comments · 6 min read

Jobs: Help scale up LM alignment research at NYU

Sam Bowman · May 9, 2022, 2:12 PM
60 points
1 comment · 1 min read

Microphone on Electric Mandolin

jefftk · May 9, 2022, 2:00 PM
16 points
0 comments · 1 min read
(www.jefftk.com)

[Question] Thought experiment: Imagine you were assigned to help a random person in your community become as peaceful and joyful as the most peaceful and joyful person you’d ever met. What would you try?

nonzerosum · May 9, 2022, 1:53 PM
5 points
5 comments · 1 min read

[Question] Willing to be your music mentor in exchange for video editing mentorship

monkymind · May 9, 2022, 11:57 AM
8 points
0 comments · 1 min read

Updating Utility Functions

May 9, 2022, 9:44 AM
41 points
6 comments · 8 min read

Transcripts of interviews with AI researchers

Vael Gates · May 9, 2022, 5:57 AM
170 points
9 comments · 2 min read

[Scribble] Bad Reasons Behind Different Systems and a Story with No Good Moral

Rana Dexsin · May 9, 2022, 5:21 AM
9 points
0 comments · 5 min read

[Question] What is the best day to celebrate Smallpox Eradication Day?

Orborde · May 9, 2022, 4:02 AM
7 points
6 comments · 1 min read

A reason behind bad systems, and moral implications of seeing this reason

Edward Pascal · May 9, 2022, 3:16 AM
4 points
12 comments · 2 min read

An Alternative Interpretation of Physics

dadadarren · May 9, 2022, 12:52 AM
18 points
10 comments · 5 min read
(www.sleepingbeautyproblem.com)

Ion Implantation: Theory, Equipment, Process, Alternatives

nomagicpill · May 8, 2022, 10:30 PM
5 points
0 comments · 16 min read
(210ethan.github.io)

[Question] Long COVID risk: How to maintain an up to date risk assessment so we can go back to normal life?

Sameerishere · May 8, 2022, 7:56 PM
19 points
34 comments · 1 min read

Demonstrating MWI by interfering human simulations

Yair Halberstadt · May 8, 2022, 5:28 PM
12 points
25 comments · 2 min read

Notes from a conversation with Ing. Agr. Adriana Balzarini

Pablo Repetto · May 8, 2022, 3:56 PM
5 points
0 comments · 2 min read
(pabloernesto.github.io)

Elementary Infra-Bayesianism

Jan · May 8, 2022, 12:23 PM
41 points
3 comments · 7 min read
(universalprior.substack.com)

Cambridge LW Meetup: Books That Change

May 8, 2022, 5:23 AM
5 points
0 comments · 1 min read

Video and Transcript of Presentation on Existential Risk from Power-Seeking AI

Joe Carlsmith · May 8, 2022, 3:50 AM
20 points
1 comment · 29 min read

[Question] Algorithmic formalization of FDT?

Shmi · May 8, 2022, 1:36 AM
12 points
8 comments · 1 min read