Archive
Clarifying the confusion around inner alignment, by Rauno Arike (May 13, 2022, 11:05 PM) · 31 points · 0 comments · 11 min read · LW link
Costs and benefits of amniocentesis for normal pregnancies, by braces (May 13, 2022, 10:47 PM) · 13 points · 4 comments · 3 min read · LW link
Frame for Take-Off Speeds to inform compute governance & scaling alignment, by Logan Riggs (May 13, 2022, 10:23 PM) · 15 points · 2 comments · 2 min read · LW link
Alignment as Constraints, by Logan Riggs (May 13, 2022, 10:07 PM) · 10 points · 0 comments · 2 min read · LW link
How close to nuclear war did we get over Cuba?, by NathanBarnard (May 13, 2022, 7:58 PM) · 13 points · 0 comments · 10 min read · LW link
Against Time in Agent Models, by johnswentworth (May 13, 2022, 7:55 PM) · 62 points · 13 comments · 3 min read · LW link
Agency As a Natural Abstraction, by Thane Ruthenis (May 13, 2022, 6:02 PM) · 55 points · 9 comments · 13 min read · LW link
Fermi estimation of the impact you might have working on AI safety, by Fabien Roger (May 13, 2022, 5:49 PM) · 10 points · 0 comments · 1 min read · LW link
“Tech company singularities”, and steering them to reduce x-risk, by Andrew_Critch (May 13, 2022, 5:24 PM) · 75 points · 11 comments · 4 min read · LW link
An observation about Hubinger et al.’s framework for learned optimization, by carboniferous_umbraculum (May 13, 2022, 4:20 PM) · 34 points · 9 comments · 8 min read · LW link
[Question] The Economics of a New Energy Source, by casualphysicsenjoyer (May 13, 2022, 2:08 PM) · 2 points · 13 comments · 1 min read · LW link
[Question] Still possible to change username?, by gabrielrecc (May 13, 2022, 1:41 PM) · 7 points · 4 comments · 1 min read · LW link
[Rough notes, BAIS] Human values and cyclical preferences, by pranomostro, Jayjay and Lucie Philippon (May 13, 2022, 1:28 PM) · 5 points · 0 comments · 4 min read · LW link
[Question] Can moderators fix old sequences posts?, by EniScien (May 13, 2022, 12:30 PM) · 10 points · 1 comment · 1 min read · LW link
DeepMind is hiring for the Scalable Alignment and Alignment Teams, by Rohin Shah and Geoffrey Irving (May 13, 2022, 12:17 PM) · 150 points · 34 comments · 9 min read · LW link
Thoughts on AI Safety Camp, by Charlie Steiner (May 13, 2022, 7:16 AM) · 33 points · 8 comments · 7 min read · LW link
Deferring, by owencb (May 12, 2022, 11:56 PM) · 18 points · 2 comments · 11 min read · LW link
RLHF, by Ansh Radhakrishnan (May 12, 2022, 9:18 PM) · 18 points · 5 comments · 5 min read · LW link
[Question] What to do when starting a business in an imminent-AGI world?, by ryan_b (May 12, 2022, 9:07 PM) · 25 points · 7 comments · 1 min read · LW link
Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios, by Evan R. Murphy (May 12, 2022, 8:01 PM) · 58 points · 0 comments · 59 min read · LW link
Introduction to the sequence: Interpretability Research for the Most Important Century, by Evan R. Murphy (May 12, 2022, 7:59 PM) · 16 points · 0 comments · 8 min read · LW link
A tentative dialogue with a Friendly-boxed-super-AGI on brain uploads, by Ramiro P. (May 12, 2022, 7:40 PM) · 1 point · 12 comments · 4 min read · LW link
The Last Paperclip, by Logan Zoellner (May 12, 2022, 7:25 PM) · 63 points · 15 comments · 18 min read · LW link
Deepmind’s Gato: Generalist Agent, by Daniel Kokotajlo (May 12, 2022, 4:01 PM) · 165 points · 62 comments · 1 min read · LW link
“A Generalist Agent”: New DeepMind Publication, by 1a3orn (May 12, 2022, 3:30 PM) · 79 points · 43 comments · 1 min read · LW link
Covid 5/12/22: Other Priorities, by Zvi (May 12, 2022, 1:30 PM) · 31 points · 4 comments · 15 min read · LW link (thezvi.wordpress.com)
[Question] How would public media outlets need to be governed to cover all political views?, by ChristianKl (May 12, 2022, 12:55 PM) · 13 points · 14 comments · 1 min read · LW link
[Question] What’s keeping concerned capabilities gain researchers from leaving the field?, by sovran (May 12, 2022, 12:16 PM) · 19 points · 4 comments · 1 min read · LW link
Positive outcomes under an unaligned AGI takeover, by Yitz (May 12, 2022, 7:45 AM) · 19 points · 10 comments · 3 min read · LW link
[Question] What are your recommendations for technical AI alignment podcasts?, by Evan_Gaensbauer (May 11, 2022, 9:52 PM) · 5 points · 4 comments · 1 min read · LW link
Gracefully correcting uncalibrated shame, by AF2022 (May 11, 2022, 7:51 PM) · −31 points · 34 comments · 4 min read · LW link
[Intro to brain-like-AGI safety] 14. Controlled AGI, by Steven Byrnes (May 11, 2022, 1:17 PM) · 45 points · 25 comments · 20 min read · LW link
ProjectLawful.com: Eliezer’s latest story, past 1M words, by Eliezer Yudkowsky (May 11, 2022, 6:18 AM) · 234 points · 112 comments · 1 min read · LW link · 4 reviews
An Inside View of AI Alignment, by Ansh Radhakrishnan (May 11, 2022, 2:16 AM) · 32 points · 2 comments · 2 min read · LW link
Fighting in various places for a really long time, by KatjaGrace (May 11, 2022, 1:50 AM) · 36 points · 12 comments · 4 min read · LW link (worldspiritsockpuppet.com)
Stuff I might do if I had covid, by KatjaGrace (May 11, 2022, 12:00 AM) · 39 points · 9 comments · 1 min read · LW link (worldspiritsockpuppet.com)
Crises Don’t Need Your Software, by GabrielExists (May 10, 2022, 9:06 PM) · 59 points · 18 comments · 6 min read · LW link
Ceiling Fan Air Filter, by jefftk (May 10, 2022, 2:20 PM) · 18 points · 9 comments · 1 min read · LW link (www.jefftk.com)
The limits of AI safety via debate, by Marius Hobbhahn (May 10, 2022, 1:33 PM) · 35 points · 8 comments · 10 min read · LW link
Examining Armstrong’s category of generalized models, by Morgan_Rogers (May 10, 2022, 9:07 AM) · 14 points · 0 comments · 7 min read · LW link
Dath Ilani Rule of Law, by David Udell (May 10, 2022, 6:17 AM) · 24 points · 25 comments · 4 min read · LW link
AI safety should be made more accessible using non text-based media, by Massimog (May 10, 2022, 3:14 AM) · 2 points · 4 comments · 4 min read · LW link
LessWrong Now Has Dark Mode, by jimrandomh (May 10, 2022, 1:21 AM) · 135 points · 31 comments · 1 min read · LW link
Conditions for mathematical equivalence of Stochastic Gradient Descent and Natural Selection, by Oliver Sourbut (May 9, 2022, 9:38 PM) · 70 points · 19 comments · 8 min read · LW link · 1 review (www.oliversourbut.net)
AI Alignment YouTube Playlists, by jacquesthibs and remember (May 9, 2022, 9:33 PM) · 30 points · 4 comments · 1 min read · LW link
When is AI safety research harmful?, by NathanBarnard (May 9, 2022, 6:19 PM) · 2 points · 0 comments · 8 min read · LW link
A Bird’s Eye View of the ML Field [Pragmatic AI Safety #2], by Dan H and TW123 (May 9, 2022, 5:18 PM) · 163 points · 8 comments · 35 min read · LW link
Introduction to Pragmatic AI Safety [Pragmatic AI Safety #1], by Dan H and TW123 (May 9, 2022, 5:06 PM) · 80 points · 3 comments · 6 min read · LW link
Jobs: Help scale up LM alignment research at NYU, by Sam Bowman (May 9, 2022, 2:12 PM) · 60 points · 1 comment · 1 min read · LW link
Microphone on Electric Mandolin, by jefftk (May 9, 2022, 2:00 PM) · 16 points · 0 comments · 1 min read · LW link (www.jefftk.com)