All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 181920 21 22 23 24 25 26 27 28 29 30 31

Proposed Orthogonality Theses #2-5

rjbgJul 14, 2022, 10:59 PM

8 points

0 comments2 min readLW link

Better Quiddler

jefftkJul 14, 2022, 5:40 PM

17 points

0 comments1 min readLW link

(www.jefftk.com)

Circumventing interpretability: How to defeat mind-readers

Lee SharkeyJul 14, 2022, 4:59 PM

114 points

15 comments33 min readLW link

Covid 7/14/22: BA.2.75 Plus Tax

ZviJul 14, 2022, 2:40 PM

39 points

9 comments8 min readLW link

(thezvi.wordpress.com)

Criticism of EA Criticism Contest

ZviJul 14, 2022, 2:30 PM

108 points

17 comments31 min readLW link 1 review

(thezvi.wordpress.com)

Humans provide an untapped wealth of evidence about alignment

TurnTrout and Quintin Pope

Jul 14, 2022, 2:31 AM

212 points

94 comments9 min readLW link 1 review

[Question] Wacky, risky, anti-inductive intelligence-enhancement methods?

Nicholas / Heather KrossJul 14, 2022, 1:40 AM

20 points

30 comments1 min readLW link

[Question] How to impress students with recent advances in ML?

Charbel-RaphaëlJul 14, 2022, 12:03 AM

12 points

2 comments1 min readLW link

Notes on Love

David GrossJul 13, 2022, 11:35 PM

18 points

3 comments29 min readLW link

Deep learning curriculum for large language model alignment

Jacob_HiltonJul 13, 2022, 9:58 PM

57 points

3 comments1 min readLW link

(github.com)

Artificial Sandwiching: When can we test scalable alignment protocols without humans?

Sam BowmanJul 13, 2022, 9:14 PM

42 points

6 comments5 min readLW link

[Question] Any tips for eliciting one’s own latent knowledge?

MSRayneJul 13, 2022, 9:12 PM

16 points

20 comments2 min readLW link

Goal Alignment Is Robust To the Sharp Left Turn

Thane RuthenisJul 13, 2022, 8:23 PM

43 points

16 comments4 min readLW link

Making decisions using multiple worldviews

Richard_NgoJul 13, 2022, 7:15 PM

50 points

10 comments11 min readLW link

[Question] App idea to help with reading STEM textbooks (feedback request)

DirectedEvolutionJul 13, 2022, 6:28 PM

16 points

8 comments2 min readLW link

MIRI Conversations: Technology Forecasting & Gradualism (Distillation)

CallumMcDougallJul 13, 2022, 3:55 PM

31 points

1 comment20 min readLW link

Passing Up Pay

jefftkJul 13, 2022, 2:10 PM

29 points

8 comments5 min readLW link

(www.jefftk.com)

[Question] How could the universe be infinitely large?

amaraiJul 13, 2022, 1:45 PM

0 points

8 comments1 min readLW link

John von Neumann on how to safely progress with technology

Dalton MaberyJul 13, 2022, 11:07 AM

14 points

0 comments1 min readLW link

Everyone is an Imposter

TharinJul 13, 2022, 8:46 AM

19 points

1 comment9 min readLW link

(echoesandchimes.com)

[Question] Which AI Safety research agendas are the most promising?

Chris_LeongJul 13, 2022, 7:54 AM

27 points

5 comments1 min readLW link

Straw-Steelmanning

Chris van MerwijkJul 13, 2022, 5:48 AM

29 points

2 comments1 min readLW link

Alien Message Contest: Solution

DaemonicSigilJul 13, 2022, 4:07 AM

29 points

2 comments4 min readLW link

[Question] What is wrong with this approach to corrigibility?

Rafael CosmanJul 12, 2022, 10:55 PM

7 points

8 comments1 min readLW link

Acceptability Verification: A Research Agenda

David Udell and evhub

Jul 12, 2022, 8:11 PM

50 points

0 comments1 min readLW link

(docs.google.com)

Progress links and tweets, 2022-07-12

jasoncrawfordJul 12, 2022, 3:30 PM

12 points

0 comments1 min readLW link

(rootsofprogress.org)

Response to Blake Richards: AGI, generality, alignment, & loss functions

Steven ByrnesJul 12, 2022, 1:56 PM

62 points

9 comments15 min readLW link

Three Minimum Pivotal Acts Possible by Narrow AI

Michael SoareverixJul 12, 2022, 9:51 AM

0 points

4 comments2 min readLW link

Mosaic and Palimpsests: Two Shapes of Research

adamShimiJul 12, 2022, 9:05 AM

39 points

3 comments9 min readLW link

[Question] How do you concisely communicate & navigate the politics / culture at your job working at a large corporation or institution?

WillaJul 12, 2022, 3:22 AM

10 points

6 comments1 min readLW link

On how various plans miss the hard bits of the alignment challenge

So8resJul 12, 2022, 2:49 AM

313 points

89 comments29 min readLW link 3 reviews

Rainmaking

WalterLJul 12, 2022, 12:42 AM

26 points

5 comments1 min readLW link

(www.youtube.com)

Book Review: Neal Stephenson’s “Termination Shock”

Tyler SimmonsJul 12, 2022, 12:07 AM

13 points

0 comments30 min readLW link

(www.words-and-dirt.com)

Announcing Future Forum—Apply Now

wANIEL and freemany

Jul 11, 2022, 10:57 PM

8 points

0 comments4 min readLW link

(forum.effectivealtruism.org)

Defining Optimization in a Deeper Way Part 2

J Bostock11 Jul 2022 20:29 UTC

7 points

0 comments4 min readLW link

Marriage, the Giving What We Can Pledge, and the damage caused by vague public commitments

Jeffrey Ladish11 Jul 2022 19:38 UTC

98 points

27 comments6 min readLW link 1 review

Systemization

CFAR!Duncan11 Jul 2022 18:39 UTC

42 points

5 comments12 min readLW link

[Question] How do AI timelines affect how you live your life?

Quadratic Reciprocity11 Jul 2022 13:54 UTC

80 points

50 comments1 min readLW link

Cambridge LW Meetup: Free Speech

Darmani11 Jul 2022 4:36 UTC

7 points

0 comments1 min readLW link

Checksum Sensor Alignment

lsusr11 Jul 2022 3:31 UTC

12 points

2 comments1 min readLW link

The Alignment Problem

lsusr11 Jul 2022 3:03 UTC

47 points

18 comments3 min readLW link

Immanuel Kant and the Decision Theory App Store

Daniel Kokotajlo10 Jul 2022 16:04 UTC

93 points

12 comments5 min readLW link

Metaculus is seeking experienced leaders, researchers & operators for high-impact roles

ChristianWilliams10 Jul 2022 14:27 UTC

9 points

0 comments1 min readLW link

(apply.workable.com)

Avoid the abbreviation “FLOPs” – use “FLOP” or “FLOP/s” instead

Daniel_Eth10 Jul 2022 10:44 UTC

70 points

13 comments1 min readLW link

My Opportunity Costs

abstractapplic10 Jul 2022 10:14 UTC

22 points

3 comments3 min readLW link

Why Portland

Adam Zerner10 Jul 2022 7:20 UTC

25 points

18 comments9 min readLW link

Hessian and Basin volume

Vivek Hebbar10 Jul 2022 6:59 UTC

35 points

10 comments4 min readLW link

Taste & Shaping

CFAR!Duncan10 Jul 2022 5:50 UTC

67 points

1 comment16 min readLW link

Comment on “Propositions Concerning Digital Minds and Society”

Zack_M_Davis10 Jul 2022 5:48 UTC

99 points

12 comments8 min readLW link

Heaven: The last part of dystopia

Existism9 Jul 2022 22:36 UTC

−1 points

1 comment6 min readLW link