LessWrong Archive (page 2)
Metaculus and medians
rossry · Aug 6, 2022, 3:34 AM · 18 points · 4 comments · 4 min read · LW link

Announcing the Introduction to ML Safety course
Dan H, TW123, and ozhang · Aug 6, 2022, 2:46 AM · 73 points · 6 comments · 7 min read · LW link

«Boundaries», Part 2: trends in EA’s handling of boundaries
Andrew_Critch · Aug 6, 2022, 12:42 AM · 81 points · 15 comments · 7 min read · LW link

“Just hiring people” is sometimes still actually possible
lc · Aug 5, 2022, 9:44 PM · 38 points · 11 comments · 5 min read · LW link

The need for certainty
Thomas McMurtry · Aug 5, 2022, 8:18 PM · 2 points · 0 comments · 4 min read · LW link

Rant on Problem Factorization for Alignment
johnswentworth · Aug 5, 2022, 7:23 PM · 104 points · 53 comments · 6 min read · LW link

Counterfactuals are Confusing because of an Ontological Shift
Chris_Leong · Aug 5, 2022, 7:03 PM · 17 points · 35 comments · 2 min read · LW link

Orange county ACX/Less-Wrong discussion group and hang-out. (orange county)
Michael Michalchik · Aug 5, 2022, 6:25 PM · 2 points · 0 comments · 1 min read · LW link

Gears-Level Understanding, Deliberate Performance, The Strategic Level
CFAR!Duncan · Aug 5, 2022, 5:11 PM · 30 points · 3 comments · 5 min read · LW link

[Question] COVID-19 Group Testing Post-mortem?
gwern · Aug 5, 2022, 4:32 PM · 72 points · 6 comments · 2 min read · LW link
Where are the red lines for AI?
Karl von Wendt · Aug 5, 2022, 9:34 AM · 26 points · 10 comments · 6 min read · LW link

Bridging Expected Utility Maximization and Optimization
Daniel Herrmann · Aug 5, 2022, 8:18 AM · 25 points · 5 comments · 14 min read · LW link

Deontology and Tool AI
Nathan1123 · Aug 5, 2022, 5:20 AM · 4 points · 5 comments · 6 min read · LW link

An attempt to understand the Complexity of Values
Dalton Mabery · Aug 5, 2022, 4:43 AM · 3 points · 0 comments · 5 min read · LW link

$20K In Bounties for AI Safety Public Materials
Dan H, TW123, and ozhang · Aug 5, 2022, 2:52 AM · 71 points · 9 comments · 6 min read · LW link

Two Kids Crosswise
jefftk · Aug 5, 2022, 2:40 AM · 16 points · 3 comments · 1 min read · LW link (www.jefftk.com)

The Falling Drill
Screwtape · Aug 5, 2022, 12:08 AM · 46 points · 3 comments · 2 min read · LW link

Convergence Towards World-Models: A Gears-Level Model
Thane Ruthenis · Aug 4, 2022, 11:31 PM · 38 points · 1 comment · 13 min read · LW link

Cambist Booking
Screwtape · Aug 4, 2022, 10:40 PM · 20 points · 3 comments · 4 min read · LW link

Calibration Trivia
Screwtape · Aug 4, 2022, 10:31 PM · 12 points · 9 comments · 4 min read · LW link
Monthly Shorts 7/22
Celer · Aug 4, 2022, 10:30 PM · 5 points · 0 comments · 3 min read · LW link (keller.substack.com)

The Pragmascope Idea
johnswentworth · Aug 4, 2022, 9:52 PM · 59 points · 20 comments · 3 min read · LW link

Running a Basic Meetup
Screwtape · Aug 4, 2022, 9:49 PM · 21 points · 1 comment · 2 min read · LW link

Fiber arts, mysterious dodecahedrons, and waiting on “Eureka!”
eukaryote · Aug 4, 2022, 8:37 PM · 124 points · 15 comments · 9 min read · 1 review · LW link (eukaryotewritesblog.com)

[Question] Would “Manhattan Project” style be beneficial or deleterious for AI Alignment?
Valentin2026 · Aug 4, 2022, 7:12 PM · 5 points · 1 comment · 1 min read · LW link

[Question] AI alignment: Would a lazy self-preservation instinct be sufficient?
BrainFrog · Aug 4, 2022, 5:53 PM · −1 points · 4 comments · 1 min read · LW link

Socratic Ducking, OODA Loops, Frame-by-Frame Debugging
CFAR!Duncan · Aug 4, 2022, 5:44 PM · 26 points · 1 comment · 5 min read · LW link

What do ML researchers think about AI in 2022?
KatjaGrace · Aug 4, 2022, 3:40 PM · 221 points · 33 comments · 3 min read · LW link (aiimpacts.org)

Interpretability isn’t Free
Joel Burget · Aug 4, 2022, 3:02 PM · 12 points · 1 comment · 2 min read · LW link

Covid 8/4/22: Rebound
Zvi · Aug 4, 2022, 11:20 AM · 36 points · 0 comments · 11 min read · LW link (thezvi.wordpress.com)
High Reliability Orgs, and AI Companies
Raemon · Aug 4, 2022, 5:45 AM · 86 points · 7 comments · 12 min read · 1 review · LW link

Surprised by ELK report’s counterexample to Debate, IDA
Evan R. Murphy · Aug 4, 2022, 2:12 AM · 18 points · 0 comments · 5 min read · LW link

Clapping Lower
jefftk · Aug 4, 2022, 2:10 AM · 38 points · 7 comments · 1 min read · LW link (www.jefftk.com)

[Question] How do I know if my first post should be a post, or a question?
Nathan1123 · Aug 4, 2022, 1:46 AM · 3 points · 4 comments · 1 min read · LW link

Three pillars for avoiding AGI catastrophe: Technical alignment, deployment decisions, and coordination
LintzA · Aug 3, 2022, 11:15 PM · 24 points · 0 comments · 11 min read · LW link

Precursor checking for deceptive alignment
evhub · Aug 3, 2022, 10:56 PM · 24 points · 0 comments · 14 min read · LW link

Transformer language models are doing something more general
Numendil · Aug 3, 2022, 9:13 PM · 53 points · 6 comments · 2 min read · LW link

[Question] Some doubts about Non Superintelligent AIs
aditya malik · Aug 3, 2022, 7:55 PM · 0 points · 4 comments · 1 min read · LW link

Announcing Squiggle: Early Access
ozziegooen · Aug 3, 2022, 7:48 PM · 51 points · 7 comments · 7 min read · LW link (forum.effectivealtruism.org)

Survey: What (de)motivates you about AI risk?
Daniel_Friedrich · Aug 3, 2022, 7:17 PM · 1 point · 0 comments · 1 min read · LW link (forms.gle)
Externalized reasoning oversight: a research direction for language model alignment
tamera · Aug 3, 2022, 12:03 PM · 136 points · 23 comments · 6 min read · LW link

Open & Welcome Thread—Aug/Sep 2022
Thomas · Aug 3, 2022, 10:22 AM · 9 points · 32 comments · 1 min read · LW link

[Question] How does one recognize information and differentiate it from noise?
M. Y. Zuo · Aug 3, 2022, 3:57 AM · 4 points · 29 comments · 1 min read · LW link

Law-Following AI 4: Don’t Rely on Vicarious Liability
Cullen · Aug 2, 2022, 11:26 PM · 5 points · 2 comments · 3 min read · LW link

Two-year update on my personal AI timelines
Ajeya Cotra · Aug 2, 2022, 11:07 PM · 293 points · 60 comments · 16 min read · LW link

What are the Red Flags for Neural Network Suffering? - Seeds of Science call for reviewers
rogersbacon · Aug 2, 2022, 10:37 PM · 24 points · 6 comments · 1 min read · LW link

Againstness
CFAR!Duncan · Aug 2, 2022, 7:29 PM · 50 points · 8 comments · 9 min read · LW link

(Summary) Sequence Highlights—Thinking Better on Purpose
qazzquimby · Aug 2, 2022, 5:45 PM · 33 points · 3 comments · 11 min read · LW link

Progress links and tweets, 2022-08-02
jasoncrawford · Aug 2, 2022, 5:03 PM · 9 points · 0 comments · 1 min read · LW link (rootsofprogress.org)

[Question] I want to donate some money (not much, just what I can afford) to AGI Alignment research, to whatever organization has the best chance of making sure that AGI goes well and doesn’t kill us all. What are my best options, where can I make the most difference per dollar?
lumenwrites · Aug 2, 2022, 12:08 PM · 15 points · 9 comments · 1 min read · LW link