All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 262728 29 30

Don’t align agents to evaluations of plans

TurnTroutNov 26, 2022, 9:16 PM

48 points

49 comments18 min readLW link

What videos should Rational Animations make?

WriterNov 26, 2022, 8:28 PM

30 points

24 comments LW link

The First Filter

adamShimi and Gabriel Alfour

Nov 26, 2022, 7:37 PM

67 points

5 comments1 min readLW link

Respecting your Local Preferences

Scott GarrabrantNov 26, 2022, 7:04 PM

73 points

1 comment4 min readLW link

[Question] Opinions on the sleep synaptic homeostasis hypothesis?

Angela PretoriusNov 26, 2022, 7:01 PM

3 points

0 comments1 min readLW link

Why square errors?

AprillionNov 26, 2022, 1:40 PM

41 points

11 comments2 min readLW link

[Question] Assuming that at least one religion is true, what would you expect it to be?

risediveNov 26, 2022, 8:34 AM

−9 points

9 comments1 min readLW link

Three Alignment Schemas & Their Problems

Shoshannah TekofskyNov 26, 2022, 4:25 AM

19 points

1 comment6 min readLW link

The many types of blog posts

Adam ZernerNov 26, 2022, 3:57 AM

10 points

2 comments4 min readLW link

New Frontiers in Mojibake

Adam ScherlisNov 26, 2022, 2:37 AM

60 points

7 comments6 min readLW link 1 review

(adam.scherlis.com)

Semi-conductor/AI Stock Discussion.

sapphireNov 25, 2022, 11:35 PM

28 points

25 comments1 min readLW link

NEFFA Should Allow Small Children

jefftkNov 25, 2022, 11:00 PM

10 points

2 comments2 min readLW link

(www.jefftk.com)

Podcast: Shoshannah Tekofsky on skilling up in AI safety, visiting Berkeley, and developing novel research ideas

Orpheus16Nov 25, 2022, 8:47 PM

37 points

2 comments9 min readLW link

The man and the tool

pedroalvaradoNov 25, 2022, 7:51 PM

−1 points

0 comments4 min readLW link

[Question] What AI newsletters or substacks about AI do you recommend?

wunanNov 25, 2022, 7:29 PM

6 points

1 comment1 min readLW link

Mechanistic anomaly detection and ELK

paulfchristianoNov 25, 2022, 6:50 PM

138 points

22 comments21 min readLW link

(ai-alignment.com)

The Least Controversial Application of Geometric Rationality

Scott GarrabrantNov 25, 2022, 4:50 PM

60 points

22 comments4 min readLW link

Planes are still decades away from displacing most bird jobs

guzeyNov 25, 2022, 4:49 PM

168 points

13 comments3 min readLW link

Take part in our giant study of cognitive abilities and get a customized report of your strengths and weaknesses!

spencergNov 25, 2022, 4:28 PM

8 points

1 comment1 min readLW link

(www.guidedtrack.com)

Guardian AI (Misaligned systems are all around us.)

Jessica RumbelowNov 25, 2022, 3:55 PM

15 points

6 comments2 min readLW link

Intuitions by ML researchers may get progressively worse concerning likely candidates for transformative AI

Viktor RehnbergNov 25, 2022, 3:49 PM

7 points

0 comments2 min readLW link

Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika, Vikrant Varma, Ramana Kumar and Rohin Shah

Nov 25, 2022, 2:36 PM

39 points

9 comments6 min readLW link

(vkrakovna.wordpress.com)

[Question] Who holds all the USDT?

ChristianKlNov 25, 2022, 11:58 AM

17 points

6 comments1 min readLW link

Fair Collective Efficient Altruism

Jobst HeitzigNov 25, 2022, 9:38 AM

2 points

1 comment5 min readLW link

[Question] If humanity one day discovers that it is a form of disease that threatens to destroy the universe, should it allow itself to be shut down?

ShmiNov 25, 2022, 8:27 AM

4 points

12 comments1 min readLW link

Could a single alien message destroy us?

Writer and Matthew Barnett

Nov 25, 2022, 7:32 AM

61 points

23 comments6 min readLW link

(youtu.be)

How do I start a programming career in the West?

Lao MeinNov 25, 2022, 6:37 AM

38 points

7 comments2 min readLW link

The AI Safety community has four main work groups, Strategy, Governance, Technical and Movement Building

peterslatteryNov 25, 2022, 3:45 AM

1 point

0 comments6 min readLW link

Less Successful Cider Adventures

jefftkNov 25, 2022, 1:50 AM

11 points

1 comment1 min readLW link

(www.jefftk.com)

Gliders in Language Models

Alexandre VariengienNov 25, 2022, 12:38 AM

30 points

11 comments10 min readLW link

On Kelly and altruism

philhNov 24, 2022, 11:40 PM

17 points

6 comments12 min readLW link

(reasonableapproximation.net)

Open technical problem: A Quinean proof of Löb’s theorem, for an easier cartoon guide

Andrew_CritchNov 24, 2022, 9:16 PM

58 points

35 comments3 min readLW link 1 review

[Question] Historical examples of people gaining unusual cognitive abilities?

Nicholas / Heather KrossNov 24, 2022, 7:01 PM

8 points

2 comments1 min readLW link

Corrigibility Via Thought-Process Deference

Thane RuthenisNov 24, 2022, 5:06 PM

18 points

5 comments9 min readLW link

Geometric Exploration, Arithmetic Exploitation

Scott GarrabrantNov 24, 2022, 3:36 PM

126 points

5 comments7 min readLW link

What I Learned Running Refine

adamShimiNov 24, 2022, 2:49 PM

108 points

5 comments4 min readLW link

Covid 11/24/22: Thanks for Good Health

ZviNov 24, 2022, 1:00 PM

26 points

4 comments8 min readLW link

(thezvi.wordpress.com)

[Question] Dumb and ill-posed question: Is conceptual research like this MIRI paper on the shutdown problem/Corrigibility “real”

joraineNov 24, 2022, 5:08 AM

26 points

11 comments1 min readLW link

Clarifying wireheading terminology

leogaoNov 24, 2022, 4:53 AM

66 points

6 comments1 min readLW link

LW Beta Feature: Side-Comments

jimrandomhNov 24, 2022, 1:55 AM

103 points

47 comments1 min readLW link

Against “Classic Style”

Cleo NardoNov 23, 2022, 10:10 PM

67 points

30 comments4 min readLW link

South Bay ACX/LW Meetup

ISNov 23, 2022, 10:05 PM

2 points

0 comments1 min readLW link

Meme Dialects

jefftkNov 23, 2022, 9:30 PM

26 points

1 comment2 min readLW link

(www.jefftk.com)

[Question] When do you visualize (or not) while doing math?

Alex_Altair23 Nov 2022 20:15 UTC

21 points

9 comments1 min readLW link

When AI solves a game, focus on the game’s mechanics, not its theme.

Cleo Nardo23 Nov 2022 19:16 UTC

89 points

7 comments2 min readLW link

The Geometric Expectation

Scott Garrabrant23 Nov 2022 18:05 UTC

159 points

22 comments4 min readLW link

“Far Coordination”

DragonGod23 Nov 2022 17:14 UTC

6 points

17 comments9 min readLW link

Conjecture Second Hiring Round

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

23 Nov 2022 17:11 UTC

92 points

0 comments1 min readLW link

Conjecture: a retrospective after 8 months of work

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

23 Nov 2022 17:10 UTC

180 points

9 comments8 min readLW link

Against a General Factor of Doom

Jeffrey Heninger23 Nov 2022 16:50 UTC

61 points

19 comments4 min readLW link 1 review

(aiimpacts.org)