All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Fun math facts about 2023

Adam Scherlis1 Jan 2023 23:38 UTC

9 points

6 comments1 min readLW link

The Thingness of Things

TsviBT1 Jan 2023 22:19 UTC

48 points

35 comments10 min readLW link

Thoughts On Expanding the AI Safety Community: Benefits and Challenges of Outreach to Non-Technical Professionals

Yashvardhan Sharma1 Jan 2023 19:21 UTC

4 points

4 comments7 min readLW link

[Question] Would it be good or bad for the US military to get involved in AI risk?

Grant Demaree1 Jan 2023 19:02 UTC

50 points

12 comments1 min readLW link

Better New Year’s Goals through Aligning the Elephant and the Rider

moridinamael1 Jan 2023 17:54 UTC

20 points

0 comments2 min readLW link

(guildoftherose.org)

A Löbian argument pattern for implicit reasoning in natural language: Löbian party invitations

Andrew_Critch1 Jan 2023 17:39 UTC

23 points

8 comments7 min readLW link

woke offline, anti-woke online

Yair Halberstadt1 Jan 2023 8:24 UTC

13 points

12 comments1 min readLW link

Summary of 80k’s AI problem profile

JakubK1 Jan 2023 7:30 UTC

7 points

0 comments5 min readLW link

(forum.effectivealtruism.org)

What percent of people work in moral mazes?

Raemon1 Jan 2023 4:33 UTC

21 points

9 comments4 min readLW link

Recursive Middle Manager Hell

Raemon1 Jan 2023 4:33 UTC

221 points

46 comments11 min readLW link 1 review

Challenge to the notion that anything is (maybe) possible with AGI

Remmelt and flandry19

1 Jan 2023 3:57 UTC

−27 points

4 comments1 min readLW link

(mflb.com)

The Roots of Progress’s 2022 in review

jasoncrawford1 Jan 2023 2:54 UTC

14 points

2 comments15 min readLW link

(rootsofprogress.org)

Investing for a World Transformed by AI

PeterMcCluskey1 Jan 2023 2:47 UTC

67 points

24 comments6 min readLW link 1 review

(bayesianinvestor.com)

Why Free Will is NOT an illusion

Akira Pyinya1 Jan 2023 2:29 UTC

0 points

16 comments1 min readLW link

Localhost Security Messaging

jefftk1 Jan 2023 2:20 UTC

7 points

3 comments1 min readLW link

(www.jefftk.com)

0 and 1 aren’t probabilities

Alok Singh1 Jan 2023 0:09 UTC

2 points

4 comments2 min readLW link

(en.wikipedia.org)

‘simulator’ framing and confusions about LLMs

Beth Barnes31 Dec 2022 23:38 UTC

104 points

11 comments4 min readLW link

Monitoring devices I have loved

Elizabeth31 Dec 2022 22:51 UTC

62 points

13 comments3 min readLW link 1 review

Slack matters more than any outcome

Valentine31 Dec 2022 20:11 UTC

156 points

56 comments19 min readLW link 1 review

To Be Particular About Morality

AGO31 Dec 2022 19:58 UTC

6 points

2 comments7 min readLW link

200 COP in MI: Interpreting Algorithmic Problems

Neel Nanda31 Dec 2022 19:55 UTC

33 points

2 comments10 min readLW link

The Feeling of Idea Scarcity

johnswentworth31 Dec 2022 17:34 UTC

246 points

22 comments5 min readLW link 1 review

Curse of knowledge and Naive realism: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

31 Dec 2022 13:33 UTC

−7 points

1 comment1 min readLW link

(www.lesswrong.com)

[Question] What career advice do you give to software engineers?

Antb31 Dec 2022 12:01 UTC

15 points

4 comments1 min readLW link

[Question] Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers?

simeon_c31 Dec 2022 11:34 UTC

8 points

5 comments1 min readLW link

[Question] In which cases can ChatGPT be used as an aid for thesis or scientific paper writing?

Bob Guran31 Dec 2022 10:50 UTC

1 point

1 comment1 min readLW link

Two Issues with Playing Chicken with the Universe

Chris_Leong31 Dec 2022 6:47 UTC

4 points

4 comments2 min readLW link

Extreme risk neutrality isn’t always wrong

Grant Demaree31 Dec 2022 4:05 UTC

28 points

19 comments4 min readLW link

Verbal parity: What is it and how to measure it? + an edited version of “Against John Searle, Gary Marcus, the Chinese Room thought experiment and its world”

philosophybear31 Dec 2022 3:46 UTC

2 points

0 comments11 min readLW link

Should AI systems have to identify themselves?

Darren McKee31 Dec 2022 2:57 UTC

2 points

2 comments1 min readLW link

[Question] What do you imagine, when you imagine “taking over the world”?

johnswentworth31 Dec 2022 1:04 UTC

22 points

16 comments1 min readLW link

A few thoughts on my self-study for alignment research

Thomas Kehrenberg30 Dec 2022 22:05 UTC

6 points

0 comments2 min readLW link

Christmas Microscopy

jefftk30 Dec 2022 21:10 UTC

27 points

0 comments1 min readLW link

(www.jefftk.com)

What “upside” of AI?

False Name30 Dec 2022 20:58 UTC

0 points

5 comments4 min readLW link

Evidence on recursive self-improvement from current ML

beren30 Dec 2022 20:53 UTC

31 points

12 comments6 min readLW link

[Question] Is ChatGPT TAI?

Amal 30 Dec 2022 19:44 UTC

14 points

5 comments1 min readLW link

My thoughts on OpenAI’s alignment plan

Akash30 Dec 2022 19:33 UTC

55 points

3 comments20 min readLW link

Beyond Rewards and Values: A Non-dualistic Approach to Universal Intelligence

Akira Pyinya30 Dec 2022 19:05 UTC

10 points

4 comments14 min readLW link

10 Years of LessWrong

SebastianG 30 Dec 2022 17:15 UTC

73 points

2 comments4 min readLW link

Chatbots as a Publication Format

derek shiller30 Dec 2022 14:11 UTC

6 points

6 comments4 min readLW link

Human sexuality as an interesting case study of alignment

beren30 Dec 2022 13:37 UTC

39 points

26 comments3 min readLW link

The Twitter Files: Covid Edition

Zvi30 Dec 2022 13:30 UTC

32 points

2 comments10 min readLW link

(thezvi.wordpress.com)

Worldly Positions archive, briefly with private drafts

KatjaGrace30 Dec 2022 12:20 UTC

11 points

0 comments1 min readLW link

(worldspiritsockpuppet.com)

Models Don’t “Get Reward”

Sam Ringer30 Dec 2022 10:37 UTC

313 points

61 comments5 min readLW link 1 review

The hyperfinite timeline

Alok Singh30 Dec 2022 9:30 UTC

3 points

6 comments1 min readLW link

(alok.github.io)

Reactive devaluation: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

30 Dec 2022 9:02 UTC

−15 points

9 comments1 min readLW link

Things I carry almost every day, as of late December 2022

DanielFilan30 Dec 2022 7:40 UTC

38 points

9 comments5 min readLW link

(danielfilan.com)

More ways to spot abysses

KatjaGrace30 Dec 2022 6:30 UTC

21 points

1 comment1 min readLW link

(worldspiritsockpuppet.com)

Language models are nearly AGIs but we don’t notice it because we keep shifting the bar

philosophybear30 Dec 2022 5:15 UTC

105 points

13 comments7 min readLW link

Progress links and tweets, 2022-12-29

jasoncrawford30 Dec 2022 4:54 UTC

12 points

0 comments1 min readLW link

(rootsofprogress.org)