Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds

JakubK · 2 May 2023 22:50 UTC
10 points
0 comments · 1 min read · LW link

A Case for the Least Forgiving Take On Alignment

Thane Ruthenis · 2 May 2023 21:34 UTC
100 points
84 comments · 22 min read · LW link

Are Emergent Abilities of Large Language Models a Mirage? [linkpost]

Matthew Barnett · 2 May 2023 21:01 UTC
53 points
19 comments · 1 min read · LW link
(arxiv.org)

Does descaling a kettle help? Theory and practice

philh · 2 May 2023 20:20 UTC
35 points
25 comments · 8 min read · LW link
(reasonableapproximation.net)

Avoiding xrisk from AI doesn’t mean focusing on AI xrisk

Stuart_Armstrong · 2 May 2023 19:27 UTC
64 points
7 comments · 3 min read · LW link

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

2 May 2023 18:41 UTC
32 points
0 comments · 5 min read · LW link
(newsletter.safe.ai)

My best system yet: text-based project management

jt · 2 May 2023 17:44 UTC
6 points
8 comments · 5 min read · LW link

[Question] What’s the state of AI safety in Japan?

ChristianKl · 2 May 2023 17:06 UTC
5 points
1 comment · 1 min read · LW link

Five Worlds of AI (by Scott Aaronson and Boaz Barak)

mishka · 2 May 2023 13:23 UTC
21 points
5 comments · 1 min read · LW link
(scottaaronson.blog)

Systems that cannot be unsafe cannot be safe

Davidmanheim · 2 May 2023 8:53 UTC
62 points
27 comments · 2 min read · LW link

AGI safety career advice

Richard_Ngo · 2 May 2023 7:36 UTC
132 points
24 comments · 13 min read · LW link

An Impossibility Proof Relevant to the Shutdown Problem and Corrigibility

Audere · 2 May 2023 6:52 UTC
65 points
13 comments · 9 min read · LW link

Some Thoughts on Virtue Ethics for AIs

peligrietzer · 2 May 2023 5:46 UTC
76 points
8 comments · 4 min read · LW link

Technological unemployment as another test for rationalist winning

RomanHauksson · 2 May 2023 4:16 UTC
14 points
5 comments · 1 min read · LW link

The Moral Copernican Principle

Legionnaire · 2 May 2023 3:25 UTC
5 points
7 comments · 2 min read · LW link

Open & Welcome Thread—May 2023

Ruby · 2 May 2023 2:58 UTC
21 points
41 comments · 1 min read · LW link

Summaries of top forum posts (24th − 30th April 2023)

Zoe Williams · 2 May 2023 2:30 UTC
12 points
1 comment · 1 min read · LW link

AXRP Episode 21 - Interpretability for Engineers with Stephen Casper

DanielFilan · 2 May 2023 0:50 UTC
12 points
1 comment · 66 min read · LW link

Getting Your Eyes On

LoganStrohl · 2 May 2023 0:33 UTC
58 points
11 comments · 14 min read · LW link

What 2025 looks like

Ruby · 1 May 2023 22:53 UTC
75 points
17 comments · 15 min read · LW link

[Question] Natural Selection vs Gradient Descent

CuriousApe11 · 1 May 2023 22:16 UTC
4 points
3 comments · 1 min read · LW link

A[I] Zombie Apocalypse Is Already Upon Us

NickHarris · 1 May 2023 22:02 UTC
−6 points
4 comments · 2 min read · LW link

Geoff Hinton Quits Google

Adam Shai · 1 May 2023 21:03 UTC
98 points
14 comments · 1 min read · LW link

The Apprentice Thread 2

hath · 1 May 2023 20:09 UTC
50 points
19 comments · 1 min read · LW link

Budapest, Hungary – ACX Meetups Everywhere Spring 2023

1 May 2023 17:36 UTC
4 points
0 comments · 1 min read · LW link

In favor of steelmanning

jp · 1 May 2023 17:12 UTC
36 points
6 comments · 1 min read · LW link

Shah (DeepMind) and Leahy (Conjecture) Discuss Alignment Cruxes

1 May 2023 16:47 UTC
96 points
10 comments · 30 min read · LW link

Distinguishing misuse is difficult and uncomfortable

lemonhope · 1 May 2023 16:23 UTC
17 points
3 comments · 1 min read · LW link

[Question] Does agency necessarily imply self-preservation instinct?

Mislav Jurić · 1 May 2023 16:06 UTC
5 points
8 comments · 1 min read · LW link

What Boston Can Teach Us About What a Woman Is

ymeskhout · 1 May 2023 15:34 UTC
18 points
45 comments · 12 min read · LW link

The Rocket Alignment Problem, Part 2

Zvi · 1 May 2023 14:30 UTC
40 points
20 comments · 9 min read · LW link
(thezvi.wordpress.com)

Socialist Democratic-Republic GAME: 12 Amendments to the Constitutions of the Free World

monkymind · 1 May 2023 13:13 UTC
−34 points
0 comments · 1 min read · LW link

[Question] Where is all this evidence of UFOs?

Logan Zoellner · 1 May 2023 12:13 UTC
29 points
42 comments · 1 min read · LW link

LessWrong Community Weekend 2023 [Applications now closed]

Henry Prowbell · 1 May 2023 9:31 UTC
43 points
0 comments · 6 min read · LW link

LessWrong Community Weekend 2023 [Applications now closed]

Henry Prowbell · 1 May 2023 9:08 UTC
89 points
0 comments · 6 min read · LW link

[Question] In AI Risk what is the base model of the AI?

jmh · 1 May 2023 3:25 UTC
3 points
1 comment · 1 min read · LW link

Hell is Game Theory Folk Theorems

jessicata · 1 May 2023 3:16 UTC
82 points
101 comments · 5 min read · LW link
(unstableontology.com)

Safety standards: a framework for AI regulation

joshc · 1 May 2023 0:56 UTC
19 points
0 comments · 8 min read · LW link

neuron spike computational capacity

bhauth · 1 May 2023 0:28 UTC
16 points
0 comments · 2 min read · LW link

Cult of Error

bayesyatina · 30 Apr 2023 23:33 UTC
5 points
2 comments · 3 min read · LW link

How can one rationally have very high or very low probabilities of extinction in a pre-paradigmatic field?

Shmi · 30 Apr 2023 21:53 UTC
39 points
15 comments · 1 min read · LW link

A small update to the Sparse Coding interim research report

30 Apr 2023 19:54 UTC
61 points
5 comments · 1 min read · LW link

Discussion about AI Safety funding (FB transcript)

Akash · 30 Apr 2023 19:05 UTC
75 points
8 comments · 1 min read · LW link

Support me in a Week-Long Picketing Campaign Near OpenAI’s HQ: Seeking Support and Ideas from the LessWrong Community

Percy · 30 Apr 2023 17:48 UTC
−26 points
15 comments · 1 min read · LW link

money ≠ value

stonefly · 30 Apr 2023 17:47 UTC
2 points
3 comments · 3 min read · LW link

Vaccine Policies Need Updating

jefftk · 30 Apr 2023 17:20 UTC
11 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Fundamental Uncertainty: Chapter 7 - Why is truth useful?

Gordon Seidoh Worley · 30 Apr 2023 16:48 UTC
10 points
3 comments · 10 min read · LW link

Simulators Increase the Likelihood of Alignment by Default

Wuschel Schulz · 30 Apr 2023 16:32 UTC
13 points
1 comment · 5 min read · LW link

Connectomics seems great from an AI x-risk perspective

Steven Byrnes · 30 Apr 2023 14:38 UTC
92 points
6 comments · 9 min read · LW link

The voyage of novelty

TsviBT · 30 Apr 2023 12:52 UTC
11 points
0 comments · 6 min read · LW link