Humans, chimpanzees and other animals

gjm · 30 May 2023 23:53 UTC
21 points
18 comments · 1 min read · LW link

The case for removing alignment and ML research from the training dataset

beren · 30 May 2023 20:54 UTC
48 points
8 comments · 5 min read · LW link

Why Job Displacement Predictions are Wrong: Explanations of Cognitive Automation

Moritz Wallawitsch · 30 May 2023 20:43 UTC
−4 points
0 comments · 8 min read · LW link

PaLM-2 & GPT-4 in “Extrapolating GPT-N performance”

Lukas Finnveden · 30 May 2023 18:33 UTC
55 points
6 comments · 6 min read · LW link

RoboNet—A new internet protocol for AI

antoniomax · 30 May 2023 17:55 UTC
−13 points
1 comment · 18 min read · LW link

Why I don’t think that the probability that AGI kills everyone is roughly 1 (but rather around 0.995).

Bastumannen · 30 May 2023 17:54 UTC
−6 points
0 comments · 2 min read · LW link

AI X-risk is a possible solution to the Fermi Paradox

magic9mushroom · 30 May 2023 17:42 UTC
11 points
20 comments · 2 min read · LW link

LIMA: Less Is More for Alignment

Ulisse Mini · 30 May 2023 17:10 UTC
16 points
6 comments · 1 min read · LW link
(arxiv.org)

Boomerang—protocol to dissolve some commitment races

Filip Sondej · 30 May 2023 16:21 UTC
37 points
10 comments · 8 min read · LW link

Announcing Apollo Research

30 May 2023 16:17 UTC
217 points
11 comments · 8 min read · LW link

Advice for new alignment people: Info Max

Jonas Hallgren · 30 May 2023 15:42 UTC
27 points
4 comments · 5 min read · LW link

[Question] Who is liable for AI?

jmh · 30 May 2023 13:54 UTC
14 points
4 comments · 1 min read · LW link

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

30 May 2023 11:52 UTC
20 points
0 comments · 6 min read · LW link
(newsletter.safe.ai)

The bullseye framework: My case against AI doom

titotal · 30 May 2023 11:52 UTC
89 points
35 comments · 1 min read · LW link

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Dan H · 30 May 2023 9:05 UTC
372 points
77 comments · 1 min read · LW link
(www.safe.ai)

Theoretical Limitations of Autoregressive Models

Gabriel Wu · 30 May 2023 2:37 UTC
20 points
1 comment · 10 min read · LW link
(gabrieldwu.github.io)

A book review for “Animal Weapons” and cross-applying the lessons to x-risk

Habeeb Abdulfatah · 30 May 2023 0:58 UTC
−6 points
1 comment · 1 min read · LW link
(www.super-linear.org)

Without a trajectory change, the development of AGI is likely to go badly

Max H · 29 May 2023 23:42 UTC
16 points
2 comments · 13 min read · LW link

Winners-take-how-much?

YonatanK · 29 May 2023 21:56 UTC
3 points
2 comments · 3 min read · LW link

Reply to a fertility doctor concerning polygenic embryo screening

GeneSmith · 29 May 2023 21:50 UTC
58 points
6 comments · 8 min read · LW link

Sentience matters

So8res · 29 May 2023 21:25 UTC
143 points
96 comments · 2 min read · LW link

Wikipedia as an introduction to the alignment problem

SoerenMind · 29 May 2023 18:43 UTC
83 points
10 comments · 1 min read · LW link
(en.wikipedia.org)

[Question] What are some of the best introductions/breakdowns of AI existential risk for those unfamiliar?

Isaac King · 29 May 2023 17:04 UTC
17 points
2 comments · 1 min read · LW link

Creating Flashcards with LLMs

Diogo Cruz · 29 May 2023 16:55 UTC
14 points
3 comments · 9 min read · LW link

On the Impossibility of Intelligent Paperclip Maximizers

Michael Simkin · 29 May 2023 16:55 UTC
−21 points
5 comments · 4 min read · LW link

Minimum Viable Exterminator

Richard Horvath · 29 May 2023 16:32 UTC
14 points
5 comments · 5 min read · LW link

An LLM-based “exemplary actor”

Roman Leventov · 29 May 2023 11:12 UTC
16 points
0 comments · 12 min read · LW link

Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”

Roman Leventov · 29 May 2023 11:08 UTC
12 points
10 comments · 30 min read · LW link

Gemini will bring the next big timeline update

p.b. · 29 May 2023 6:05 UTC
50 points
6 comments · 1 min read · LW link

Proposed Alignment Technique: OSNR (Output Sanitization via Noising and Reconstruction) for Safer Usage of Potentially Misaligned AGI

sudo · 29 May 2023 1:35 UTC
14 points
9 comments · 6 min read · LW link

Morality is Accidental & Self-Congratulatory

ymeskhout · 29 May 2023 0:40 UTC
25 points
40 comments · 5 min read · LW link

TinyStories: Small Language Models That Still Speak Coherent English

Ulisse Mini · 28 May 2023 22:23 UTC
66 points
8 comments · 2 min read · LW link
(arxiv.org)

“Membranes” is better terminology than “boundaries” alone

28 May 2023 22:16 UTC
30 points
12 comments · 3 min read · LW link

The king token

p.b. · 28 May 2023 19:18 UTC
17 points
0 comments · 4 min read · LW link

Language Agents Reduce the Risk of Existential Catastrophe

28 May 2023 19:10 UTC
39 points
14 comments · 26 min read · LW link

Devil’s Advocate: Adverse Selection Against Conscientiousness

lionhearted (Sebastian Marshall) · 28 May 2023 17:53 UTC
10 points
2 comments · 1 min read · LW link

Reacts now enabled on 100% of posts, though still just experimenting

Ruby · 28 May 2023 5:36 UTC
88 points
73 comments · 2 min read · LW link

My AI Alignment Research Agenda and Threat Model, right now (May 2023)

Nicholas / Heather Kross · 28 May 2023 3:23 UTC
25 points
0 comments · 6 min read · LW link
(www.thinkingmuchbetter.com)

Kelly betting vs expectation maximization

MorgneticField · 28 May 2023 1:54 UTC
35 points
33 comments · 5 min read · LW link

Why and When Interpretability Work is Dangerous

Nicholas / Heather Kross · 28 May 2023 0:27 UTC
20 points
9 comments · 8 min read · LW link
(www.thinkingmuchbetter.com)

Twin Cities ACX Meetup—June 2023

Timothy M. · 27 May 2023 20:11 UTC
1 point
1 comment · 1 min read · LW link

Project Idea: Challenge Groups for Alignment Researchers

Adam Zerner · 27 May 2023 20:10 UTC
13 points
0 comments · 1 min read · LW link

Introspective Bayes

False Name · 27 May 2023 19:35 UTC
−3 points
2 comments · 16 min read · LW link

Should Rational Animations invite viewers to read content on LessWrong?

Writer · 27 May 2023 19:26 UTC
40 points
9 comments · 3 min read · LW link

Who are the Experts on Cryonics?

Mati_Roy · 27 May 2023 19:24 UTC
30 points
9 comments · 1 min read · LW link
(biostasis.substack.com)

AI and Planet Earth are incompatible.

archeon · 27 May 2023 18:59 UTC
−4 points
2 comments · 1 min read · LW link

South Bay ACX/LW Meetup

IS · 27 May 2023 17:25 UTC
2 points
0 comments · 1 min read · LW link

Hands-On Experience Is Not Magic

Thane Ruthenis · 27 May 2023 16:57 UTC
21 points
14 comments · 5 min read · LW link

Is Deontological AI Safe? [Feedback Draft]

27 May 2023 16:39 UTC
19 points
15 comments · 20 min read · LW link

San Francisco ACX Meetup “First Saturday” June 3, 1 pm

guenael · 27 May 2023 13:58 UTC
1 point
0 comments · 1 min read · LW link