Twin Cities ACX Meetup—June 2023

Timothy M. 27 May 2023 20:11 UTC

1 point

1 comment 1 min read LW link

Project Idea: Challenge Groups for Alignment Researchers

Adam Zerner 27 May 2023 20:10 UTC

13 points

0 comments 1 min read LW link

Introspective Bayes

False Name 27 May 2023 19:35 UTC

−3 points

2 comments 16 min read LW link

Should Rational Animations invite viewers to read content on LessWrong?

Writer 27 May 2023 19:26 UTC

40 points

9 comments 3 min read LW link

Who are the Experts on Cryonics?

Mati_Roy 27 May 2023 19:24 UTC

30 points

9 comments 1 min read LW link

(biostasis.substack.com)

AI and Planet Earth are incompatible.

archeon 27 May 2023 18:59 UTC

−4 points

2 comments 1 min read LW link

South Bay ACX/LW Meetup

IS 27 May 2023 17:25 UTC

2 points

0 comments 1 min read LW link

Hands-On Experience Is Not Magic

Thane Ruthenis 27 May 2023 16:57 UTC

21 points

14 comments 5 min read LW link

Is Deontological AI Safe? [Feedback Draft]

Dan H and William D'Alessandro

27 May 2023 16:39 UTC

19 points

15 comments 20 min read LW link

San Francisco ACX Meetup “First Saturday” June 3, 1 pm

guenael 27 May 2023 13:58 UTC

1 point

0 comments 1 min read LW link

Papers on protein design

alexlyzhov 27 May 2023 1:18 UTC

9 points

0 comments 3 min read LW link

D&D.Sci 5E: Return of the League of Defenders

aphyer 26 May 2023 20:39 UTC

42 points

11 comments 3 min read LW link

Seeking (Paid) Case Studies on Standards

HoldenKarnofsky 26 May 2023 17:58 UTC

69 points

9 comments 11 min read LW link

Conditional Prediction with Zero-Sum Training Solves Self-Fulfilling Prophecies

Rubi J. Hudson and Johannes Treutlein

26 May 2023 17:44 UTC

88 points

13 comments 24 min read LW link

Request: stop advancing AI capabilities

So8res 26 May 2023 17:42 UTC

153 points

24 comments 1 min read LW link

Bandgaps, Brains, and Bioweapons: The limitations of computational science and what it means for AGI

titotal 26 May 2023 15:57 UTC

36 points

20 comments 1 min read LW link

The American Information Revolution in Global Perspective

jasoncrawford 26 May 2023 12:39 UTC

16 points

1 comment 5 min read LW link

(rootsofprogress.org)

Helio-Selenic Laser Telescope (in SPACE!?)

Alexander Gietelink Oldenziel 26 May 2023 11:24 UTC

8 points

2 comments 4 min read LW link

[Question] Why is violence against AI labs a taboo?

ArisC 26 May 2023 8:00 UTC

−21 points

63 comments 1 min read LW link

Where do you lie on two axes of world manipulability?

Max H 26 May 2023 3:04 UTC

30 points

15 comments 3 min read LW link

Some thoughts on automating alignment research

Lukas Finnveden 26 May 2023 1:50 UTC

30 points

4 comments 6 min read LW link

[Question] What’s your viewpoint on the likelihood of GPT-5 being able to autonomously create, train, and implement an AI superior to GPT-5?

Super AGI 26 May 2023 1:43 UTC

7 points

15 comments 1 min read LW link

Before smart AI, there will be many mediocre or specialized AIs

Lukas Finnveden 26 May 2023 1:38 UTC

57 points

10 comments 9 min read LW link 1 review

how humans are aligned

bhauth 26 May 2023 0:09 UTC

14 points

3 comments 1 min read LW link

[Question] What vegan food resources have you found useful?

Elizabeth 25 May 2023 22:46 UTC

29 points

6 comments 1 min read LW link

Mob and Bailey

Screwtape 25 May 2023 22:14 UTC

78 points

16 comments 7 min read LW link

Look At What’s In Front Of You (Conclusion to The Nuts and Bolts of Naturalism)

LoganStrohl 25 May 2023 19:00 UTC

50 points

1 comment 2 min read LW link

[Market] Will AI xrisk seem to be handled seriously by the end of 2026?

tailcalled 25 May 2023 18:51 UTC

15 points

2 comments 1 min read LW link

(manifold.markets)

[Question] What should my college major be if I want to do AI alignment research?

metachirality 25 May 2023 18:23 UTC

8 points

7 comments 1 min read LW link

Is behavioral safety “solved” in non-adversarial conditions?

Robert_AIZI 25 May 2023 17:56 UTC

26 points

8 comments 2 min read LW link

(aizi.substack.com)

Book Review: How Minds Change

bc4026bd4aaa5b7fe 25 May 2023 17:55 UTC

310 points

52 comments 15 min read LW link

Self-administered EMDR without a therapist is very useful for a lot of things!

EternallyBlissful 25 May 2023 17:54 UTC

49 points

12 comments 11 min read LW link

RecurrentGPT: a loom-type tool with a twist

mishka 25 May 2023 17:09 UTC

10 points

0 comments 3 min read LW link

(arxiv.org)

The Genie in the Bottle: An Introduction to AI Alignment and Risk

Snorkelfarsan 25 May 2023 16:30 UTC

5 points

1 comment 25 min read LW link

AI #13: Potential Algorithmic Improvements

Zvi 25 May 2023 15:40 UTC

45 points

4 comments 67 min read LW link

(thezvi.wordpress.com)

Solving the Mechanistic Interpretability challenges: EIS VII Challenge 2

StefanHex and Marius Hobbhahn

25 May 2023 15:37 UTC

71 points

1 comment 13 min read LW link

Malthusian Competition (not as bad as it seems)

Logan Zoellner 25 May 2023 15:30 UTC

6 points

11 comments 2 min read LW link

You Don’t Always Need Indexes

jefftk 25 May 2023 14:20 UTC

22 points

6 comments 1 min read LW link

(www.jefftk.com)

Theories of Biological Inspiration

Eric Zhang 25 May 2023 13:07 UTC

7 points

3 comments 1 min read LW link

Evaluating strategic reasoning in GPT models

phelps-sg 25 May 2023 11:51 UTC

4 points

1 comment 8 min read LW link

Requirements for a STEM-capable AGI Value Learner (my Case for Less Doom)

RogerDearnaley 25 May 2023 9:26 UTC

33 points

4 comments 15 min read LW link

Alignment solutions for weak AI don’t (necessarily) scale to strong AI

Michael Tontchev 25 May 2023 8:26 UTC

6 points

0 comments 5 min read LW link

[Question] What features would you like to see in a personal forecasting / prediction tracking app?

regnarg 25 May 2023 8:18 UTC

9 points

0 comments 1 min read LW link

Announcing the Confido app: bringing forecasting to everyone

regnarg 25 May 2023 8:18 UTC

6 points

2 comments 10 min read LW link

(forum.effectivealtruism.org)

But What If We Actually Want To Maximize Paperclips?

snerx 25 May 2023 7:13 UTC

−17 points

6 comments 7 min read LW link

Exploiting Newcomb’s Game Show

carterallen 25 May 2023 4:01 UTC

8 points

2 comments 2 min read LW link

DeepMind: Model evaluation for extreme risks

Zach Stein-Perlman 25 May 2023 3:00 UTC

94 points

12 comments 1 min read LW link 1 review

(arxiv.org)

Why I’m Not (Yet) A Full-Time Technical Alignment Researcher

Nicholas / Heather Kross 25 May 2023 1:26 UTC

39 points

21 comments 4 min read LW link

(www.thinkingmuchbetter.com)

Two ideas for alignment, perpetual mutual distrust and induction

APaleBlueDot 25 May 2023 0:56 UTC

1 point

2 comments 4 min read LW link

Evaluating Evidence Reconstructions of Mock Crimes - Submission 2

Alan E Dunne 24 May 2023 22:17 UTC

−1 points

1 comment 3 min read LW link