And the word was “God” · pchvykov · Aug 30, 2022, 9:13 PM · −22 points · 4 comments · 3 min read
Worlds Where Iterative Design Fails · johnswentworth · Aug 30, 2022, 8:48 PM · 208 points · 30 comments · 10 min read · 1 review
Inner Alignment via Superpowers · JamesH, Thomas Larsen and Jeremy Gillen · Aug 30, 2022, 8:01 PM · 37 points · 13 comments · 4 min read
ML Model Attribution Challenge [Linkpost] · aog · Aug 30, 2022, 7:34 PM · 11 points · 0 comments · 1 min read · (mlmac.io)
How likely is deceptive alignment? · evhub · Aug 30, 2022, 7:34 PM · 104 points · 28 comments · 60 min read
Built-In Bundling For Faster Loading · jefftk · Aug 30, 2022, 7:20 PM · 15 points · 0 comments · 2 min read · (www.jefftk.com)
[Question] A bayesian updating on expert opinions · amarai · Aug 30, 2022, 11:56 AM · 1 point · 1 comment · 1 min read
Any Utilitarianism Makes Sense As Policy · George3d6 · Aug 30, 2022, 9:55 AM · 6 points · 6 comments · 7 min read · (www.epistem.ink)
A gentle primer on caring, including in strange senses, with applications · Kaarel · Aug 30, 2022, 8:05 AM · 10 points · 4 comments · 18 min read
Modified Guess Culture · konstell · Aug 30, 2022, 2:30 AM · 5 points · 5 comments · 1 min read · (konstell.com)
[Question] What is the best critique of AI existential risk arguments? · joshc · Aug 30, 2022, 2:18 AM · 6 points · 11 comments · 1 min read
How to plan for a radically uncertain future? · Kerry · Aug 30, 2022, 2:14 AM · 57 points · 35 comments · 1 min read
EA & LW Forums Weekly Summary (21 Aug – 27 Aug 22’) · Zoe Williams · Aug 30, 2022, 1:42 AM · 57 points · 4 comments · 12 min read
Can We Align a Self-Improving AGI? · Peter S. Park · Aug 30, 2022, 12:14 AM · 8 points · 5 comments · 11 min read
On the nature of help—a framework for helping · Faustify · Aug 29, 2022, 8:42 PM · 3 points · 2 comments · 13 min read
Fundamental Uncertainty: Chapter 4 - Why don’t we do what we think we should? · Gordon Seidoh Worley · Aug 29, 2022, 7:25 PM · 15 points · 6 comments · 13 min read
[Question] How can I reconcile the two most likely requirements for humanities near-term survival. · Erlja Jkdf. · Aug 29, 2022, 6:46 PM · 1 point · 6 comments · 1 min read
*New* Canada AI Safety & Governance community · Wyatt Tessari L'Allié · Aug 29, 2022, 6:45 PM · 21 points · 0 comments · 1 min read
Are Generative World Models a Mesa-Optimization Risk? · Thane Ruthenis · Aug 29, 2022, 6:37 PM · 14 points · 2 comments · 3 min read
Sequencing Intro · jefftk · Aug 29, 2022, 5:50 PM · 39 points · 3 comments · 5 min read · (www.jefftk.com)
How Do AI Timelines Affect Existential Risk? · Stephen McAleese · Aug 29, 2022, 4:57 PM · 7 points · 9 comments · 23 min read
How might we align transformative AI if it’s developed very soon? · HoldenKarnofsky · Aug 29, 2022, 3:42 PM · 140 points · 55 comments · 45 min read · 1 review
An Audio Introduction to Nick Bostrom · PeterH · Aug 29, 2022, 8:50 AM · 12 points · 0 comments · 1 min read · (forum.effectivealtruism.org)
Please Do Fight the Hypothetical · Lone Pine · Aug 29, 2022, 8:35 AM · 18 points · 6 comments · 3 min read
Have you considered getting rid of death? · Willa · Aug 29, 2022, 1:31 AM · 20 points · 19 comments · 1 min read · (immortalityisgreat.substack.com)
(My understanding of) What Everyone in Technical Alignment is Doing and Why · Thomas Larsen and elifland · Aug 29, 2022, 1:23 AM · 413 points · 90 comments · 37 min read · 1 review
Breaking down the training/deployment dichotomy · Erik Jenner · Aug 28, 2022, 9:45 PM · 30 points · 3 comments · 3 min read
More Clothes Over Time? · jefftk · Aug 28, 2022, 8:30 PM · 30 points · 1 comment · 1 min read · (www.jefftk.com)
The Expanding Moral Cinematic Universe · Raemon · Aug 28, 2022, 6:42 PM · 67 points · 9 comments · 14 min read
An Introduction to Current Theories of Consciousness · hohenheim · Aug 28, 2022, 5:55 PM · 60 points · 43 comments · 49 min read
[Linkpost] Can lab-grown brains become conscious? · Jack R · Aug 28, 2022, 5:45 PM · 14 points · 3 comments · 1 min read
Robert Long On Why Artificial Sentience Might Matter · Michaël Trazzi · Aug 28, 2022, 5:30 PM · 29 points · 5 comments · 5 min read · (theinsideview.ai)
Artificial Moral Advisors: A New Perspective from Moral Psychology · David Gross · Aug 28, 2022, 4:37 PM · 25 points · 1 comment · 1 min read · (dl.acm.org)
Pronunciations · Solenoid_Entity · Aug 28, 2022, 11:43 AM · 15 points · 7 comments · 2 min read
First thing AI will do when it takes over is get fission going · visiax · Aug 28, 2022, 5:56 AM · −2 points · 0 comments · 1 min read
Who ordered alignment’s apple? · Eleni Angelou · Aug 28, 2022, 4:05 AM · 6 points · 3 comments · 3 min read
Sufficiently many Godzillas as an alignment strategy · 142857 · Aug 28, 2022, 12:08 AM · 8 points · 3 comments · 1 min read
[Question] What would you expect a massive multimodal online federated learner to be capable of? · Aryeh Englander · Aug 27, 2022, 5:31 PM · 13 points · 4 comments · 1 min read
Basin broadness depends on the size and number of orthogonal features · CallumMcDougall, Avery and Lucius Bushnaq · Aug 27, 2022, 5:29 PM · 36 points · 21 comments · 6 min read
Informal semantics and Orders · Q Home · Aug 27, 2022, 4:17 AM · 14 points · 10 comments · 26 min read
Help Understanding Preferences And Evil · Netcentrica · Aug 27, 2022, 3:42 AM · 6 points · 7 comments · 2 min read
Contra Dance Contact Tracing · jefftk · Aug 27, 2022, 1:50 AM · 9 points · 0 comments · 1 min read · (www.jefftk.com)
Annual AGI Benchmarking Event · Lawrence Phillips · Aug 27, 2022, 12:06 AM · 24 points · 3 comments · 2 min read · (www.metaculus.com)
Is there a benefit in low capability AI Alignment research? · Letti · Aug 26, 2022, 11:51 PM · 1 point · 1 comment · 2 min read
AI Risk in Terms of Unstable Nuclear Software · Thane Ruthenis · Aug 26, 2022, 6:49 PM · 30 points · 1 comment · 6 min read
Taking the parameters which seem to matter and rotating them until they don’t · Garrett Baker · Aug 26, 2022, 6:26 PM · 120 points · 48 comments · 1 min read
ACX Meetups Everywhere List · Scott Alexander · Aug 26, 2022, 6:12 PM · 63 points · 1 comment · 41 min read
What’s the Most Impressive Thing That GPT-4 Could Plausibly Do? · bayesed · Aug 26, 2022, 3:34 PM · 24 points · 22 comments · 1 min read
[Question] Is population collapse due to low birth rates a problem? · mukashi · Aug 26, 2022, 3:28 PM · 6 points · 36 comments · 1 min read
[Question] Could you please share a tool to help with reasoning or make better decisions? · hodovani · Aug 26, 2022, 10:36 AM · 1 point · 0 comments · 1 min read