All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Three camps in AI x-risk discussions: My personal very oversimplified overview

Aryeh Englander4 Jul 2023 20:42 UTC

21 points

0 comments1 min readLW link

Six (and a half) intuitions for SVD

CallumMcDougall4 Jul 2023 19:23 UTC

70 points

1 comment1 min readLW link

Animal Weapons: Lessons for Humans in the Age of X-Risk

Damin Curtis4 Jul 2023 18:14 UTC

3 points

0 comments10 min readLW link

Apocalypse Prepping—Concise SHTF guide to prepare for AGI doomsday

prepper4 Jul 2023 17:41 UTC

−7 points

9 comments1 min readLW link

(prepper.i2phides.me)

Ways I Expect AI Regulation To Increase Extinction Risk

1a3orn4 Jul 2023 17:32 UTC

227 points

32 comments7 min readLW link

AI labs’ statements on governance

Zach Stein-Perlman4 Jul 2023 16:30 UTC

30 points

0 comments36 min readLW link

AIs teams will probably be more superintelligent than individual AIs

Robert_AIZI4 Jul 2023 14:06 UTC

3 points

1 comment2 min readLW link

(aizi.substack.com)

What I Think About When I Think About History

Jacob G-W4 Jul 2023 14:02 UTC

2 points

4 comments3 min readLW link

(g-w1.github.io)

My Time As A Goddess

Evenstar4 Jul 2023 13:14 UTC

30 points

5 comments6 min readLW link

Twitter Twitches

Zvi4 Jul 2023 13:00 UTC

34 points

9 comments7 min readLW link

(thezvi.wordpress.com)

Rational Unilateralists Aren’t So Cursed

SCP4 Jul 2023 12:19 UTC

47 points

5 comments6 min readLW link

[Question] The literature on aluminum adjuvants is very suspicious. Small IQ tax is plausible—can any experts help me estimate it?

mikes4 Jul 2023 9:33 UTC

58 points

39 comments3 min readLW link

Two Percolation Puzzles

Adam Scherlis4 Jul 2023 5:34 UTC

43 points

14 comments1 min readLW link

(adam.scherlis.com)

Mechanistic Interpretability is Being Pursued for the Wrong Reasons

Cole Wyeth4 Jul 2023 2:17 UTC

8 points

0 comments7 min readLW link

(colewyeth.com)

Should you announce your bets publicly?

Ege Erdil4 Jul 2023 0:11 UTC

15 points

1 comment4 min readLW link

Ten Levels of AI Alignment Difficulty

Sammy Martin3 Jul 2023 20:20 UTC

121 points

14 comments12 min readLW link

Security, Cryptograhy AI Workshop in SF

Allison Duettmann3 Jul 2023 19:01 UTC

7 points

0 comments1 min readLW link

[Question] What in your opinion is the biggest open problem in AI alignment?

tailcalled3 Jul 2023 16:34 UTC

39 points

35 comments1 min readLW link

A Subtle Selection Effect in Overconfidence Studies

Kevin Dorst3 Jul 2023 14:43 UTC

24 points

0 comments6 min readLW link

(kevindorst.substack.com)

Monthly Roundup #8: July 2023

Zvi3 Jul 2023 13:20 UTC

40 points

4 comments46 min readLW link

(thezvi.wordpress.com)

Complex Signs Bad

Evenstar3 Jul 2023 13:09 UTC

5 points

2 comments3 min readLW link

6/23

Celer3 Jul 2023 6:30 UTC

8 points

0 comments10 min readLW link

(keller.substack.com)

Marginal charity

Pat Myron3 Jul 2023 2:13 UTC

3 points

1 comment1 min readLW link

My Central Alignment Priority (2 July 2023)

Nicholas / Heather Kross3 Jul 2023 1:46 UTC

12 points

1 comment3 min readLW link

My Alignment Timeline

Nicholas / Heather Kross3 Jul 2023 1:04 UTC

22 points

0 comments2 min readLW link

Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?

gwern3 Jul 2023 0:48 UTC

425 points

54 comments7 min readLW link

(www.youtube.com)

Frames in context

Richard_Ngo3 Jul 2023 0:38 UTC

39 points

9 comments6 min readLW link

Meta-rationality and frames

Richard_Ngo3 Jul 2023 0:33 UTC

64 points

2 comments5 min readLW link

VC Theory Overview

Joar Skalse2 Jul 2023 22:45 UTC

11 points

2 comments11 min readLW link

Sources of evidence in Alignment

Martín Soto2 Jul 2023 20:38 UTC

20 points

0 comments11 min readLW link

Quantitative cruxes in Alignment

Martín Soto2 Jul 2023 20:38 UTC

19 points

0 comments23 min readLW link

Going Crazy and Getting Better Again

Evenstar2 Jul 2023 18:55 UTC

139 points

13 comments7 min readLW link 1 review

Shall We Throw A Huge Party Before AGI Bids Us Adieu?

GeorgeMan2 Jul 2023 17:56 UTC

−1 points

6 comments1 min readLW link

Why it’s so hard to talk about Consciousness

Rafael Harth2 Jul 2023 15:56 UTC

131 points

159 comments9 min readLW link 1 review

How Smart Are Humans?

Joar Skalse2 Jul 2023 15:46 UTC

9 points

19 comments2 min readLW link

Through a panel, darkly: a case study in internet BS detection

jchan2 Jul 2023 13:40 UTC

22 points

7 comments3 min readLW link

LLMs, Batches, and Emergent Episodic Memory

Lao Mein2 Jul 2023 7:55 UTC

5 points

4 comments1 min readLW link

Negativity enhances positivity

Adam Zerner2 Jul 2023 2:47 UTC

12 points

7 comments2 min readLW link

faster latent diffusion

bhauth2 Jul 2023 1:30 UTC

10 points

8 comments2 min readLW link

(www.bhauth.com)

Using (Uninterpretable) LLMs to Generate Interpretable AI Code

Joar Skalse2 Jul 2023 1:01 UTC

13 points

12 comments3 min readLW link

Grant applications and grand narratives

Elizabeth2 Jul 2023 0:16 UTC

191 points

22 comments6 min readLW link

An Introduction, an Overview of my personal resources, and how one might make use of them

ProofBySonnet1 Jul 2023 21:00 UTC

4 points

6 comments3 min readLW link

My “2.9 trauma limit”

Raemon1 Jul 2023 19:32 UTC

193 points

31 comments7 min readLW link

Alpha

Erich_Grunewald1 Jul 2023 16:05 UTC

65 points

2 comments14 min readLW link

(www.erichgrunewald.com)

Forum Karma: view stats and find highly-rated comments for any LW user

Max H1 Jul 2023 15:36 UTC

60 points

16 comments2 min readLW link

(forumkarma.com)

[ASoT] GPT2 Steering & The Tuned Lens

Ulisse Mini1 Jul 2023 14:12 UTC

23 points

0 comments2 min readLW link

[Linkpost] A shared linguistic space for transmitting our thoughts from brain to brain in natural conversations

Bogdan Ionut Cirstea1 Jul 2023 13:57 UTC

17 points

2 comments1 min readLW link

Elements of Computational Philosophy, Vol. I: Truth

Paul Bricman and Tom Feeney

1 Jul 2023 11:44 UTC

12 points

6 comments1 min readLW link

(compphil.github.io)

Micro Habits that Improve One’s Day

silentbob1 Jul 2023 10:53 UTC

62 points

9 comments5 min readLW link

Ateliers: But what is an Atelier?

Stephen Fowler1 Jul 2023 5:57 UTC

4 points

2 comments10 min readLW link