All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Utility is not the selection target

tailcalled4 Nov 2023 22:48 UTC

24 points

1 comment1 min readLW link

Stuxnet, not Skynet: Humanity’s disempowerment by AI

Roko4 Nov 2023 22:23 UTC

107 points

24 comments6 min readLW link

The 6D effect: When companies take risks, one email can be very powerful.

scasper4 Nov 2023 20:08 UTC

275 points

42 comments3 min readLW link

Genetic fitness is a measure of selection strength, not the selection target

Kaj_Sotala4 Nov 2023 19:02 UTC

56 points

43 comments18 min readLW link

The Soul Key

Richard_Ngo4 Nov 2023 17:51 UTC

97 points

9 comments8 min readLW link

(www.narrativeark.xyz)

[Linkpost] Concept Alignment as a Prerequisite for Value Alignment

Bogdan Ionut Cirstea4 Nov 2023 17:34 UTC

27 points

0 comments1 min readLW link

(arxiv.org)

We are already in a persuasion-transformed world and must take precautions

trevor4 Nov 2023 15:53 UTC

36 points

14 comments6 min readLW link

Being good at the basics

dominicq4 Nov 2023 14:18 UTC

32 points

1 comment3 min readLW link

If a little is good, is more better?

DanielFilan4 Nov 2023 7:10 UTC

25 points

16 comments2 min readLW link

(danielfilan.com)

Untrusted smart models and trusted dumb models

Buck4 Nov 2023 3:06 UTC

87 points

17 comments6 min readLW link 1 review

As Many Ideas

Screwtape3 Nov 2023 22:47 UTC

11 points

0 comments4 min readLW link

Paul Christiano on Dwarkesh Podcast

ESRogs3 Nov 2023 22:13 UTC

19 points

0 comments1 min readLW link

(www.dwarkeshpatel.com)

Deception Chess: Game #1

Zane, aphyer, Alex A and AdamYedidia

3 Nov 2023 21:13 UTC

104 points

21 comments8 min readLW link 1 review

8 examples informing my pessimism on uploading without reverse engineering

Steven Byrnes3 Nov 2023 20:03 UTC

117 points

12 comments12 min readLW link

Integrity in AI Governance and Advocacy

habryka and OliviaJ

3 Nov 2023 19:52 UTC

134 points

57 comments23 min readLW link

Averaging samples from a population with log-normal distribution

CrimsonChin3 Nov 2023 19:42 UTC

8 points

2 comments1 min readLW link

Securing Civilization Against Catastrophic Pandemics

jefftk3 Nov 2023 19:33 UTC

13 points

0 comments1 min readLW link

(dam.gcsp.ch)

The Unavoidable Experience of Free Will in a Deterministic World

gmax3 Nov 2023 17:55 UTC

−10 points

0 comments2 min readLW link

Thoughts on open source AI

Sam Marks3 Nov 2023 15:35 UTC

62 points

17 comments10 min readLW link

Knowledge Base 6: Consensus theory of truth

iwis3 Nov 2023 13:56 UTC

−8 points

0 comments1 min readLW link

[Question] Shouldn’t we ‘Just’ Superimitate Low-Res Uploads?

lukemarks3 Nov 2023 7:42 UTC

15 points

2 comments2 min readLW link

The other side of the tidal wave

KatjaGrace3 Nov 2023 5:40 UTC

187 points

86 comments1 min readLW link

(worldspiritsockpuppet.com)

Does davidad’s uploading moonshot work?

jacobjacob, lisathiergart, Anders_Sandberg, davidad and Arenamontanus

3 Nov 2023 2:21 UTC

146 points

35 comments25 min readLW link

Twin Cities ACX Meetup—November 2023

Timothy M.3 Nov 2023 0:47 UTC

1 point

1 comment1 min readLW link

San Francisco ACX Meetup “First Saturday”

guenael3 Nov 2023 0:10 UTC

4 points

0 comments1 min readLW link

[Question] What are your favorite posts, podcast episodes, and recorded talks, on AI timelines, or factors that would influence AI timelines?

nonzerosum2 Nov 2023 22:42 UTC

2 points

0 comments1 min readLW link

One Day Sooner

Screwtape2 Nov 2023 19:00 UTC

106 points

7 comments8 min readLW link

Propaganda or Science: A Look at Open Source AI and Bioterrorism Risk

1a3orn2 Nov 2023 18:20 UTC

193 points

79 comments23 min readLW link

AI #36: In the Background

Zvi2 Nov 2023 18:00 UTC

45 points

5 comments37 min readLW link

(thezvi.wordpress.com)

Doubt Certainty

RationalDino2 Nov 2023 17:43 UTC

4 points

13 comments3 min readLW link

Saying the quiet part out loud: trading off x-risk for personal immortality

disturbance2 Nov 2023 17:43 UTC

83 points

89 comments5 min readLW link

Mech Interp Challenge: November—Deciphering the Cumulative Sum Model

CallumMcDougall2 Nov 2023 17:10 UTC

18 points

2 comments2 min readLW link

Estimating effective dimensionality of MNIST models

Arjun Panickssery2 Nov 2023 14:13 UTC

41 points

3 comments1 min readLW link

Averages and sample sizes

mruwnik2 Nov 2023 9:52 UTC

15 points

2 comments8 min readLW link

ACX/LW/EA crossover meetup

RasmusHB2 Nov 2023 5:57 UTC

2 points

0 comments1 min readLW link

Upcoming Feedback Opportunity on Dual-Use Foundation Models

Chris_Leong2 Nov 2023 4:28 UTC

3 points

0 comments1 min readLW link

Public Weights?

jefftk2 Nov 2023 2:50 UTC

49 points

19 comments3 min readLW link

(www.jefftk.com)

[Question] Should people build productizations of open source AI models?

lc2 Nov 2023 1:26 UTC

23 points

0 comments1 min readLW link

Singular learning theory and bridging from ML to brain emulations

kave and Garrett Baker

1 Nov 2023 21:31 UTC

26 points

16 comments29 min readLW link

My thoughts on the social response to AI risk

Matthew Barnett1 Nov 2023 21:17 UTC

157 points

37 comments10 min readLW link

Reactions to the Executive Order

Zvi1 Nov 2023 20:40 UTC

77 points

4 comments29 min readLW link

(thezvi.wordpress.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-Dodds1 Nov 2023 18:10 UTC

85 points

1 comment4 min readLW link

(www.anthropic.com)

Book Review: Determined by Sapolsky

Kailuo Wang1 Nov 2023 17:37 UTC

1 point

0 comments7 min readLW link

AI Alignment: A Comprehensive Survey

Stephen McAleer1 Nov 2023 17:35 UTC

15 points

1 comment1 min readLW link

(arxiv.org)

A list of all the deadlines in Biden’s Executive Order on AI

Valentin Baltadzhiev1 Nov 2023 17:14 UTC

26 points

2 comments11 min readLW link

2023 LessWrong Community Census, Request for Comments

Screwtape1 Nov 2023 16:32 UTC

43 points

37 comments2 min readLW link

[Question] Snapshot of narratives and frames against regulating AI

Jan_Kulveit1 Nov 2023 16:30 UTC

36 points

19 comments3 min readLW link

Commensal Institutions

Sable1 Nov 2023 16:01 UTC

8 points

12 comments4 min readLW link

(affablyevil.substack.com)

ChatGPT’s Ontological Landscape

Bill Benzon1 Nov 2023 15:12 UTC

7 points

0 comments4 min readLW link

On the Executive Order

Zvi1 Nov 2023 14:20 UTC

100 points

4 comments30 min readLW link

(thezvi.wordpress.com)