Utility is not the selection target

tailcalled · 4 Nov 2023 22:48 UTC
24 points
1 comment · 1 min read · LW link

Stuxnet, not Skynet: Humanity’s disempowerment by AI

Roko · 4 Nov 2023 22:23 UTC
107 points
24 comments · 6 min read · LW link

The 6D effect: When companies take risks, one email can be very powerful.

scasper · 4 Nov 2023 20:08 UTC
275 points
42 comments · 3 min read · LW link

Genetic fitness is a measure of selection strength, not the selection target

Kaj_Sotala · 4 Nov 2023 19:02 UTC
56 points
43 comments · 18 min read · LW link

The Soul Key

Richard_Ngo · 4 Nov 2023 17:51 UTC
97 points
9 comments · 8 min read · LW link
(www.narrativeark.xyz)

[Linkpost] Concept Alignment as a Prerequisite for Value Alignment

Bogdan Ionut Cirstea · 4 Nov 2023 17:34 UTC
27 points
0 comments · 1 min read · LW link
(arxiv.org)

We are already in a persuasion-transformed world and must take precautions

trevor · 4 Nov 2023 15:53 UTC
36 points
14 comments · 6 min read · LW link

Being good at the basics

dominicq · 4 Nov 2023 14:18 UTC
32 points
1 comment · 3 min read · LW link

If a little is good, is more better?

DanielFilan · 4 Nov 2023 7:10 UTC
25 points
16 comments · 2 min read · LW link
(danielfilan.com)

Untrusted smart models and trusted dumb models

Buck · 4 Nov 2023 3:06 UTC
87 points
17 comments · 6 min read · LW link · 1 review

As Many Ideas

Screwtape · 3 Nov 2023 22:47 UTC
11 points
0 comments · 4 min read · LW link

Paul Christiano on Dwarkesh Podcast

ESRogs · 3 Nov 2023 22:13 UTC
19 points
0 comments · 1 min read · LW link
(www.dwarkeshpatel.com)

Deception Chess: Game #1

3 Nov 2023 21:13 UTC
104 points
21 comments · 8 min read · LW link · 1 review

8 examples informing my pessimism on uploading without reverse engineering

Steven Byrnes · 3 Nov 2023 20:03 UTC
117 points
12 comments · 12 min read · LW link

Integrity in AI Governance and Advocacy

3 Nov 2023 19:52 UTC
134 points
57 comments · 23 min read · LW link

Averaging samples from a population with log-normal distribution

CrimsonChin · 3 Nov 2023 19:42 UTC
8 points
2 comments · 1 min read · LW link

Securing Civilization Against Catastrophic Pandemics

jefftk · 3 Nov 2023 19:33 UTC
13 points
0 comments · 1 min read · LW link
(dam.gcsp.ch)

The Unavoidable Experience of Free Will in a Deterministic World

gmax · 3 Nov 2023 17:55 UTC
−10 points
0 comments · 2 min read · LW link

Thoughts on open source AI

Sam Marks · 3 Nov 2023 15:35 UTC
62 points
17 comments · 10 min read · LW link

Knowledge Base 6: Consensus theory of truth

iwis · 3 Nov 2023 13:56 UTC
−8 points
0 comments · 1 min read · LW link

[Question] Shouldn’t we ‘Just’ Superimitate Low-Res Uploads?

lukemarks · 3 Nov 2023 7:42 UTC
15 points
2 comments · 2 min read · LW link

The other side of the tidal wave

KatjaGrace · 3 Nov 2023 5:40 UTC
187 points
86 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

Does davidad’s uploading moonshot work?

3 Nov 2023 2:21 UTC
146 points
35 comments · 25 min read · LW link

Twin Cities ACX Meetup—November 2023

Timothy M. · 3 Nov 2023 0:47 UTC
1 point
1 comment · 1 min read · LW link

San Francisco ACX Meetup “First Saturday”

guenael · 3 Nov 2023 0:10 UTC
4 points
0 comments · 1 min read · LW link

[Question] What are your favorite posts, podcast episodes, and recorded talks, on AI timelines, or factors that would influence AI timelines?

nonzerosum · 2 Nov 2023 22:42 UTC
2 points
0 comments · 1 min read · LW link

One Day Sooner

Screwtape · 2 Nov 2023 19:00 UTC
106 points
7 comments · 8 min read · LW link

Propaganda or Science: A Look at Open Source AI and Bioterrorism Risk

1a3orn · 2 Nov 2023 18:20 UTC
193 points
79 comments · 23 min read · LW link

AI #36: In the Background

Zvi · 2 Nov 2023 18:00 UTC
45 points
5 comments · 37 min read · LW link
(thezvi.wordpress.com)

Doubt Certainty

RationalDino · 2 Nov 2023 17:43 UTC
4 points
13 comments · 3 min read · LW link

Saying the quiet part out loud: trading off x-risk for personal immortality

disturbance · 2 Nov 2023 17:43 UTC
83 points
89 comments · 5 min read · LW link

Mech Interp Challenge: November—Deciphering the Cumulative Sum Model

CallumMcDougall · 2 Nov 2023 17:10 UTC
18 points
2 comments · 2 min read · LW link

Estimating effective dimensionality of MNIST models

Arjun Panickssery · 2 Nov 2023 14:13 UTC
41 points
3 comments · 1 min read · LW link

Averages and sample sizes

mruwnik · 2 Nov 2023 9:52 UTC
15 points
2 comments · 8 min read · LW link

ACX/LW/EA crossover meetup

RasmusHB · 2 Nov 2023 5:57 UTC
2 points
0 comments · 1 min read · LW link

Upcoming Feedback Opportunity on Dual-Use Foundation Models

Chris_Leong · 2 Nov 2023 4:28 UTC
3 points
0 comments · 1 min read · LW link

Public Weights?

jefftk · 2 Nov 2023 2:50 UTC
49 points
19 comments · 3 min read · LW link
(www.jefftk.com)

[Question] Should people build productizations of open source AI models?

lc · 2 Nov 2023 1:26 UTC
23 points
0 comments · 1 min read · LW link

Singular learning theory and bridging from ML to brain emulations

1 Nov 2023 21:31 UTC
26 points
16 comments · 29 min read · LW link

My thoughts on the social response to AI risk

Matthew Barnett · 1 Nov 2023 21:17 UTC
157 points
37 comments · 10 min read · LW link

Reactions to the Executive Order

Zvi · 1 Nov 2023 20:40 UTC
77 points
4 comments · 29 min read · LW link
(thezvi.wordpress.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-Dodds · 1 Nov 2023 18:10 UTC
85 points
1 comment · 4 min read · LW link
(www.anthropic.com)

Book Review: Determined by Sapolsky

Kailuo Wang · 1 Nov 2023 17:37 UTC
1 point
0 comments · 7 min read · LW link

AI Alignment: A Comprehensive Survey

Stephen McAleer · 1 Nov 2023 17:35 UTC
15 points
1 comment · 1 min read · LW link
(arxiv.org)

A list of all the deadlines in Biden’s Executive Order on AI

Valentin Baltadzhiev · 1 Nov 2023 17:14 UTC
26 points
2 comments · 11 min read · LW link

2023 LessWrong Community Census, Request for Comments

Screwtape · 1 Nov 2023 16:32 UTC
43 points
37 comments · 2 min read · LW link

[Question] Snapshot of narratives and frames against regulating AI

Jan_Kulveit · 1 Nov 2023 16:30 UTC
36 points
19 comments · 3 min read · LW link

Commensal Institutions

Sable · 1 Nov 2023 16:01 UTC
8 points
12 comments · 4 min read · LW link
(affablyevil.substack.com)

ChatGPT’s Ontological Landscape

Bill Benzon · 1 Nov 2023 15:12 UTC
7 points
0 comments · 4 min read · LW link

On the Executive Order

Zvi · 1 Nov 2023 14:20 UTC
100 points
4 comments · 30 min read · LW link
(thezvi.wordpress.com)