The convergent dynamic we missed

Remmelt 12 Dec 2023 23:19 UTC
2 points
2 comments 1 min read LW link

A Kindness, or The Inevitable Consequence of Perfect Inference (a short story)

samhealy 12 Dec 2023 23:03 UTC
6 points
0 comments 9 min read LW link

Love, Reverence, and Life

12 Dec 2023 21:49 UTC
33 points
7 comments 28 min read LW link

Taboo “procrastination”

Neil 12 Dec 2023 21:33 UTC
19 points
7 comments 1 min read LW link

Enhancing intelligence by banging your head on the wall

Bezzi 12 Dec 2023 21:00 UTC
37 points
26 comments 1 min read LW link

Yamaha P-Series Overview

jefftk 12 Dec 2023 20:30 UTC
10 points
1 comment 1 min read LW link
(www.jefftk.com)

Balsa Update and General Thank You

Zvi 12 Dec 2023 20:30 UTC
61 points
8 comments 8 min read LW link
(thezvi.wordpress.com)

Towards an Ethics Calculator for Use by an AGI

sweenesm 12 Dec 2023 18:37 UTC
3 points
2 comments 11 min read LW link

Why Psychologists Are Wrong About The Illusion Of Explanatory Depth

moses onyedikachukwu 12 Dec 2023 18:32 UTC
1 point
0 comments 4 min read LW link

A design concept for superintelligent machines (and Popper’s critique of induction)

tiplur-bilrex 12 Dec 2023 18:31 UTC
−7 points
6 comments 1 min read LW link
(tiplur-bilrex.tlon.network)

Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible

12 Dec 2023 18:14 UTC
451 points
168 comments 33 min read LW link

Some biases and selection effects in AI risk discourse

Tamsin Leake 12 Dec 2023 17:55 UTC
22 points
21 comments 4 min read LW link
(carado.moe)

[Question] Why No Automated Plagerism Detection For Past Papers?

Lao Mein 12 Dec 2023 17:24 UTC
7 points
10 comments 1 min read LW link

OpenAI: Leaks Confirm the Story

Zvi 12 Dec 2023 14:00 UTC
77 points
9 comments 16 min read LW link
(thezvi.wordpress.com)

Navigating the Attackspace

Jonas Kgomo 12 Dec 2023 13:59 UTC
1 point
0 comments 2 min read LW link

Nonlinear’s Evidence: Debunking False and Misleading Claims

KatWoods 12 Dec 2023 13:16 UTC
104 points
171 comments 1 min read LW link

AI Institution Design Hackathon (EAG Bay Area Satellite Event)

12 Dec 2023 13:10 UTC
1 point
0 comments 1 min read LW link

Funding case: AI Safety Camp

12 Dec 2023 9:08 UTC
66 points
5 comments 6 min read LW link
(manifund.org)

What is the next level of rationality?

12 Dec 2023 8:14 UTC
48 points
24 comments 7 min read LW link

Embedded Agents are Quines

12 Dec 2023 4:57 UTC
11 points
7 comments 8 min read LW link

Predict the future! Earn fake internet points! Get a (free) gambling addiction!

Robert Cousineau 12 Dec 2023 4:39 UTC
3 points
0 comments 1 min read LW link

The likely first longevity drug is based on sketchy science. This is bad for science and bad for longevity.

BobBurgers 12 Dec 2023 2:42 UTC
161 points
34 comments 5 min read LW link

When will GPT-5 come out? Prediction markets vs. Extrapolation

Malte 12 Dec 2023 2:41 UTC
12 points
9 comments 3 min read LW link

On plans for a functional society

12 Dec 2023 0:07 UTC
41 points
8 comments 13 min read LW link

Secondary Risk Markets

Vaniver 11 Dec 2023 21:52 UTC
35 points
4 comments 4 min read LW link

Has anyone experimented with Dodrio, a tool for exploring transformer models through interactive visualization?

Bill Benzon 11 Dec 2023 20:34 UTC
4 points
0 comments 1 min read LW link

[Valence series] 3. Valence & Beliefs

Steven Byrnes 11 Dec 2023 20:21 UTC
75 points
11 comments 21 min read LW link

[Question] Am I ethically obligated to extend the life of my dog with life-extension treatments about to hit the market?

TrudosKudos 11 Dec 2023 19:41 UTC
−3 points
2 comments 1 min read LW link

Adversarial Robustness Could Help Prevent Catastrophic Misuse

aogara 11 Dec 2023 19:12 UTC
30 points
18 comments 9 min read LW link

The Consciousness Box

GradualImprovement 11 Dec 2023 16:45 UTC
33 points
22 comments 4 min read LW link

Empirical work that might shed light on scheming (Section 6 of “Scheming AIs”)

Joe Carlsmith 11 Dec 2023 16:30 UTC
8 points
0 comments 21 min read LW link

Into AI Safety: Episode 3

jacobhaimes 11 Dec 2023 16:30 UTC
6 points
0 comments 1 min read LW link
(into-ai-safety.github.io)

Implicitly Typed C

jefftk 11 Dec 2023 16:10 UTC
16 points
0 comments 1 min read LW link
(www.jefftk.com)

37C3 Hacker x Rationalist Meetup

11 Dec 2023 16:02 UTC
5 points
5 comments 1 min read LW link

re: Yudkowsky on biological materials

bhauth 11 Dec 2023 13:28 UTC
179 points
30 comments 5 min read LW link

Ideoculture

elv 11 Dec 2023 10:29 UTC
8 points
2 comments 6 min read LW link

Quick thoughts on the implications of multi-agent views of mind on AI takeover

Kaj_Sotala 11 Dec 2023 6:34 UTC
45 points
14 comments 4 min read LW link

Auditing failures vs concentrated failures

11 Dec 2023 2:47 UTC
44 points
0 comments 7 min read LW link

Deeply Cover Car Crashes?

jefftk 10 Dec 2023 22:20 UTC
36 points
31 comments 1 min read LW link
(www.jefftk.com)

Principles For Product Liability (With Application To AI)

johnswentworth 10 Dec 2023 21:27 UTC
37 points
55 comments 10 min read LW link

[Question] What do you do to remember and reference the LessWrong posts that were most personally significant to you, in terms of intellectual development or general usefulness?

lillybaeum 10 Dec 2023 17:52 UTC
5 points
7 comments 1 min read LW link

[Question] Do websites and apps actually generally get worse after updates, or is it just an effect of the fear of change?

lillybaeum 10 Dec 2023 17:26 UTC
33 points
34 comments 2 min read LW link

How LDT helps reduce the AI arms race

Tamsin Leake 10 Dec 2023 16:21 UTC
65 points
13 comments 4 min read LW link
(carado.moe)

Understanding Subjective Probabilities

Isaac King 10 Dec 2023 6:03 UTC
30 points
16 comments 10 min read LW link

Send us example gnarly bugs

10 Dec 2023 5:23 UTC
77 points
10 comments 2 min read LW link

Conceptual coherence for concrete categories in humans and LLMs

Bill Benzon 9 Dec 2023 23:49 UTC
13 points
1 comment 2 min read LW link

2d ai-partners as a comprehensive motivation tool

AiresJL 9 Dec 2023 21:59 UTC
3 points
0 comments 1 min read LW link

Without—MicroFic­tion 250 words

Carissa Cassiel 9 Dec 2023 21:49 UTC
19 points
1 comment 1 min read LW link

Some negative steganography results

Fabien Roger 9 Dec 2023 20:22 UTC
59 points
5 comments 2 min read LW link

Summing up “Scheming AIs” (Section 5)

Joe Carlsmith 9 Dec 2023 15:48 UTC
2 points
1 comment 11 min read LW link