Learn to write well BEFORE you have something worth saying

eukaryote 29 Dec 2024 23:42 UTC
49 points
14 comments
3 min read
(eukaryotewritesblog.com)

Teaching Claude to Meditate

Gordon Seidoh Worley 29 Dec 2024 22:27 UTC
2 points
3 comments
23 min read

Action: how do you REALLY go about doing?

DDthinker 29 Dec 2024 22:00 UTC
−3 points
0 comments
4 min read

Began a pay-on-results coaching experiment, made $40,300 since July

Chipmonk 29 Dec 2024 21:12 UTC
45 points
14 comments
3 min read
(chrislakin.blog)

Corrigibility should be an AI’s Only Goal

PeterMcCluskey 29 Dec 2024 20:25 UTC
11 points
1 comment
8 min read
(bayesianinvestor.com)

Making LLMs safer is more intuitive than you think: How Common Sense and Diversity Improve AI Alignment

Jeba Sania 29 Dec 2024 19:27 UTC
−3 points
0 comments
6 min read

[Question] Could my work, “Beyond HaHa” benefit the LessWrong community?

P. João 29 Dec 2024 16:14 UTC
6 points
0 comments
1 min read

Book Summary: Zero to One

bilalchughtai 29 Dec 2024 16:13 UTC
27 points
1 comment
8 min read

Boston Solstice 2024 Retrospective

jefftk 29 Dec 2024 15:40 UTC
10 points
0 comments
4 min read
(www.jefftk.com)

Some arguments against a land value tax

Matthew Barnett 29 Dec 2024 15:17 UTC
78 points
29 comments
15 min read

Predictions of Near-Term Societal Changes Due to Artificial Intelligence

Annapurna 29 Dec 2024 14:53 UTC
10 points
0 comments
6 min read
(jorgevelez.substack.com)

Considerations on orca intelligence

Towards_Keeperhood 29 Dec 2024 14:35 UTC
43 points
5 comments
9 min read

AI Alignment, and where we stand.

afeller08 29 Dec 2024 14:08 UTC
−17 points
0 comments
2 min read

The Legacy of Computer Science

Johannes C. Mayer 29 Dec 2024 13:15 UTC
15 points
0 comments
1 min read
(groups.csail.mit.edu)

Shallow review of technical AI safety, 2024

29 Dec 2024 12:01 UTC
142 points
27 comments
41 min read

Dishbrain and implications.

RussellThor 29 Dec 2024 10:42 UTC
4 points
0 comments
2 min read

Notes on Altruism

David Gross 29 Dec 2024 3:13 UTC
8 points
0 comments
34 min read

Rejecting Anthropomorphic Bias: Addressing Fears of AGI and Transformation

Gedankensprünge 29 Dec 2024 1:48 UTC
−17 points
1 comment
3 min read

What happens next?

Logan Zoellner 29 Dec 2024 1:41 UTC
41 points
19 comments
2 min read

The Misconception of AGI as an Existential Threat: A Reassessment

Gedankensprünge 29 Dec 2024 1:39 UTC
−25 points
0 comments
2 min read

Does Claude Prioritize Some Prompt Input Channels Over Others?

keltan 29 Dec 2024 1:21 UTC
9 points
2 comments
5 min read

Impact in AI Safety Now Requires Specific Strategic Insight

MiloSal 29 Dec 2024 0:40 UTC
15 points
1 comment
6 min read
(ameliorology.substack.com)

Morality Is Still Demanding

utilistrutil 29 Dec 2024 0:33 UTC
−2 points
2 comments
1 min read

Emergence and Amplification of Survival

jgraves01 28 Dec 2024 23:52 UTC
−1 points
0 comments
3 min read

[Question] Has Someone Checked The Cold-Water-In-Left-Ear Thing?

Maloew 28 Dec 2024 20:15 UTC
7 points
0 comments
1 min read

By default, capital will matter more than ever after AGI

L Rudolf L 28 Dec 2024 17:52 UTC
222 points
88 comments
16 min read
(nosetgauge.substack.com)

AI Assistants Should Have a Direct Line to Their Developers

Jan_Kulveit 28 Dec 2024 17:01 UTC
53 points
4 comments
2 min read

No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!

Christopher King 28 Dec 2024 16:05 UTC
5 points
7 comments
1 min read

The average rationalist IQ is about 122

Rockenots 28 Dec 2024 15:42 UTC
22 points
21 comments
1 min read

Why OpenAI’s Structure Must Evolve To Advance Our Mission

stuhlmueller 28 Dec 2024 4:24 UTC
17 points
1 comment
1 min read
(openai.com)

The Engineering Argument Fallacy: Why Technological Success Doesn’t Validate Physics

Wenitte Apiou 28 Dec 2024 0:49 UTC
−16 points
5 comments
2 min read

The Robot, the Puppet-master, and the Psychohistorian

WillPetillo 28 Dec 2024 0:12 UTC
6 points
2 comments
3 min read

[Question] What is your personal totalizing and self-consistent worldview/philosophy?

lsusr 27 Dec 2024 23:59 UTC
30 points
12 comments
2 min read

Progress links and short notes, 2024-12-27: Clinical trial abundance, grid-scale fusion, permitting vs. compliance, crossword mania, and more

jasoncrawford 27 Dec 2024 23:34 UTC
11 points
0 comments
2 min read
(newsletter.rootsofprogress.org)

Greedy-Advantage-Aware RLHF

sej2020 27 Dec 2024 19:47 UTC
48 points
15 comments
13 min read

Deconstructing arguments against AI art

DMMF 27 Dec 2024 19:40 UTC
7 points
2 comments
5 min read
(danfrank.ca)

From the Archives: a story

Richard_Ngo 27 Dec 2024 16:36 UTC
18 points
1 comment
16 min read
(www.narrativeark.xyz)

[Question] What’s the best metric for measuring quality of life?

ChristianKl 27 Dec 2024 14:29 UTC
9 points
4 comments
1 min read

Review: Planecrash

L Rudolf L 27 Dec 2024 14:18 UTC
264 points
31 comments
21 min read
(nosetgauge.substack.com)

Good Fortune and Many Worlds

Jonah Wilberg 27 Dec 2024 13:21 UTC
4 points
0 comments
5 min read

Letter from an Alien Mind

Shoshannah Tekofsky 27 Dec 2024 13:20 UTC
23 points
7 comments
3 min read
(open.substack.com)

Coin Flip

XelaP 27 Dec 2024 11:53 UTC
16 points
0 comments
1 min read

If all trade is voluntary, then what is “exploitation?”

Darmani 27 Dec 2024 11:21 UTC
31 points
48 comments
4 min read

Duplicate token neurons in the first layer of gpt2-small

Alex Gibson 27 Dec 2024 4:21 UTC
2 points
0 comments
5 min read

[Question] What are the most interesting / challenging evals (for humans) available?

Raemon 27 Dec 2024 3:05 UTC
40 points
13 comments
2 min read

Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics

Lorec 27 Dec 2024 1:58 UTC
2 points
0 comments
4 min read

Corrigibility’s Desirability is Timing-Sensitive

RobertM 26 Dec 2024 22:24 UTC
26 points
4 comments
3 min read

PCR retrospective

bhauth 26 Dec 2024 21:20 UTC
22 points
0 comments
8 min read
(bhauth.com)

AI #96: o3 But Not Yet For Thee

Zvi 26 Dec 2024 20:30 UTC
58 points
8 comments
36 min read
(thezvi.wordpress.com)

Super human AI is a very low hanging fruit!

Hzn 26 Dec 2024 19:00 UTC
3 points
0 comments
5 min read