Cog­ni­tive Work and AI Safety: A Ther­mo­dy­namic Perspective

Daniel Murfet8 Dec 2024 21:42 UTC
61 points
9 comments4 min readLW link

Causal Un­der­tow: A Work of Seed Fiction

Daniel Murfet8 Dec 2024 21:41 UTC
41 points
0 comments3 min readLW link

Mis­for­tune and Many Worlds

Jonah Wilberg8 Dec 2024 20:25 UTC
10 points
4 comments9 min readLW link

Luck Based Medicine: No Good Very Bad Win­ter Cured My Hypothyroidism

Elizabeth8 Dec 2024 20:10 UTC
54 points
3 comments2 min readLW link
(acesounderglass.com)

Dens­ing Law of LLMs

Bogdan Ionut Cirstea8 Dec 2024 19:35 UTC
9 points
2 comments1 min readLW link
(arxiv.org)

[Question] Are there ways to ar­tifi­cially fix laz­i­ness?

Aidar8 Dec 2024 18:26 UTC
4 points
2 comments1 min readLW link

Fred the Heretic, a GPT for poetry

Bill Benzon8 Dec 2024 16:52 UTC
4 points
0 comments1 min readLW link

Re­think Wel­lbe­ing’s Year 2 Up­date: Foster Sus­tain­able High Perfor­mance for Am­bi­tious Altru­ists

Inga G.8 Dec 2024 14:32 UTC
11 points
1 comment1 min readLW link

Alter­na­tives to Masks for In­fec­tious Aerosols

jefftk8 Dec 2024 14:00 UTC
25 points
9 comments7 min readLW link
(www.jefftk.com)

Parable of the vanilla ice cream curse (and how it would pre­vent a car from start­ing!)

Mati_Roy8 Dec 2024 6:57 UTC
89 points
21 comments3 min readLW link

A good way to build many air filters on the cheap

winstonBosan8 Dec 2024 1:47 UTC
14 points
5 comments3 min readLW link

His­tor­i­cal Net Worth

jefftk7 Dec 2024 23:10 UTC
19 points
1 comment1 min readLW link
(www.jefftk.com)

RL, but don’t do any­thing I wouldn’t do

Gunnar_Zarncke7 Dec 2024 22:54 UTC
63 points
5 comments1 min readLW link
(arxiv.org)

Liti­gate-for-Im­pact: Prepar­ing Le­gal Ac­tion against an AGI Fron­tier Lab Leader

Sonia Joseph7 Dec 2024 21:42 UTC
38 points
7 comments2 min readLW link

Alge­braic Linguistics

abstractapplic7 Dec 2024 19:18 UTC
34 points
27 comments5 min readLW link

Paper High­lights, Novem­ber ’24

gasteigerjo7 Dec 2024 19:15 UTC
7 points
0 comments8 min readLW link
(aisafetyfrontier.substack.com)

In­tri­ca­cies of Fea­ture Geom­e­try in Large Lan­guage Models

7 Dec 2024 18:10 UTC
68 points
0 comments12 min readLW link

The Way Ac­cord­ing To Zvi

Sable7 Dec 2024 17:35 UTC
38 points
3 comments32 min readLW link
(affablyevil.substack.com)

Deep Learn­ing is cheap Solomonoff in­duc­tion?

7 Dec 2024 11:00 UTC
44 points
1 comment17 min readLW link

minifest

Austin Chen7 Dec 2024 3:50 UTC
19 points
1 comment1 min readLW link

Mask and Re­s­pi­ra­tor In­tel­ligi­bil­ity Comparison

jefftk7 Dec 2024 3:20 UTC
26 points
5 comments1 min readLW link
(www.jefftk.com)

Broad­en­ing Hori­zons: Re­think­ing So­cial Mo­bil­ity Through Skill Diversification

Yanling Guo7 Dec 2024 0:04 UTC
−1 points
0 comments2 min readLW link

Back­doors have uni­ver­sal rep­re­sen­ta­tions across large lan­guage models

6 Dec 2024 22:56 UTC
14 points
0 comments16 min readLW link

Gra­di­ent Rout­ing: Mask­ing Gra­di­ents to Lo­cal­ize Com­pu­ta­tion in Neu­ral Networks

6 Dec 2024 22:19 UTC
161 points
12 comments11 min readLW link
(arxiv.org)

Un­der­stand­ing Shap­ley Values with Venn Diagrams

Carson L6 Dec 2024 21:56 UTC
213 points
34 comments1 min readLW link
(medium.com)

Model Integrity

6 Dec 2024 21:28 UTC
4 points
1 comment18 min readLW link

Can AI im­prove the cur­rent state of molec­u­lar simu­la­tion?

Abhishaike Mahajan6 Dec 2024 20:22 UTC
5 points
0 comments1 min readLW link
(www.owlposting.com)

Low Tem­per­a­ture Solomonoff Induction

dil-leik-og6 Dec 2024 18:55 UTC
10 points
4 comments11 min readLW link

Ex­per­i­ments are in the ter­ri­tory, re­sults are in the map

Tahp6 Dec 2024 15:44 UTC
5 points
1 comment6 min readLW link

A car jour­ney with con­ser­va­tive evan­gel­i­cals—Un­der­stand­ing some Bri­tish poli­ti­cal-re­li­gious beliefs

Nathan Young6 Dec 2024 11:22 UTC
41 points
8 comments6 min readLW link
(nathanpmyoung.substack.com)

Fron­tier Models are Ca­pable of In-con­text Scheming

5 Dec 2024 22:11 UTC
203 points
24 comments7 min readLW link

Should you be wor­ried about H5N1?

gw5 Dec 2024 21:11 UTC
89 points
2 comments5 min readLW link
(www.georgeyw.com)

o1 tried to avoid be­ing shut down

Raelifin5 Dec 2024 19:52 UTC
10 points
5 comments1 min readLW link
(www.transformernews.ai)

More Growth, Me­lan­choly, and MindCraft @3QD [re­vised and up­dated]

Bill Benzon5 Dec 2024 19:36 UTC
4 points
0 comments4 min readLW link

Ex­pevolu, a laissez-faire ap­proach to coun­try creation

Fernando5 Dec 2024 19:29 UTC
4 points
4 comments44 min readLW link
(expevolu.substack.com)

Are SAE fea­tures from the Base Model still mean­ingful to LLaVA?

Shan23Chen5 Dec 2024 19:24 UTC
5 points
2 comments10 min readLW link

OpenAI o1 + ChatGPT Pro release

anaguma5 Dec 2024 19:13 UTC
5 points
0 comments1 min readLW link
(openai.com)

Smart peo­ple should do biology

Haotian5 Dec 2024 19:11 UTC
10 points
2 comments3 min readLW link

An­nounce­ment: AI for Math Fund

sarahconstantin5 Dec 2024 18:33 UTC
20 points
9 comments2 min readLW link
(renaissancephilanthropy.org)

De­tec­tion of Asymp­tomat­i­cally Spread­ing Pathogens

jefftk5 Dec 2024 18:20 UTC
45 points
8 comments7 min readLW link
(www.jefftk.com)

Model In­tegrity: MAI on Value Alignment

Jonas Hallgren5 Dec 2024 17:11 UTC
6 points
11 comments1 min readLW link
(meaningalignment.substack.com)

So­cial Science in its episte­molog­i­cal context

Arturo Macias5 Dec 2024 16:12 UTC
3 points
0 comments1 min readLW link
(www.theseedsofscience.pub)

Higher and lower pleasures

Chris_Leong5 Dec 2024 13:13 UTC
19 points
3 comments1 min readLW link

Sam Har­ris’s Ar­gu­ment For Ob­jec­tive Morality

Zero Contradictions5 Dec 2024 10:19 UTC
7 points
5 comments1 min readLW link
(thewaywardaxolotl.blogspot.com)

Mo­ral­ity as Co­op­er­a­tion Part III: Failure Modes

DeLesley Hutchins5 Dec 2024 9:39 UTC
4 points
0 comments20 min readLW link

Mo­ral­ity as Co­op­er­a­tion Part II: The­ory and Experiment

DeLesley Hutchins5 Dec 2024 9:04 UTC
2 points
0 comments17 min readLW link

Mo­ral­ity as Co­op­er­a­tion Part I: Humans

DeLesley Hutchins5 Dec 2024 8:16 UTC
5 points
0 comments19 min readLW link

I Fi­nally Worked Through Bayes’ The­o­rem (Per­sonal Achieve­ment)

keltan5 Dec 2024 2:04 UTC
51 points
6 comments9 min readLW link

The Dream Machine

sarahconstantin5 Dec 2024 0:00 UTC
117 points
6 comments12 min readLW link
(sarahconstantin.substack.com)

Should you have chil­dren? A de­ci­sion frame­work for a cru­cial life choice that af­fects your­self, your child and the world

Sherrinford4 Dec 2024 23:14 UTC
0 points
1 comment20 min readLW link