[Question] Am I confused about the “malign universal prior” argument?

nostalgebraist · 27 Aug 2024 23:17 UTC
92 points
33 comments · 8 min read · LW link

The Information: OpenAI shows ‘Strawberry’ to feds, races to launch it

Martín Soto · 27 Aug 2024 23:10 UTC
145 points
15 comments · 3 min read · LW link

SB 1047: Final Takes and Also AB 3211

Zvi · 27 Aug 2024 22:10 UTC
92 points
11 comments · 21 min read · LW link
(thezvi.wordpress.com)

LessWrong email subscriptions?

Raemon · 27 Aug 2024 21:59 UTC
26 points
6 comments · 1 min read · LW link

GPT-3.5 judges can supervise GPT-4o debaters in capability asymmetric debates

27 Aug 2024 20:44 UTC
23 points
7 comments · 4 min read · LW link

Why Large Bureaucratic Organizations?

johnswentworth · 27 Aug 2024 18:30 UTC
68 points
52 comments · 12 min read · LW link

In defense of technological unemployment as the main AI concern

tailcalled · 27 Aug 2024 17:58 UTC
44 points
36 comments · 1 min read · LW link

[Question] I’m doing Yolov8 model training but the accuracy rate is 70%

Sezer Karataş · 27 Aug 2024 17:53 UTC
−14 points
0 comments · 1 min read · LW link

What Depression Is Like

Sable · 27 Aug 2024 17:43 UTC
85 points
23 comments · 4 min read · LW link
(affablyevil.substack.com)

Unit economics of LLM APIs

27 Aug 2024 16:51 UTC
42 points
0 comments · 2 min read · LW link

On Interpters, Optimizing Compilers, and JIT

Johannes C. Mayer · 27 Aug 2024 16:01 UTC
−3 points
2 comments · 1 min read · LW link

Soft Nationalization: how the USG will control AI labs

27 Aug 2024 15:11 UTC
76 points
7 comments · 21 min read · LW link
(www.convergenceanalysis.org)

[Question] On Nothing

Hudjefa · 27 Aug 2024 6:50 UTC
−14 points
12 comments · 1 min read · LW link

“Real summer”?

duck_master · 26 Aug 2024 22:11 UTC
2 points
0 comments · 1 min read · LW link

Metaculus’s ‘Minitaculus’ Experiments — Collaborate With Us

ChristianWilliams · 26 Aug 2024 20:44 UTC
6 points
0 comments · 1 min read · LW link
(www.metaculus.com)

My Apartment Art Commission Process

jenn · 26 Aug 2024 18:36 UTC
34 points
4 comments · 7 min read · LW link
(jenn.site)

My (current) model of what an AI governance researcher does

Johan de Kock · 26 Aug 2024 17:58 UTC
1 point
2 comments · 5 min read · LW link

Would catching your AIs trying to escape convince AI developers to slow down or undeploy?

Buck · 26 Aug 2024 16:46 UTC
294 points
76 comments · 4 min read · LW link

… Wait, our models of semantics should inform fluid mechanics?!?

26 Aug 2024 16:38 UTC
56 points
18 comments · 4 min read · LW link

Day Zero Antivirals for Future Pandemics

Niko_McCarty · 26 Aug 2024 15:18 UTC
22 points
2 comments · 10 min read · LW link
(www.asimov.press)

Molecular dynamics data will be essential for the next generation of ML protein models

Abhishaike Mahajan · 26 Aug 2024 14:50 UTC
9 points
0 comments · 11 min read · LW link
(www.owlposting.com)

My lukewarm take on GLP-1 agonists

George3d6 · 26 Aug 2024 12:34 UTC
16 points
0 comments · 1 min read · LW link
(cerebralab.com)

Interview with Robert Kralisch on Simulators

WillPetillo · 26 Aug 2024 5:49 UTC
17 points
0 comments · 75 min read · LW link

One person’s worth of mental energy for AI doom aversion jobs. What should I do?

Lorec · 26 Aug 2024 1:29 UTC
9 points
17 comments · 1 min read · LW link

Secular interpretations of core perennialist claims

zhukeepa · 25 Aug 2024 23:41 UTC
83 points
33 comments · 14 min read · LW link

Darwinian Traps and Existential Risks

KristianRonn · 25 Aug 2024 22:37 UTC
80 points
14 comments · 10 min read · LW link

DIY LessWrong Jewelry

Fluffnutt · 25 Aug 2024 21:33 UTC
33 points
0 comments · 1 min read · LW link

Meta: On viewing the latest LW posts

quiet_NaN · 25 Aug 2024 19:31 UTC
5 points
2 comments · 1 min read · LW link

you should probably eat oatmeal sometimes

bhauth · 25 Aug 2024 14:50 UTC
42 points
32 comments · 3 min read · LW link
(bhauth.com)

Referendum Mechanics in a Marketplace of Ideas

Martin Sustrik · 25 Aug 2024 8:30 UTC
57 points
2 comments · 5 min read · LW link
(250bpm.substack.com)

Please stop using mediocre AI art in your posts

Raemon · 25 Aug 2024 0:13 UTC
110 points
24 comments · 2 min read · LW link

AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization

DanielFilan · 24 Aug 2024 22:30 UTC
21 points
0 comments · 74 min read · LW link

The top 30 books to expand the capabilities of AI: a biased reading list

Jonathan Mugan · 24 Aug 2024 21:48 UTC
−6 points
0 comments · 16 min read · LW link

The Ap Distribution

criticalpoints · 24 Aug 2024 21:45 UTC
22 points
7 comments · 3 min read · LW link
(eregis.github.io)

What is it to solve the alignment problem?

Joe Carlsmith · 24 Aug 2024 21:19 UTC
69 points
17 comments · 53 min read · LW link

Examine self modification as an intuition provider for the concept of consciousness

weightt an · 24 Aug 2024 20:48 UTC
−5 points
2 comments · 10 min read · LW link

[Question] Looking to interview AI Safety researchers for a book

jeffreycaruso · 24 Aug 2024 19:57 UTC
14 points
0 comments · 1 min read · LW link

Perplexity wins my AI race

Elizabeth · 24 Aug 2024 19:20 UTC
107 points
12 comments · 10 min read · LW link
(acesounderglass.com)

Why should anyone boot *you* up?

onur · 24 Aug 2024 17:51 UTC
−1 points
5 comments · 3 min read · LW link
(solmaz.io)

Understanding Hidden Computations in Chain-of-Thought Reasoning

rokosbasilisk · 24 Aug 2024 16:35 UTC
6 points
1 comment · 1 min read · LW link

August 2024 Time Tracking

jefftk · 24 Aug 2024 13:50 UTC
22 points
0 comments · 3 min read · LW link
(www.jefftk.com)

Training a Sparse Autoencoder in < 30 minutes on 16GB of VRAM using an S3 cache

Louka Ewington-Pitsos · 24 Aug 2024 7:39 UTC
17 points
0 comments · 5 min read · LW link

[Question] Looking for intuitions to extend bargaining notions

ProgramCrafter · 24 Aug 2024 5:00 UTC
13 points
0 comments · 1 min read · LW link

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Michaël Trazzi · 24 Aug 2024 4:30 UTC
55 points
0 comments · 5 min read · LW link

[Question] Developing Positive Habits through Video Games

pzas · 24 Aug 2024 3:47 UTC
1 point
5 comments · 1 min read · LW link

“Can AI Scaling Continue Through 2030?”, Epoch AI (yes)

gwern · 24 Aug 2024 1:40 UTC
129 points
4 comments · 3 min read · LW link
(epochai.org)

What’s important in “AI for epistemics”?

Lukas Finnveden · 24 Aug 2024 1:27 UTC
41 points
0 comments · 28 min read · LW link
(lukasfinnveden.substack.com)

Showing SAE Latents Are Not Atomic Using Meta-SAEs

24 Aug 2024 0:56 UTC
61 points
9 comments · 20 min read · LW link

Using ideologically-charged language to get gpt-3.5-turbo to disobey it’s system prompt: a demo

Milan W · 24 Aug 2024 0:13 UTC
3 points
0 comments · 6 min read · LW link

Crafting Polysemantic Transformer Benchmarks with Known Circuits

23 Aug 2024 22:03 UTC
10 points
0 comments · 25 min read · LW link