Re­lease of UN’s draft re­lated to the gov­er­nance of AI (a sum­mary of the Si­mon In­sti­tute’s re­sponse)

Sebastian Schmidt27 Apr 2024 18:34 UTC
7 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Mercy to the Ma­chine: Thoughts & Rights

False Name27 Apr 2024 16:36 UTC
7 points
6 comments17 min readLW link

Con­structabil­ity: Plainly-coded AGIs may be fea­si­ble in the near future

27 Apr 2024 16:04 UTC
82 points
13 comments13 min readLW link

So What’s Up With PUFAs Chem­i­cally?

J Bostock27 Apr 2024 13:32 UTC
57 points
23 comments6 min readLW link

Link: Let’s Think Dot by Dot: Hid­den Com­pu­ta­tion in Trans­former Lan­guage Models by Ja­cob Pfau, William Mer­rill & Sa­muel R. Bowman

Chris_Leong27 Apr 2024 13:22 UTC
12 points
0 comments1 min readLW link
(twitter.com)

Two Ver­nor Vinge Book Reviews

Maxwell Tabarrok27 Apr 2024 12:14 UTC
17 points
0 comments2 min readLW link
(www.maximum-progress.com)

Re­fusal in LLMs is me­di­ated by a sin­gle direction

27 Apr 2024 11:13 UTC
236 points
93 comments10 min readLW link

[Question] Plau­si­bil­ity of Get­ting Early Warn­ing Shots be­cause AIs can’t co­or­di­nate?

hmys27 Apr 2024 8:02 UTC
12 points
0 comments1 min readLW link

AI Safety Sphere

Myles H27 Apr 2024 1:49 UTC
6 points
2 comments2 min readLW link

Ex­plor­ing the Eso­teric Path­ways to AI Sen­tience (Part One)

jeffreycaruso27 Apr 2024 1:02 UTC
−11 points
6 comments2 min readLW link

Su­per­po­si­tion is not “just” neu­ron polysemanticity

LawrenceC26 Apr 2024 23:22 UTC
64 points
4 comments13 min readLW link

D&D.Sci Long War: Defen­der of Data-mocracy

aphyer26 Apr 2024 22:30 UTC
44 points
20 comments4 min readLW link

On Not Pul­ling The Lad­der Up Be­hind You

Screwtape26 Apr 2024 21:58 UTC
188 points
21 comments9 min readLW link

We are headed into an ex­treme com­pute overhang

devrandom26 Apr 2024 21:38 UTC
53 points
33 comments2 min readLW link

[Con­cept Depen­dency] Edge Reg­u­lar Lat­tice Graph

Johannes C. Mayer26 Apr 2024 21:14 UTC
9 points
1 comment1 min readLW link

[Con­cept Depen­dency] Con­cept Depen­dency Posts

Johannes C. Mayer26 Apr 2024 20:57 UTC
10 points
3 comments2 min readLW link

[Question] Wouldn’t weak AI agents provide warn­ing?

Mandatory Topic26 Apr 2024 19:34 UTC
5 points
0 comments1 min readLW link

World models

A*26 Apr 2024 19:11 UTC
1 point
0 comments1 min readLW link

Duct Tape security

Isaac King26 Apr 2024 18:57 UTC
68 points
11 comments5 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 8 - When does fun­da­men­tal un­cer­tainty mat­ter?

Gordon Seidoh Worley26 Apr 2024 18:10 UTC
11 points
2 comments32 min readLW link

Scal­ing of AI train­ing runs will slow down af­ter GPT-5

Maxime Riché26 Apr 2024 16:05 UTC
40 points
5 comments3 min readLW link

Spa­tial at­ten­tion as a “tell” for em­pa­thetic simu­la­tion?

Steven Byrnes26 Apr 2024 15:10 UTC
55 points
12 comments8 min readLW link

Arch-anarchy

Peter lawless 26 Apr 2024 15:05 UTC
−1 points
1 comment25 min readLW link

Bread­board­ing a Whis­tle Synth

jefftk26 Apr 2024 15:00 UTC
9 points
2 comments2 min readLW link
(www.jefftk.com)

An In­tro­duc­tion to AI Sandbagging

26 Apr 2024 13:40 UTC
45 points
13 comments8 min readLW link

LLMs seem (rel­a­tively) safe

JustisMills25 Apr 2024 22:13 UTC
53 points
24 comments7 min readLW link
(justismills.substack.com)

Los­ing Faith In Con­trar­i­anism

omnizoid25 Apr 2024 20:53 UTC
38 points
44 comments5 min readLW link

Why I stopped be­ing into basin broadness

tailcalled25 Apr 2024 20:47 UTC
16 points
3 comments2 min readLW link

AXRP Epi­sode 29 - Science of Deep Learn­ing with Vikrant Varma

DanielFilan25 Apr 2024 19:10 UTC
20 points
1 comment63 min readLW link

Im­prov­ing Dic­tionary Learn­ing with Gated Sparse Autoencoders

25 Apr 2024 18:43 UTC
63 points
38 comments1 min readLW link
(arxiv.org)

“Why I Write” by Ge­orge Or­well (1946)

Arjun Panickssery25 Apr 2024 16:02 UTC
58 points
2 comments9 min readLW link
(www.orwellfoundation.com)

Knowl­edge Base 8: The truth as an at­trac­tor in the in­for­ma­tion space

iwis25 Apr 2024 15:28 UTC
−8 points
0 comments2 min readLW link

Cy­ber­se­cu­rity of Fron­tier AI Models: A Reg­u­la­tory Review

25 Apr 2024 14:51 UTC
8 points
0 comments8 min readLW link

The first fu­ture and the best future

KatjaGrace25 Apr 2024 6:40 UTC
106 points
12 comments1 min readLW link
(worldspiritsockpuppet.com)

NIH Cancer Myths Myths

25 Apr 2024 5:43 UTC
15 points
1 comment2 min readLW link

so­cial lemon markets

bhauth25 Apr 2024 2:18 UTC
22 points
6 comments3 min readLW link
(www.bhauth.com)

Bayesian in­fer­ence with­out priors

DanielFilan24 Apr 2024 23:50 UTC
26 points
8 comments8 min readLW link
(danielfilan.com)

The In­ner Ring by C. S. Lewis

Saul Munn24 Apr 2024 22:48 UTC
69 points
6 comments13 min readLW link
(www.lewissociety.org)

This is Water by David Foster Wallace

Nathan Young24 Apr 2024 21:21 UTC
58 points
16 comments13 min readLW link
(fs.blog)

Is be­ing a trans woman (or just low-T) +20 IQ?

lemonhope24 Apr 2024 20:04 UTC
6 points
29 comments1 min readLW link

Be­ta­dine oral rinses for covid and other viral infections

Elizabeth24 Apr 2024 17:50 UTC
22 points
3 comments5 min readLW link
(acesounderglass.com)

At last! ChatGPT does, shall we say, in­ter­est­ing imi­ta­tions of “Kubla Khan”

Bill Benzon24 Apr 2024 14:56 UTC
−3 points
0 comments4 min readLW link

Magic by forgetting

avturchin24 Apr 2024 14:32 UTC
18 points
39 comments4 min readLW link

Changes in Col­lege Admissions

Zvi24 Apr 2024 13:50 UTC
50 points
11 comments39 min readLW link
(thezvi.wordpress.com)

1-page out­line of Car­l­smith’s oth­er­ness and con­trol series

Nathan Young24 Apr 2024 11:25 UTC
22 points
3 comments3 min readLW link

How to use and in­ter­pret ac­ti­va­tion patching

24 Apr 2024 8:35 UTC
12 points
0 comments18 min readLW link

AI Gen­er­ated Mu­sic as a Method of In­stal­ling Essen­tial Ra­tion­al­ist Skills

keltan24 Apr 2024 7:48 UTC
13 points
3 comments1 min readLW link

Elec­tronic Harp Man­dolin Prototype

jefftk24 Apr 2024 2:20 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

[Question] Ex­am­ples of Highly Coun­ter­fac­tual Dis­cov­er­ies?

johnswentworth23 Apr 2024 22:19 UTC
194 points
101 comments1 min readLW link

[Question] Is there soft­ware to prac­tice read­ing ex­pres­sions?

lsusr23 Apr 2024 21:53 UTC
37 points
10 comments1 min readLW link