Deep­Seek beats o1-pre­view on math, ties on cod­ing; will re­lease weights

Zach Stein-Perlman20 Nov 2024 23:50 UTC
111 points
26 comments1 min readLW link

Ex­pected Utility, Geo­met­ric Utility, and Other Equiv­a­lent Representations

StrivingForLegibility20 Nov 2024 23:28 UTC
10 points
0 comments11 min readLW link

[Question] Green thumb

Pug stanky20 Nov 2024 21:52 UTC
−12 points
1 comment2 min readLW link

Cost, Not Sacrifice

Joe Rogero20 Nov 2024 21:32 UTC
74 points
13 comments1 min readLW link
(subatomicarticles.com)

[Question] How can we pre­vent AGI value drift?

Dakara20 Nov 2024 18:19 UTC
14 points
5 comments1 min readLW link

China Hawks are Man­u­fac­tur­ing an AI Arms Race

garrison20 Nov 2024 18:17 UTC
136 points
42 comments1 min readLW link
(garrisonlovely.substack.com)

Why I Think All The Species Of Sig­nifi­cantly De­bated Con­scious­ness Are Con­scious And Suffer Intensely

omnizoid20 Nov 2024 16:48 UTC
25 points
5 comments33 min readLW link

as­pira­tional leadership

dhruvmethi20 Nov 2024 16:07 UTC
2 points
0 comments7 min readLW link

Zvi’s Thoughts on His 2nd Round of SFF

Zvi20 Nov 2024 13:40 UTC
91 points
2 comments10 min readLW link
(thezvi.wordpress.com)

A Lit­tle Depth Goes a Long Way: the Ex­pres­sive Power of Log-Depth Transformers

Bogdan Ionut Cirstea20 Nov 2024 11:48 UTC
16 points
0 comments1 min readLW link
(openreview.net)

[Question] What changes should hap­pen in the HHS?

ChristianKl20 Nov 2024 11:04 UTC
0 points
19 comments1 min readLW link

[Question] What are the good ra­tio­nal­ity films?

Ben Pace20 Nov 2024 6:04 UTC
82 points
53 comments1 min readLW link

Valence Need Not Be Bounded; Utility Need Not Synthesize

Lorec20 Nov 2024 1:37 UTC
8 points
0 comments6 min readLW link

Value/​Utility: A History

Lorec19 Nov 2024 23:01 UTC
9 points
0 comments10 min readLW link

Why Don’t We Just… Shog­goth+Face+Para­phraser?

19 Nov 2024 20:53 UTC
123 points
52 comments14 min readLW link

Every niche event should also be a meetup

DMMF19 Nov 2024 20:47 UTC
16 points
0 comments3 min readLW link
(danfrank.ca)

U.S.-China Eco­nomic and Se­cu­rity Re­view Com­mis­sion pushes Man­hat­tan Pro­ject-style AI initiative

Phib19 Nov 2024 18:42 UTC
56 points
7 comments1 min readLW link

In­trin­sic Power-Seek­ing: AI Might Seek Power for Power’s Sake

TurnTrout19 Nov 2024 18:36 UTC
40 points
5 comments1 min readLW link
(turntrout.com)

Evolu­tion’s se­lec­tion tar­get de­pends on your weighting

tailcalled19 Nov 2024 18:24 UTC
23 points
22 comments1 min readLW link

AISN #44: The Trump Cir­cle on AI Safety Plus, Chi­nese re­searchers used Llama to cre­ate a mil­i­tary tool for the PLA, a Google AI sys­tem dis­cov­ered a zero-day cy­ber­se­cu­rity vuln­er­a­bil­ity, and Com­plex Sys­tems

19 Nov 2024 16:36 UTC
9 points
0 comments5 min readLW link
(newsletter.safe.ai)

Jakarta ACX De­cem­ber 2024 Meetup

Aud19 Nov 2024 15:01 UTC
1 point
0 comments1 min readLW link

Vi­su­al­iz­ing small At­ten­tion-only Transformers

WCargo19 Nov 2024 9:37 UTC
4 points
0 comments8 min readLW link

Amer­i­cans are fat and sick—and it’s their fault…right?

Declan Molony19 Nov 2024 6:41 UTC
6 points
3 comments7 min readLW link

An­nounc­ing the CLR Foun­da­tions Course and CLR S-Risk Seminars

JamesFaville19 Nov 2024 1:18 UTC
18 points
0 comments1 min readLW link

No Elec­tric­ity in Manchuria

winstonBosan19 Nov 2024 1:11 UTC
25 points
0 comments5 min readLW link

Look­ing back on the Fu­ture of Hu­man­ity In­sti­tute—Asterisk

jakeeaton19 Nov 2024 0:44 UTC
48 points
0 comments1 min readLW link

Don’t Dis­miss on Epistemics

ggex19 Nov 2024 0:44 UTC
8 points
3 comments2 min readLW link

Train­ing AI agents to solve hard prob­lems could lead to Scheming

19 Nov 2024 0:10 UTC
61 points
12 comments28 min readLW link

Proac­tive ‘If-Then’ Safety Cases

Nathan Helm-Burger18 Nov 2024 21:16 UTC
8 points
0 comments4 min readLW link

[Question] Will Orion/​Gem­ini 2/​Llama-4 out­perform o1

LuigiPagani18 Nov 2024 21:15 UTC
1 point
3 comments1 min readLW link

How to use bright light to im­prove your life.

Nat Martin18 Nov 2024 19:32 UTC
40 points
10 comments10 min readLW link

So­cial events with plau­si­ble deniability

Chipmonk18 Nov 2024 18:25 UTC
25 points
24 comments1 min readLW link
(chrislakin.blog)

How likely is brain preser­va­tion to work?

Andy_McKenzie18 Nov 2024 16:58 UTC
25 points
3 comments6 min readLW link

Why im­perfect ad­ver­sar­ial ro­bust­ness doesn’t doom AI control

18 Nov 2024 16:05 UTC
61 points
26 comments2 min readLW link

Eth­i­cal Im­pli­ca­tions of the Quan­tum Multiverse

Jonah Wilberg18 Nov 2024 16:00 UTC
7 points
22 comments6 min readLW link

Re­duc­ing x-risk might be ac­tively harmful

MountainPath18 Nov 2024 14:25 UTC
3 points
5 comments1 min readLW link

Monthly Roundup #24: Novem­ber 2024

Zvi18 Nov 2024 13:20 UTC
44 points
14 comments50 min readLW link
(thezvi.wordpress.com)

A Straight­for­ward Ex­pla­na­tion of the Good Reg­u­la­tor Theorem

Alfred Harwood18 Nov 2024 12:45 UTC
24 points
3 comments14 min readLW link

The Choice Transition

18 Nov 2024 12:30 UTC
44 points
4 comments15 min readLW link
(strangecities.substack.com)

Chat Bankman-Fried: an Ex­plo­ra­tion of LLM Align­ment in Finance

claudia.biancotti18 Nov 2024 9:38 UTC
26 points
4 comments1 min readLW link

Pro­posal to in­crease fer­til­ity: Univer­sity par­ent clubs

Fluffnutt18 Nov 2024 4:21 UTC
17 points
3 comments1 min readLW link

A small im­prove­ment to Wikipe­dia page on Pareto Efficiency

ektimo18 Nov 2024 2:13 UTC
7 points
0 comments1 min readLW link

[Question] Why is Gem­ini tel­ling the user to die?

Burny18 Nov 2024 1:44 UTC
13 points
1 comment1 min readLW link

“It’s a 10% chance which I did 10 times, so it should be 100%”

egor.timatkov18 Nov 2024 1:14 UTC
150 points
57 comments2 min readLW link

The Catas­tro­phe of Shiny Objects

mindprison18 Nov 2024 0:24 UTC
−12 points
0 comments3 min readLW link

Do Deep Neu­ral Net­works Have Brain-like Rep­re­sen­ta­tions?: A Sum­mary of Disagreements

Joseph Emerson18 Nov 2024 0:07 UTC
9 points
0 comments26 min readLW link

Truth Ter­mi­nal: A re­con­struc­tion of events

17 Nov 2024 23:51 UTC
2 points
1 comment7 min readLW link

Which AI Safety Bench­mark Do We Need Most in 2025?

17 Nov 2024 23:50 UTC
2 points
2 comments8 min readLW link

“The Solomonoff Prior is Mal­ign” is a spe­cial case of a sim­pler argument

David Matolcsi17 Nov 2024 21:32 UTC
124 points
44 comments12 min readLW link

Chess As The Model Game

criticalpoints17 Nov 2024 19:45 UTC
19 points
0 comments8 min readLW link
(eregis.github.io)