[Question] (i no longer endorse this post) - cryonics is a pascal’s mugging?

KvmanThinking, 25 Oct 2024 23:24 UTC
−12 points
4 comments, 1 min read, LW link

A Case for Conscious Significance rather than Free Will.

James Stephen Brown, 25 Oct 2024 23:20 UTC
12 points
2 comments, 6 min read, LW link

Introducing Kairos: a new AI safety fieldbuilding organization (the new home for SPAR and FSP)

agucova, 25 Oct 2024 21:59 UTC
6 points
0 comments, 1 min read, LW link

Brief analysis of OP Technical AI Safety Funding

22tom, 25 Oct 2024 19:37 UTC
64 points
5 comments, 1 min read, LW link

UK AISI: Early lessons from evaluating frontier AI systems

Zach Stein-Perlman, 25 Oct 2024 19:00 UTC
26 points
0 comments, 2 min read, LW link
(www.aisi.gov.uk)

Lab governance reading list

Zach Stein-Perlman, 25 Oct 2024 18:00 UTC
20 points
3 comments, 1 min read, LW link

Enabling New Applications with Today’s Mechanistic Interpretability Toolkit

ananya_joshi, 25 Oct 2024 17:53 UTC
3 points
0 comments, 3 min read, LW link

OpenAI’s cybersecurity is probably regulated by NIS Regulations

Adam Jones, 25 Oct 2024 11:06 UTC
11 points
2 comments, 2 min read, LW link
(adamjones.me)

Linkpost: Memorandum on Advancing the United States’ Leadership in Artificial Intelligence

Nisan, 25 Oct 2024 4:37 UTC
60 points
2 comments, 1 min read, LW link
(www.whitehouse.gov)

Making a Pedalboard

jefftk, 25 Oct 2024 0:10 UTC
10 points
0 comments, 1 min read, LW link
(www.jefftk.com)

What You Can Give Instead of Advice

Karl Faulks, 24 Oct 2024 23:10 UTC
13 points
2 comments, 1 min read, LW link

[Question] is it possible to comment anonymously on a post?

KvmanThinking, 24 Oct 2024 22:24 UTC
2 points
2 comments, 1 min read, LW link

A Logical Proof for the Emergence and Substrate Independence of Sentience

rife, 24 Oct 2024 21:08 UTC
4 points
31 comments, 1 min read, LW link
(awakenmoon.ai)

Against Job Boards: Human Capital and the Legibility Trap

vaishnav92, 24 Oct 2024 20:50 UTC
6 points
1 comment, 5 min read, LW link

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-Perlman, 24 Oct 2024 20:30 UTC
42 points
13 comments, 1 min read, LW link
(www.iaps.ai)

Our Digital and Biological Children

Eneasz, 24 Oct 2024 18:36 UTC
28 points
0 comments, 3 min read, LW link
(deathisbad.substack.com)

Reflections on the Metastrategies Workshop

gw, 24 Oct 2024 18:30 UTC
41 points
5 comments, 11 min read, LW link

How Should We Measure Intelligence Models: Why Use Frequency of Elemental Information Operations

hwj20, 24 Oct 2024 16:54 UTC
1 point
0 comments, 5 min read, LW link

Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models.

happy friday, 24 Oct 2024 16:54 UTC
8 points
0 comments, 1 min read, LW link

Balancing Label Quantity and Quality for Scalable Elicitation

Alex Mallen, 24 Oct 2024 16:49 UTC
31 points
1 comment, 2 min read, LW link

Claude Sonnet 3.5.1 and Haiku 3.5

Zvi, 24 Oct 2024 14:50 UTC
51 points
9 comments, 16 min read, LW link
(thezvi.wordpress.com)

Big tech transitions are slow (with implications for AI)

jasoncrawford, 24 Oct 2024 14:25 UTC
36 points
16 comments, 4 min read, LW link
(blog.rootsofprogress.org)

Derivative AT a discontinuity

Alok Singh, 24 Oct 2024 2:48 UTC
9 points
5 comments, 10 min read, LW link

how to rapidly assimilate new information

dhruvmethi, 24 Oct 2024 2:18 UTC
9 points
3 comments, 8 min read, LW link

Ex-OpenAI researcher says OpenAI mass-violated copyright law

Remmelt, 24 Oct 2024 1:00 UTC
−2 points
0 comments, 1 min read, LW link
(suchir.net)

Miles Brundage resigned from OpenAI, and his AGI readiness team was disbanded

garrison, 23 Oct 2024 23:40 UTC
118 points
1 comment, 7 min read, LW link
(garrisonlovely.substack.com)

A metaphor: what “green lights” for AGI would look like

Lorec, 23 Oct 2024 23:24 UTC
−1 points
6 comments, 2 min read, LW link

Motte-and-Bailey: a Short Explanation

Lorec, 23 Oct 2024 22:29 UTC
12 points
0 comments, 1 min read, LW link

Self-prediction acts as an emergent regularizer

23 Oct 2024 22:27 UTC
84 points
4 comments, 4 min read, LW link

Technical Risks of (Lethal) Autonomous Weapons Systems

Heramb, 23 Oct 2024 20:41 UTC
2 points
0 comments, 1 min read, LW link
(encodejustice.org)

Appealing to the Public

jefftk, 23 Oct 2024 19:00 UTC
16 points
0 comments, 5 min read, LW link
(www.jefftk.com)

Introducing Transluce — A Letter from the Founders

jsteinhardt, 23 Oct 2024 18:10 UTC
74 points
2 comments, 3 min read, LW link
(bounded-regret.ghost.io)

Are we dropping the ball on Recommendation AIs?

Charbel-Raphaël, 23 Oct 2024 17:48 UTC
41 points
17 comments, 6 min read, LW link

A bird’s eye view of ARC’s research

Jacob_Hilton, 23 Oct 2024 15:50 UTC
119 points
12 comments, 7 min read, LW link
(www.alignment.org)

[Question] Artificial V/S Organoid Intelligence

10xyz, 23 Oct 2024 14:31 UTC
5 points
0 comments, 1 min read, LW link

AI safety tax dynamics

owencb, 23 Oct 2024 12:18 UTC
22 points
0 comments, 6 min read, LW link
(strangecities.substack.com)

What is malevolence? On the nature, measurement, and distribution of dark traits

23 Oct 2024 8:41 UTC
76 points
15 comments, 1 min read, LW link

Join a LessWrong Team for the Unaging System Challenge

Crissman, 23 Oct 2024 6:01 UTC
15 points
5 comments, 1 min read, LW link

Word Spaghetti

Gordon Seidoh Worley, 23 Oct 2024 5:39 UTC
18 points
9 comments, 3 min read, LW link

Monosemanticity & Quantization

Rahul Chand, 22 Oct 2024 22:57 UTC
1 point
0 comments, 9 min read, LW link

[Question] What is the alpha in one bit of evidence?

J Bostock, 22 Oct 2024 21:57 UTC
20 points
13 comments, 1 min read, LW link

Catastrophic sabotage as a major threat model for human-level AI systems

evhub, 22 Oct 2024 20:57 UTC
91 points
11 comments, 15 min read, LW link

Why I quit effective altruism, and why Timothy Telleen-Lawton is staying (for now)

Elizabeth, 22 Oct 2024 18:20 UTC
75 points
79 comments, 1 min read, LW link
(acesounderglass.com)

Decision-Making Under Uncertainty: Lessons From AI

Jonasb, 22 Oct 2024 17:54 UTC
−1 points
0 comments, 5 min read, LW link
(www.denominations.io)

Testing Genetic Engineering Detection with Spike-Ins

jefftk, 22 Oct 2024 17:20 UTC
9 points
0 comments, 1 min read, LW link
(naobservatory.org)

Predictions as Public Works Project — What Metaculus Is Building Next

ChristianWilliams, 22 Oct 2024 16:35 UTC
4 points
0 comments, 1 min read, LW link
(www.metaculus.com)

Gorges of gender on a terrain of traits

dkl9, 22 Oct 2024 16:18 UTC
−7 points
1 comment, 3 min read, LW link
(dkl9.net)

A Defense of Peer Review

22 Oct 2024 16:16 UTC
23 points
1 comment, 22 min read, LW link
(www.asimov.press)

BIG-Bench Canary Contamination in GPT-4

Jozdien, 22 Oct 2024 15:40 UTC
123 points
13 comments, 4 min read, LW link

[Paper Blogpost] When Your AIs Deceive You: Challenges with Partial Observability in RLHF

Leon Lang, 22 Oct 2024 13:57 UTC
50 points
1 comment, 18 min read, LW link
(arxiv.org)