AI Safety at the Frontier: Paper Highlights, August ’24

gasteigerjo · Sep 3, 2024, 7:17 PM
28 points
0 comments · 6 min read · LW link
(aisafetyfrontier.substack.com)

The Checklist: What Succeeding at AI Safety Will Involve

Sam Bowman · Sep 3, 2024, 6:18 PM
142 points
49 comments · 22 min read · LW link
(sleepinyourhat.github.io)

Democracy beyond majoritarianism

Arturo Macias · Sep 3, 2024, 3:10 PM
5 points
2 comments · 4 min read · LW link

On the UBI Paper

Zvi · Sep 3, 2024, 2:50 PM
57 points
6 comments · 19 min read · LW link
(thezvi.wordpress.com)

An Opinionated Look at Inference Rules

Gianluca Calcagni · Sep 3, 2024, 1:32 PM
−5 points
2 comments · 13 min read · LW link

Announcing the PIBBSS Symposium ’24!

Sep 3, 2024, 11:19 AM
19 points
0 comments · 3 min read · LW link

Reducing global AI competition through the Commerce Control List and Immigration reform: a dual-pronged approach

Ben Smith · Sep 3, 2024, 5:28 AM
16 points
2 comments · 1 min read · LW link

How I got 4.2M YouTube views without making a single video

Closed Limelike Curves · Sep 3, 2024, 3:52 AM
380 points
36 comments · 1 min read · LW link

Duped: AI and the Making of a Global Suicide Cult

izzyness · Sep 2, 2024, 6:51 PM
−8 points
0 comments · 1 min read · LW link

A gentle introduction to sparse autoencoders

Nick Jiang · Sep 2, 2024, 6:11 PM
9 points
0 comments · 6 min read · LW link

What makes math problems hard for reinforcement learning: a case study

Anibal, Bartek, Sergei, Shehper and Piotr · Sep 2, 2024, 6:11 PM
1 point
0 comments · 2 min read · LW link
(arxiv.org)

Survey: How Do Elite Chinese Students Feel About the Risks of AI?

Nick Corvino · Sep 2, 2024, 6:11 PM
141 points
13 comments · 10 min read · LW link

Data-driven donations to help Democrats win federal elections: an update

Michael Cohn · Sep 2, 2024, 4:32 PM
−1 points
2 comments · 1 min read · LW link
(perplexedguide.net)

[Question] What are the effective utilitarian pros and cons of having children (in rich countries)?

SpectrumDT · Sep 2, 2024, 10:01 AM
2 points
4 comments · 1 min read · LW link

My decomposition of the alignment problem

Daniel C · Sep 2, 2024, 12:21 AM
20 points
22 comments · 13 min read · LW link

DC Forecasting & Prediction Markets Meetup

David Glidden · Sep 2, 2024, 12:00 AM
1 point
0 comments · 1 min read · LW link

A primer on the next generation of antibodies

Abhishaike Mahajan · Sep 1, 2024, 10:37 PM
25 points
0 comments · 19 min read · LW link
(www.owlposting.com)

[Question] Who looked into extreme nuclear meltdowns?

Remmelt · Sep 1, 2024, 9:38 PM
2 points
8 comments · 1 min read · LW link

Redundant Attention Heads in Large Language Models For In Context Learning

skunnavakkam · Sep 1, 2024, 8:08 PM
7 points
1 comment · 4 min read · LW link
(skunnavakkam.github.io)

The Role of Transparency and Explainability in Responsible NLP

RAMEBC78 · Sep 1, 2024, 8:08 PM
−3 points
1 comment · 5 min read · LW link

Book Review: What Even Is Gender?

Joey Marcellino · Sep 1, 2024, 4:09 PM
31 points
14 comments · 12 min read · LW link

Can a Bayesian Oracle Prevent Harm from an Agent? (Bengio et al. 2024)

mattmacdermott · Sep 1, 2024, 7:46 AM
26 points
0 comments · 5 min read · LW link
(yoshuabengio.org)

San Francisco ACX Meetup “First Saturday”

Nate Sternberg · Sep 1, 2024, 4:48 AM
2 points
1 comment · 1 min read · LW link

Forecasting One-Shot Games

Raemon · Aug 31, 2024, 11:10 PM
46 points
0 comments · 7 min read · LW link

On epistemic autonomy

sanyer · Aug 31, 2024, 6:50 PM
11 points
0 comments · 2 min read · LW link

Epistemic states as a potential benign prior

Tamsin Leake · Aug 31, 2024, 6:26 PM
31 points
2 comments · 8 min read · LW link
(carado.moe)

My Model of Epistemology

adamShimi · Aug 31, 2024, 5:01 PM
35 points
0 comments · 8 min read · LW link
(epistemologicalfascinations.substack.com)

Verification methods for international AI agreements

Akash · Aug 31, 2024, 2:58 PM
14 points
1 comment · 4 min read · LW link
(arxiv.org)

Fake Blog Posts as a Problem Solving Device

silentbob · Aug 31, 2024, 9:22 AM
7 points
0 comments · 2 min read · LW link

Actually Rational & Kind Sequences Reading Group

segfault · Aug 31, 2024, 4:21 AM
−55 points
1 comment · 1 min read · LW link

Anthropic is being sued for copying books to train Claude

Remmelt · Aug 31, 2024, 2:57 AM
20 points
4 comments · 2 min read · LW link
(fingfx.thomsonreuters.com)

Book review: On the Edge

PeterMcCluskey · Aug 30, 2024, 10:18 PM
34 points
0 comments · 9 min read · LW link
(bayesianinvestor.com)

Can Large Language Models effectively identify cybersecurity risks?

emile delcourt · Aug 30, 2024, 8:20 PM
18 points
0 comments · 11 min read · LW link

Singular learning theory: exercises

Zach Furman · Aug 30, 2024, 8:00 PM
88 points
5 comments · 14 min read · LW link

AI for Bio: State Of The Field

sarahconstantin · Aug 30, 2024, 6:00 PM
73 points
2 comments · 15 min read · LW link
(sarahconstantin.substack.com)

Multi-Tiered AI

Timothy Bruneau · Aug 30, 2024, 5:46 PM
1 point
0 comments · 2 min read · LW link

I uni­ver­sally try­ing to re­ject the Mind Pro­jec­tion Fal­lacy—consequences

YanLyutnev · Aug 30, 2024, 5:42 PM
−4 points
0 comments · 9 min read · LW link

AIS terminology proposal: standardize terms for probability ranges

eggsyntax · Aug 30, 2024, 3:43 PM
30 points
12 comments · 2 min read · LW link

[Question] Does a time-reversible physical law/Cellular Automaton always imply the First Law of Thermodynamics?

Noosphere89 · Aug 30, 2024, 3:12 PM
7 points
11 comments · 1 min read · LW link

Principles for the AGI Race

William_S · Aug 30, 2024, 2:29 PM
245 points
13 comments · 18 min read · LW link

Congressional Insider Trading

Maxwell Tabarrok · Aug 30, 2024, 1:32 PM
57 points
6 comments · 7 min read · LW link
(www.maximum-progress.com)

[Question] Thoughts on paper “How Organisms Come to Know the World: Fundamental Limits on Artificial General Intelligence”?

mikbp · Aug 30, 2024, 9:04 AM
2 points
3 comments · 1 min read · LW link

Are LLMs on the Path to AGI?

Davidmanheim · Aug 30, 2024, 3:14 AM
14 points
2 comments · 5 min read · LW link

Nursing doubts

dynomight · Aug 30, 2024, 2:25 AM
144 points
23 comments · 9 min read · LW link
(dynomight.net)

Free Will and Dodging Anvils: AIXI Off-Policy

Cole Wyeth · Aug 29, 2024, 10:42 PM
37 points
12 comments · 9 min read · LW link

Seat­tle USA—ACX Mee­tups Every­where Fall 2024

a7x · Aug 29, 2024, 9:42 PM
2 points
0 comments · 1 min read · LW link

Ran­cho Cu­ca­monga USA—ACX Mee­tups Every­where Fall 2024

Nelson James Horsley · Aug 29, 2024, 7:18 PM
1 point
0 comments · 1 min read · LW link

Reno USA—ACX Mee­tups Every­where Fall 2024

Daniel Gold · Aug 29, 2024, 7:18 PM
1 point
0 comments · 1 min read · LW link

Ta­marindo Costa Rica—ACX Mee­tups Every­where Fall 2024

timeless · Aug 29, 2024, 6:44 PM
1 point
0 comments · 1 min read · LW link

San­ti­ago Chile—ACX Mee­tups Every­where Fall 2024

Iñaki · Aug 29, 2024, 6:44 PM
1 point
0 comments · 1 min read · LW link