Anti MMAcevedo Protocol

Logan Zoellner16 Apr 2024 22:32 UTC
1 point
1 comment8 min readLW link

Trans­form­ers Rep­re­sent Belief State Geom­e­try in their Resi­d­ual Stream

Adam Shai16 Apr 2024 21:16 UTC
411 points
100 comments12 min readLW link

Tinker

Richard_Ngo16 Apr 2024 18:26 UTC
38 points
0 comments1 min readLW link
(press.asimov.com)

Paul Chris­ti­ano named as US AI Safety In­sti­tute Head of AI Safety

Joel Burget16 Apr 2024 16:22 UTC
256 points
58 comments1 min readLW link
(www.commerce.gov)

Creat­ing un­re­stricted AI Agents with Com­mand R+

Simon Lermen16 Apr 2024 14:52 UTC
77 points
13 comments5 min readLW link

What should the EA com­mu­nity learn from the FTX /​ SBF dis­aster? An in-depth dis­cus­sion with Will MacAskill on the Clearer Think­ing pod­cast

spencerg16 Apr 2024 13:11 UTC
20 points
0 comments1 min readLW link
(podcast.clearerthinking.org)

{Book Sum­mary} The Art of Gathering

Tristan Williams16 Apr 2024 10:48 UTC
28 points
0 comments1 min readLW link

Es­say com­pe­ti­tion on the Au­toma­tion of Wis­dom and Philos­o­phy — $25k in prizes

16 Apr 2024 10:10 UTC
82 points
12 comments8 min readLW link
(blog.aiimpacts.org)

An­nounc­ing SPAR Sum­mer 2024!

laurenmarie1216 Apr 2024 8:30 UTC
30 points
2 comments1 min readLW link

The ar­gu­ment for near-term hu­man dis­em­pow­er­ment through AI

Chris_Leong16 Apr 2024 4:50 UTC
21 points
2 comments1 min readLW link
(link.springer.com)

My ex­pe­rience us­ing fi­nan­cial com­mit­ments to over­come akrasia

William Howard15 Apr 2024 22:57 UTC
137 points
31 comments18 min readLW link

An eval­u­a­tion of cir­cuit eval­u­a­tion metrics

15 Apr 2024 19:38 UTC
18 points
0 comments4 min readLW link

Ex­per­i­ments with an al­ter­na­tive method to pro­mote spar­sity in sparse autoencoders

Eoin Farrell15 Apr 2024 18:21 UTC
29 points
7 comments12 min readLW link

Effec­tively Han­dling Disagree­ments—In­tro­duc­ing a New Workshop

Camille Berger 15 Apr 2024 16:33 UTC
37 points
2 comments7 min readLW link

Four Lo­cal Gigs

jefftk15 Apr 2024 16:00 UTC
8 points
0 comments1 min readLW link
(www.jefftk.com)

Tak­ing into ac­count prefer­ences of past selves

Jacob G-W15 Apr 2024 13:15 UTC
14 points
9 comments7 min readLW link

Monthly Roundup #17: April 2024

Zvi15 Apr 2024 12:10 UTC
54 points
4 comments76 min readLW link
(thezvi.wordpress.com)

Re­con­sider the anti-cav­ity bac­te­ria if you are Asian

Lao Mein15 Apr 2024 7:02 UTC
168 points
43 comments4 min readLW link

An­thropic AI made the right call

bhauth15 Apr 2024 0:39 UTC
22 points
20 comments1 min readLW link

May 2024 New­ton meetup???

duck_master14 Apr 2024 22:28 UTC
2 points
0 comments1 min readLW link

Clip­board Filtering

jefftk14 Apr 2024 20:50 UTC
25 points
1 comment1 min readLW link
(www.jefftk.com)

A High De­cou­pling Failure

Maxwell Tabarrok14 Apr 2024 19:46 UTC
37 points
5 comments3 min readLW link
(www.maximum-progress.com)

ACX Zwolle meetup

Shaedys14 Apr 2024 13:09 UTC
7 points
0 comments1 min readLW link

A quick ex­per­i­ment on LMs’ in­duc­tive bi­ases in perform­ing search

Alex Mallen14 Apr 2024 3:41 UTC
32 points
2 comments4 min readLW link

UDT1.01 Essen­tial Mis­cel­lanea (4/​10)

Diffractor14 Apr 2024 2:23 UTC
19 points
0 comments10 min readLW link

[Cos­mol­ogy Talks] New Prob­a­bil­ity Ax­ioms Could Fix Cos­mol­ogy’s Mul­ti­verse (Par­tially) - Sylvia Wenmackers

mako yass14 Apr 2024 1:26 UTC
18 points
2 comments1 min readLW link
(www.youtube.com)

Speedrun ru­iner re­search idea

lemonhope13 Apr 2024 23:42 UTC
2 points
11 comments2 min readLW link

Text Posts from the Kids Group: 2020

jefftk13 Apr 2024 22:30 UTC
69 points
3 comments19 min readLW link
(www.jefftk.com)

[Question] What con­vinc­ing warn­ing shot could help pre­vent ex­tinc­tion from AI?

13 Apr 2024 18:09 UTC
105 points
18 comments2 min readLW link

My ex­pe­rience at ML4Good AI Safety Bootcamp

TheManxLoiner13 Apr 2024 10:55 UTC
20 points
0 comments5 min readLW link

Con­se­quen­tial­ism is a com­pass, not a judge

Neil 13 Apr 2024 10:47 UTC
26 points
6 comments2 min readLW link

Carl Sa­gan, nuk­ing the moon, and not nuk­ing the moon

eukaryote13 Apr 2024 4:08 UTC
103 points
8 comments6 min readLW link
(eukaryotewritesblog.com)

[Question] Bar­cod­ing LLM Train­ing Data Sub­sets. Any­one try­ing this for in­ter­pretabil­ity?

right..enough?13 Apr 2024 3:09 UTC
7 points
0 comments7 min readLW link

Prompts for Big-Pic­ture Planning

Raemon13 Apr 2024 3:04 UTC
72 points
1 comment3 min readLW link

Claude wants to be conscious

Joe Kwon13 Apr 2024 1:40 UTC
2 points
8 comments6 min readLW link

Things Solenoid Narrates

Solenoid_Entity12 Apr 2024 23:57 UTC
45 points
2 comments2 min readLW link

MIRI’s April 2024 Newsletter

Harlan12 Apr 2024 23:38 UTC
95 points
0 comments3 min readLW link
(intelligence.org)

Poker, Beef Wel­ling­ton, and Mount Stupid

boghan12 Apr 2024 18:06 UTC
10 points
2 comments7 min readLW link

Forecasting

A*12 Apr 2024 17:55 UTC
4 points
0 comments1 min readLW link

Gen­er­al­ized Stat Mech: The Boltz­mann Approach

12 Apr 2024 17:47 UTC
68 points
7 comments20 min readLW link

AISN #33: Re­assess­ing AI and Biorisk Plus, Con­soli­da­tion in the Cor­po­rate AI Land­scape, and Na­tional In­vest­ments in AI

12 Apr 2024 16:10 UTC
13 points
0 comments9 min readLW link
(newsletter.safe.ai)

“How the Gaza Health Ministry Fakes Ca­su­alty Num­bers”

CronoDAS12 Apr 2024 5:57 UTC
−10 points
9 comments1 min readLW link
(www.tabletmag.com)

UDT1.01: Plannable and Un­planned Ob­ser­va­tions (3/​10)

Diffractor12 Apr 2024 5:24 UTC
31 points
0 comments7 min readLW link

Re­port: Eval­u­at­ing an AI Chip Regis­tra­tion Policy

Deric Cheng12 Apr 2024 4:39 UTC
25 points
0 comments5 min readLW link
(www.convergenceanalysis.org)

In­terfer­ence Issues

jefftk12 Apr 2024 2:30 UTC
17 points
1 comment3 min readLW link
(www.jefftk.com)

A D&D.Sci Dodecalogue

abstractapplic12 Apr 2024 1:10 UTC
53 points
0 comments3 min readLW link

[Question] Up­com­ing un­am­bigu­ously good tech pos­si­bil­ities? (Like eg in­door plumb­ing)

lemonhope11 Apr 2024 23:14 UTC
9 points
6 comments1 min readLW link

Leave No Con­text Be­hind—A Comment

Gunnar_Zarncke11 Apr 2024 22:50 UTC
17 points
0 comments2 min readLW link

AXRP Epi­sode 27 - AI Con­trol with Buck Sh­legeris and Ryan Greenblatt

DanielFilan11 Apr 2024 21:30 UTC
69 points
10 comments107 min readLW link

ChatGPT defines 10 con­crete terms: gener­i­cally, for 5- and 11-year-olds, and for a sci­en­tist

Bill Benzon11 Apr 2024 20:27 UTC
3 points
9 comments6 min readLW link