All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30

Anti MMAcevedo Protocol

Logan Zoellner16 Apr 2024 22:32 UTC

1 point

1 comment8 min readLW link

Transformers Represent Belief State Geometry in their Residual Stream

Adam Shai16 Apr 2024 21:16 UTC

412 points

100 comments12 min readLW link

Tinker

Richard_Ngo16 Apr 2024 18:26 UTC

38 points

0 comments1 min readLW link

(press.asimov.com)

Paul Christiano named as US AI Safety Institute Head of AI Safety

Joel Burget16 Apr 2024 16:22 UTC

256 points

58 comments1 min readLW link

(www.commerce.gov)

Creating unrestricted AI Agents with Command R+

Simon Lermen16 Apr 2024 14:52 UTC

77 points

13 comments5 min readLW link

What should the EA community learn from the FTX / SBF disaster? An in-depth discussion with Will MacAskill on the Clearer Thinking podcast

spencerg16 Apr 2024 13:11 UTC

20 points

0 comments1 min readLW link

(podcast.clearerthinking.org)

{Book Summary} The Art of Gathering

Tristan Williams16 Apr 2024 10:48 UTC

28 points

0 comments1 min readLW link

Essay competition on the Automation of Wisdom and Philosophy — $25k in prizes

owencb and AI Impacts

16 Apr 2024 10:10 UTC

82 points

12 comments8 min readLW link

(blog.aiimpacts.org)

Announcing SPAR Summer 2024!

laurenmarie1216 Apr 2024 8:30 UTC

30 points

2 comments1 min readLW link

The argument for near-term human disempowerment through AI

Chris_Leong16 Apr 2024 4:50 UTC

21 points

2 comments1 min readLW link

(link.springer.com)

My experience using financial commitments to overcome akrasia

William Howard15 Apr 2024 22:57 UTC

137 points

33 comments18 min readLW link

An evaluation of circuit evaluation metrics

Iván Arcuschin, Niels uit de Bos and Adrià Garriga-alonso

15 Apr 2024 19:38 UTC

18 points

0 comments4 min readLW link

Experiments with an alternative method to promote sparsity in sparse autoencoders

Eoin Farrell15 Apr 2024 18:21 UTC

29 points

7 comments12 min readLW link

Effectively Handling Disagreements—Introducing a New Workshop

Camille Berger 15 Apr 2024 16:33 UTC

37 points

2 comments7 min readLW link

Four Local Gigs

jefftk15 Apr 2024 16:00 UTC

8 points

0 comments1 min readLW link

(www.jefftk.com)

Taking into account preferences of past selves

Jacob G-W15 Apr 2024 13:15 UTC

14 points

9 comments7 min readLW link

Monthly Roundup #17: April 2024

Zvi15 Apr 2024 12:10 UTC

54 points

4 comments76 min readLW link

(thezvi.wordpress.com)

Reconsider the anti-cavity bacteria if you are Asian

Lao Mein15 Apr 2024 7:02 UTC

168 points

43 comments4 min readLW link

Anthropic AI made the right call

bhauth15 Apr 2024 0:39 UTC

22 points

20 comments1 min readLW link

May 2024 Newton meetup???

duck_master14 Apr 2024 22:28 UTC

2 points

0 comments1 min readLW link

Clipboard Filtering

jefftk14 Apr 2024 20:50 UTC

25 points

1 comment1 min readLW link

(www.jefftk.com)

A High Decoupling Failure

Maxwell Tabarrok14 Apr 2024 19:46 UTC

37 points

5 comments3 min readLW link

(www.maximum-progress.com)

ACX Zwolle meetup

Shaedys14 Apr 2024 13:09 UTC

7 points

0 comments1 min readLW link

A quick experiment on LMs’ inductive biases in performing search

Alex Mallen14 Apr 2024 3:41 UTC

32 points

2 comments4 min readLW link

UDT1.01 Essential Miscellanea (4/10)

Diffractor14 Apr 2024 2:23 UTC

19 points

0 comments10 min readLW link

[Cosmology Talks] New Probability Axioms Could Fix Cosmology’s Multiverse (Partially) - Sylvia Wenmackers

mako yass14 Apr 2024 1:26 UTC

18 points

2 comments1 min readLW link

(www.youtube.com)

Speedrun ruiner research idea

lemonhope13 Apr 2024 23:42 UTC

2 points

11 comments2 min readLW link

Text Posts from the Kids Group: 2020

jefftk13 Apr 2024 22:30 UTC

69 points

3 comments19 min readLW link

(www.jefftk.com)

[Question] What convincing warning shot could help prevent extinction from AI?

Charbel-Raphaël and cozyfractal

13 Apr 2024 18:09 UTC

105 points

18 comments2 min readLW link

My experience at ML4Good AI Safety Bootcamp

TheManxLoiner13 Apr 2024 10:55 UTC

20 points

0 comments5 min readLW link

Consequentialism is a compass, not a judge

Neil 13 Apr 2024 10:47 UTC

26 points

6 comments2 min readLW link

Carl Sagan, nuking the moon, and not nuking the moon

eukaryote13 Apr 2024 4:08 UTC

103 points

8 comments6 min readLW link

(eukaryotewritesblog.com)

[Question] Barcoding LLM Training Data Subsets. Anyone trying this for interpretability?

right..enough?13 Apr 2024 3:09 UTC

7 points

0 comments7 min readLW link

Prompts for Big-Picture Planning

Raemon13 Apr 2024 3:04 UTC

72 points

1 comment3 min readLW link

Claude wants to be conscious

Joe Kwon13 Apr 2024 1:40 UTC

2 points

8 comments6 min readLW link

Things Solenoid Narrates

Solenoid_Entity12 Apr 2024 23:57 UTC

45 points

2 comments2 min readLW link

MIRI’s April 2024 Newsletter

Harlan12 Apr 2024 23:38 UTC

95 points

0 comments3 min readLW link

(intelligence.org)

Poker, Beef Wellington, and Mount Stupid

boghan12 Apr 2024 18:06 UTC

10 points

2 comments7 min readLW link

Forecasting

A*12 Apr 2024 17:55 UTC

4 points

0 comments1 min readLW link

Generalized Stat Mech: The Boltzmann Approach

David Lorell and johnswentworth

12 Apr 2024 17:47 UTC

68 points

7 comments20 min readLW link

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

aogara, Corin Katzke, Alexa Pan and Dan H

12 Apr 2024 16:10 UTC

13 points

0 comments9 min readLW link

(newsletter.safe.ai)

“How the Gaza Health Ministry Fakes Casualty Numbers”

CronoDAS12 Apr 2024 5:57 UTC

−10 points

9 comments1 min readLW link

(www.tabletmag.com)

UDT1.01: Plannable and Unplanned Observations (3/10)

Diffractor12 Apr 2024 5:24 UTC

31 points

0 comments7 min readLW link

Report: Evaluating an AI Chip Registration Policy

Deric Cheng12 Apr 2024 4:39 UTC

25 points

0 comments5 min readLW link

(www.convergenceanalysis.org)

Interference Issues

jefftk12 Apr 2024 2:30 UTC

17 points

1 comment3 min readLW link

(www.jefftk.com)

A D&D.Sci Dodecalogue

abstractapplic12 Apr 2024 1:10 UTC

54 points

0 comments3 min readLW link

[Question] Upcoming unambiguously good tech possibilities? (Like eg indoor plumbing)

lemonhope11 Apr 2024 23:14 UTC

9 points

6 comments1 min readLW link

Leave No Context Behind—A Comment

Gunnar_Zarncke11 Apr 2024 22:50 UTC

18 points

0 comments2 min readLW link

AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt

DanielFilan11 Apr 2024 21:30 UTC

69 points

10 comments107 min readLW link

ChatGPT defines 10 concrete terms: generically, for 5- and 11-year-olds, and for a scientist

Bill Benzon11 Apr 2024 20:27 UTC

3 points

9 comments6 min readLW link