All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 141516 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Where Does Adversarial Pressure Come From?

quetzal_rainbow14 Dec 2023 22:31 UTC

16 points

1 comment2 min readLW link

Epoch wise critical periods, and singular learning theory

Garrett Baker14 Dec 2023 20:55 UTC

9 points

1 comment5 min readLW link

OpenAI Superalignment: Weak-to-strong generalization

Dalmert14 Dec 2023 19:47 UTC

25 points

3 comments1 min readLW link

(openai.com)

Applications for EA Global are still open!

Eli_Nathan14 Dec 2023 19:10 UTC

1 point

0 comments1 min readLW link

Personal Development System: Winning Repeatedly and Growing Effectively With The BIG4

Paul Rohde14 Dec 2023 18:49 UTC

13 points

0 comments33 min readLW link

(blog.paul-rohde.com)

Introducing The ‘From Big Ideas To Real-World Results’: A Series for Effective Personal Development

Paul Rohde14 Dec 2023 18:49 UTC

13 points

1 comment8 min readLW link

(blog.paul-rohde.com)

Talking With People Who Speak to Congressional Staffers about AI risk

Eneasz14 Dec 2023 17:55 UTC

32 points

0 comments1 min readLW link

(www.thebayesianconspiracy.com)

Bayesian Injustice

Kevin Dorst14 Dec 2023 15:44 UTC

124 points

10 comments6 min readLW link

(kevindorst.substack.com)

AI #42: The Wrong Answer

Zvi14 Dec 2023 14:50 UTC

67 points

6 comments54 min readLW link

(thezvi.wordpress.com)

Some for-profit AI alignment org ideas

Eric Ho14 Dec 2023 14:23 UTC

86 points

19 comments9 min readLW link

Mapping the semantic void: Strange goings-on in GPT embedding spaces

mwatkins14 Dec 2023 13:10 UTC

114 points

31 comments14 min readLW link

Categorical Organization in Memory: ChatGPT Organizes the 665 Topic Tags from My New Savanna Blog

Bill Benzon14 Dec 2023 13:02 UTC

0 points

6 comments2 min readLW link

Moral Mountains

Adam Zerner14 Dec 2023 10:40 UTC

8 points

10 comments2 min readLW link

Update on Chinese IQ-related gene panels

Lao Mein14 Dec 2023 10:12 UTC

70 points

7 comments1 min readLW link

Red Line Ashmont Train is Now Approaching

jefftk14 Dec 2023 2:50 UTC

23 points

2 comments1 min readLW link

(www.jefftk.com)

Various AI doom pathways (and how likely they are)

Logan Zoellner14 Dec 2023 0:54 UTC

1 point

1 comment4 min readLW link

(midwitalignment.substack.com)

Are There Examples of Overhang for Other Technologies?

Jeffrey Heninger13 Dec 2023 21:48 UTC

59 points

50 comments11 min readLW link

(blog.aiimpacts.org)

Is being sexy for your homies?

Valentine13 Dec 2023 20:37 UTC

169 points

97 comments14 min readLW link 2 reviews

How bad is chlorinated water?

bhauth13 Dec 2023 18:00 UTC

43 points

18 comments3 min readLW link

(www.bhauth.com)

[Question] Suggestions for net positive LLM research

Cole Wyeth13 Dec 2023 17:29 UTC

13 points

6 comments1 min readLW link

AI Control: Improving Safety Despite Intentional Subversion

Buck, Fabien Roger, ryan_greenblatt and Kshitij Sachan

13 Dec 2023 15:51 UTC

228 points

18 comments10 min readLW link 1 review

The Busy Bee Brain

Bill Benzon13 Dec 2023 13:10 UTC

11 points

0 comments6 min readLW link

The Best of Don’t Worry About the Vase

Zvi13 Dec 2023 12:50 UTC

55 points

4 comments13 min readLW link

(thezvi.wordpress.com)

[Question] Has anyone here investigated the occult community? It is curious to me that many magicians consider themselves empiricists.

SpectrumDT13 Dec 2023 11:09 UTC

5 points

10 comments1 min readLW link

AI Views Snapshots

Rob Bensinger13 Dec 2023 0:45 UTC

142 points

61 comments1 min readLW link

The convergent dynamic we missed

Remmelt12 Dec 2023 23:19 UTC

2 points

2 comments1 min readLW link

A Kindness, or The Inevitable Consequence of Perfect Inference (a short story)

samhealy12 Dec 2023 23:03 UTC

6 points

0 comments9 min readLW link

Love, Reverence, and Life

Elizabeth and Tristan Williams

12 Dec 2023 21:49 UTC

36 points

9 comments28 min readLW link 2 reviews

Taboo “procrastination”

Neil 12 Dec 2023 21:33 UTC

19 points

7 comments1 min readLW link

Enhancing intelligence by banging your head on the wall

Bezzi12 Dec 2023 21:00 UTC

37 points

26 comments1 min readLW link

Yamaha P-Series Overview

jefftk12 Dec 2023 20:30 UTC

10 points

1 comment1 min readLW link

(www.jefftk.com)

Balsa Update and General Thank You

Zvi12 Dec 2023 20:30 UTC

61 points

8 comments8 min readLW link

(thezvi.wordpress.com)

Towards an Ethics Calculator for Use by an AGI

sweenesm12 Dec 2023 18:37 UTC

3 points

2 comments11 min readLW link

Why Psychologists Are Wrong About The Illusion Of Explanatory Depth

moses onyedikachukwu12 Dec 2023 18:32 UTC

1 point

0 comments4 min readLW link

A design concept for superintelligent machines (and Popper’s critique of induction)

tiplur-bilrex12 Dec 2023 18:31 UTC

−7 points

6 comments1 min readLW link

(tiplur-bilrex.tlon.network)

Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible

GeneSmith and kman

12 Dec 2023 18:14 UTC

436 points

189 comments33 min readLW link

[Question] Why No Automated Plagerism Detection For Past Papers?

Lao Mein12 Dec 2023 17:24 UTC

7 points

10 comments1 min readLW link

OpenAI: Leaks Confirm the Story

Zvi12 Dec 2023 14:00 UTC

77 points

9 comments16 min readLW link

(thezvi.wordpress.com)

Navigating the Attackspace

Jonas Kgomo12 Dec 2023 13:59 UTC

1 point

0 comments2 min readLW link

Nonlinear’s Evidence: Debunking False and Misleading Claims

KatWoods12 Dec 2023 13:16 UTC

104 points

171 comments1 min readLW link

AI Institution Design Hackathon (EAG Bay Area Satellite Event)

beatrice@foresight.org and Allison Duettmann

12 Dec 2023 13:10 UTC

1 point

0 comments1 min readLW link

Funding case: AI Safety Camp

Remmelt and Linda Linsefors

12 Dec 2023 9:08 UTC

66 points

5 comments6 min readLW link

(manifund.org)

What is the next level of rationality?

lsusr and Yoav Ravid

12 Dec 2023 8:14 UTC

48 points

24 comments7 min readLW link

Embedded Agents are Quines

lsusr and DaemonicSigil

12 Dec 2023 4:57 UTC

11 points

7 comments8 min readLW link

Predict the future! Earn fake internet points! Get a (free) gambling addiction!

Robert Cousineau12 Dec 2023 4:39 UTC

3 points

0 comments1 min readLW link

The likely first longevity drug is based on sketchy science. This is bad for science and bad for longevity.

BobBurgers12 Dec 2023 2:42 UTC

161 points

34 comments5 min readLW link

When will GPT-5 come out? Prediction markets vs. Extrapolation

Malte12 Dec 2023 2:41 UTC

12 points

9 comments3 min readLW link

On plans for a functional society

kave and Vaniver

12 Dec 2023 0:07 UTC

41 points

8 comments13 min readLW link

Secondary Risk Markets

Vaniver11 Dec 2023 21:52 UTC

35 points

4 comments4 min readLW link

Has anyone experimented with Dodrio, a tool for exploring transformer models through interactive visualization?

Bill Benzon11 Dec 2023 20:34 UTC

4 points

0 comments1 min readLW link