All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 202122 23 24 25 26 27 28 29 30 31

Slim overview of work one could do to make AI go better (and a grab-bag of other career considerations)

Chi Nguyen20 Mar 2024 23:17 UTC

9 points

1 comment1 min readLW link

How does AI solve problems?

Dom Polsinelli20 Mar 2024 22:29 UTC

2 points

0 comments7 min readLW link

What I Learned (Conclusion To “The Sense Of Physical Necessity”)

LoganStrohl20 Mar 2024 21:24 UTC

34 points

0 comments3 min readLW link

Stagewise Development in Neural Networks

Jesse Hoogland, Liam Carroll and Daniel Murfet

20 Mar 2024 19:54 UTC

90 points

1 comment11 min readLW link

On the Gladstone Report

Zvi20 Mar 2024 19:50 UTC

64 points

11 comments40 min readLW link

(thezvi.wordpress.com)

Natural Latents: The Concepts

johnswentworth and David Lorell

20 Mar 2024 18:21 UTC

87 points

18 comments19 min readLW link

Comparing Alignment to other AGI interventions: Basic model

Martín Soto20 Mar 2024 18:17 UTC

12 points

4 comments7 min readLW link

AI-generated opioids could be a catastrophic risk

ejk6420 Mar 2024 17:48 UTC

0 points

2 comments3 min readLW link

New report: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC

89 points

14 comments1 min readLW link

(twitter.com)

User-inclination-guessing algorithms: registering a goal

ProgramCrafter20 Mar 2024 15:55 UTC

2 points

0 comments2 min readLW link

My MATS Summer 2023 experience

James Chua20 Mar 2024 11:26 UTC

29 points

0 comments3 min readLW link

(jameschua.net)

[Question] What are the weirdest things a human may want for their own sake?

Mateusz Bagiński20 Mar 2024 11:15 UTC

7 points

16 comments1 min readLW link

[Question] Best organization red-pill books and posts?

lemonhope20 Mar 2024 7:01 UTC

10 points

2 comments1 min readLW link

Parent-Friendly Dance Weekends

jefftk20 Mar 2024 2:10 UTC

16 points

0 comments2 min readLW link

(www.jefftk.com)

[Question] “I Can’t Believe It Both Is and Is Not Encephalitis!” Or: What do you do when the evidence is crazy?

Erhannis19 Mar 2024 22:08 UTC

20 points

3 comments11 min readLW link

Delta’s of Change

Jonas Kgomo19 Mar 2024 21:03 UTC

1 point

0 comments4 min readLW link

Increasing IQ by 10 Points is Possible

George3d619 Mar 2024 20:48 UTC

23 points

50 comments5 min readLW link

(morelucid.substack.com)

Are extreme probabilities for P(doom) epistemically justifed?

NathanBarnard and Alexander Gietelink Oldenziel

19 Mar 2024 20:32 UTC

20 points

12 comments7 min readLW link

Have I Solved the Two Envelopes Problem Once and For All?

JackOfAllTrades19 Mar 2024 19:57 UTC

−6 points

5 comments3 min readLW link

[Question] How can one be less wrong, if their conversation partner loses the interest on discussing the topic with them?

Ooker19 Mar 2024 18:11 UTC

−10 points

3 comments1 min readLW link

Carlo: uncertainty analysis in Google Sheets

ProbabilityEnjoyer19 Mar 2024 17:59 UTC

6 points

0 comments1 min readLW link

(carlo.app)

NAIRA—An exercise in regulatory, competitive safety governance [AI Governance Institutional Design idea]

Heramb19 Mar 2024 17:43 UTC

2 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

AI Safety Evaluations: A Regulatory Review

Elliot Mckernon and Deric Cheng

19 Mar 2024 15:05 UTC

22 points

1 comment11 min readLW link

Mechanism for feature learning in neural networks and backpropagation-free machine learning models

Matt Goldenberg19 Mar 2024 14:55 UTC

8 points

1 comment1 min readLW link

(www.science.org)

Monthly Roundup #16: March 2024

Zvi19 Mar 2024 13:10 UTC

33 points

4 comments55 min readLW link

(thezvi.wordpress.com)

Experimentation (Part 7 of “The Sense Of Physical Necessity”)

LoganStrohl18 Mar 2024 21:25 UTC

33 points

0 comments10 min readLW link

INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park

jacobhaimes18 Mar 2024 21:21 UTC

5 points

0 comments1 min readLW link

(into-ai-safety.github.io)

Neuroscience and Alignment

Garrett Baker18 Mar 2024 21:09 UTC

40 points

25 comments2 min readLW link

GPT, the magical collaboration zone, Lex Fridman and Sam Altman

Bill Benzon18 Mar 2024 20:04 UTC

3 points

1 comment3 min readLW link

Measuring Coherence of Policies in Toy Environments

dx26 and Richard_Ngo

18 Mar 2024 17:59 UTC

59 points

9 comments14 min readLW link

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Neel Nanda, János Kramár, Tom Lieberum and Rohin Shah

18 Mar 2024 17:28 UTC

19 points

0 comments1 min readLW link

(arxiv.org)

Community Notes by X

NicholasKees18 Mar 2024 17:13 UTC

124 points

15 comments7 min readLW link

[Question] Is the Basilisk pretending to be hidden in this simulation so that it can check what I would do if conditioned by a world without the Basilisk?

maybefbi18 Mar 2024 16:05 UTC

−18 points

1 comment1 min readLW link

On Devin

Zvi18 Mar 2024 13:20 UTC

148 points

34 comments11 min readLW link

(thezvi.wordpress.com)

RLLMv10 experiment

MiguelDev18 Mar 2024 8:32 UTC

5 points

0 comments2 min readLW link

Join the AI Evaluation Tasks Bounty Hackathon

Esben Kran18 Mar 2024 8:15 UTC

12 points

1 comment1 min readLW link

5 Physics Problems

DaemonicSigil and Muireall

18 Mar 2024 8:05 UTC

60 points

0 comments15 min readLW link

Inferring the model dimension of API-protected LLMs

Ege Erdil18 Mar 2024 6:19 UTC

34 points

3 comments4 min readLW link

(arxiv.org)

AI strategy given the need for good reflection

owencb18 Mar 2024 0:48 UTC

7 points

0 comments1 min readLW link

XAI releases Grok base model

Jacob G-W18 Mar 2024 0:47 UTC

11 points

3 comments1 min readLW link

(x.ai)

Toki pona FAQ

dkl917 Mar 2024 21:44 UTC

36 points

8 comments1 min readLW link

(dkl9.net)

EA ErFiN Project work

Max_He-Ho17 Mar 2024 20:42 UTC

2 points

0 comments1 min readLW link

EA ErFiN Project work

Max_He-Ho17 Mar 2024 20:37 UTC

2 points

0 comments1 min readLW link

[Question] Alice and Bob is debating on a technique. Alice says Bob should try it before denying it. Is it a fallacy or something similar?

Ooker17 Mar 2024 20:01 UTC

0 points

19 comments2 min readLW link

Is there a way to calculate the P(we are in a 2nd cold war)?

cloak17 Mar 2024 20:01 UTC

−9 points

2 comments1 min readLW link

The Worst Form Of Government (Except For Everything Else We’ve Tried)

johnswentworth17 Mar 2024 18:11 UTC

134 points

47 comments4 min readLW link

Applying simulacrum levels to hobbies, interests and goals

DMMF17 Mar 2024 16:18 UTC

15 points

2 comments4 min readLW link

(danfrank.ca)

What is the best argument that LLMs are shoggoths?

JoshuaFox17 Mar 2024 11:36 UTC

26 points

22 comments1 min readLW link

Invitation to the Princeton AI Alignment and Safety Seminar

Sadhika Malladi17 Mar 2024 1:10 UTC

6 points

1 comment1 min readLW link

Anxiety vs. Depression

Sable17 Mar 2024 0:15 UTC

85 points

35 comments3 min readLW link

(affablyevil.substack.com)