All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 20242025

AllJan

All 1 2 3 4 567 8

[Question] Is “hidden complexity of wishes problem” solved?

Roman Malov5 Jan 2025 22:59 UTC

10 points

4 comments1 min readLW link

A Ground-Level Perspective on Capacity Building in International Development

Sean Aubin5 Jan 2025 20:36 UTC

10 points

1 comment8 min readLW link

Why Linear AI Safety Hits a Wall and How Fractal Intelligence Unlocks Non-Linear Solutions

Andy E Williams5 Jan 2025 17:08 UTC

−3 points

6 comments5 min readLW link

How to Do a PhD (in AI Safety)

Lewis Hammond5 Jan 2025 16:57 UTC

6 points

0 comments1 min readLW link

(lewishammond.com)

Reasons for and against working on technical AI safety at a frontier AI lab

bilalchughtai5 Jan 2025 14:49 UTC

89 points

12 comments12 min readLW link

Oppression and production are competing explanations for wealth inequality.

Benquo5 Jan 2025 14:13 UTC

32 points

15 comments8 min readLW link

(benjaminrosshoffman.com)

Maximizing Communication, not Traffic

jefftk5 Jan 2025 13:00 UTC

127 points

7 comments1 min readLW link

(www.jefftk.com)

Policymakers don’t have access to paywalled articles

Adam Jones5 Jan 2025 10:56 UTC

17 points

4 comments2 min readLW link

(adamjones.me)

Capital Ownership Will Not Prevent Human Disempowerment

beren5 Jan 2025 6:00 UTC

112 points

9 comments14 min readLW link

Chinese Researchers Crack ChatGPT: Replicating OpenAI’s Advanced AI Model

Evan_Gaensbauer5 Jan 2025 3:50 UTC

−8 points

1 comment1 min readLW link

(www.geeky-gadgets.com)

Orange and Strawberry Truffles

jefftk5 Jan 2025 1:50 UTC

10 points

1 comment1 min readLW link

(www.jefftk.com)

AXRP Episode 38.4 - Shakeel Hashim on AI Journalism

DanielFilan5 Jan 2025 0:20 UTC

9 points

0 comments12 min readLW link

How i’m building my ai system, how it’s going so far, and my thoughts on it

ollie_4 Jan 2025 18:20 UTC

−1 points

3 comments5 min readLW link

Parkinson’s Law and the Ideology of Statistics

Benquo4 Jan 2025 15:49 UTC

106 points

1 comment8 min readLW link

(benjaminrosshoffman.com)

Speedrunning Rationality: Day I

aproteinengine4 Jan 2025 14:28 UTC

5 points

0 comments1 min readLW link

The Laws of Large Numbers

Dmitry Vaintrob4 Jan 2025 11:54 UTC

31 points

6 comments12 min readLW link

The Golden Opportunity for American AI

Annapurna4 Jan 2025 10:26 UTC

2 points

2 comments1 min readLW link

(blogs.microsoft.com)

A Generalization of the Good Regulator Theorem

Alfred Harwood4 Jan 2025 9:55 UTC

20 points

5 comments9 min readLW link

Logic vs intuition ⇔ algorithm vs ML

pchvykov4 Jan 2025 9:06 UTC

5 points

0 comments7 min readLW link

debating buying NVDA in 2019

bhauth4 Jan 2025 5:06 UTC

23 points

0 comments3 min readLW link

(bhauth.com)

Making progress bars for Alignment

Kabir Kumar3 Jan 2025 21:25 UTC

0 points

0 comments1 min readLW link

(lu.ma)

The Intelligence Curse

lukedrago3 Jan 2025 19:07 UTC

85 points

26 comments18 min readLW link

(lukedrago.substack.com)

The case for pay-on-results coaching

Chipmonk3 Jan 2025 18:40 UTC

16 points

3 comments1 min readLW link

Introducing Squiggle AI

ozziegooen3 Jan 2025 17:53 UTC

79 points

13 comments1 min readLW link

Human study on AI spear phishing campaigns

Simon Lermen and Fred Heiding

3 Jan 2025 15:11 UTC

74 points

8 comments5 min readLW link

The subset parity learning problem: much more than you wanted to know

Dmitry Vaintrob3 Jan 2025 9:13 UTC

87 points

17 comments11 min readLW link

Building AI safety benchmark environments on themes of universal human values

Roland Pihlakas3 Jan 2025 4:24 UTC

17 points

3 comments8 min readLW link

(docs.google.com)

Emotional Superrationality

nullproxy2 Jan 2025 22:54 UTC

−6 points

4 comments11 min readLW link

Playing with Otamatones

jefftk2 Jan 2025 19:50 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

7. Iterate the Game: Racing Where?

Allison Duettmann2 Jan 2025 19:06 UTC

11 points

0 comments9 min readLW link

6. Increase Intelligence: Welcome AI Players

Allison Duettmann2 Jan 2025 19:06 UTC

6 points

1 comment19 min readLW link

5. Uphold Voluntarism: Digital Defense

Allison Duettmann2 Jan 2025 19:05 UTC

3 points

0 comments18 min readLW link

4. Uphold Voluntarism: Physical Defense

Allison Duettmann2 Jan 2025 19:04 UTC

5 points

2 comments23 min readLW link

3. Improve Cooperation: Better Technologies

Allison Duettmann2 Jan 2025 19:03 UTC

3 points

2 comments23 min readLW link

2. Skim the Manual: Intelligent Voluntary Cooperation

Allison Duettmann2 Jan 2025 19:02 UTC

12 points

0 comments18 min readLW link

1. Meet the Players: Value Diversity

Allison Duettmann2 Jan 2025 19:00 UTC

30 points

2 comments11 min readLW link

Preface

Allison Duettmann2 Jan 2025 18:59 UTC

26 points

1 comment7 min readLW link

The AI Agent Revolution: Beyond the Hype of 2025

DimaG2 Jan 2025 18:55 UTC

−7 points

1 comment28 min readLW link

On False Dichotomies

nullproxy2 Jan 2025 18:54 UTC

−3 points

0 comments5 min readLW link

Preference Inversion

Benquo2 Jan 2025 18:15 UTC

44 points

35 comments4 min readLW link

(benjaminrosshoffman.com)

Alignment Is Not All You Need

Adam Jones2 Jan 2025 17:50 UTC

40 points

10 comments6 min readLW link

(adamjones.me)

What’s the short timeline plan?

Marius Hobbhahn2 Jan 2025 14:59 UTC

269 points

36 comments23 min readLW link

AI #97: 4

Zvi2 Jan 2025 14:10 UTC

44 points

4 comments40 min readLW link

(thezvi.wordpress.com)

[Question] Can private companies test LVTs?

Yair Halberstadt2 Jan 2025 11:08 UTC

7 points

0 comments1 min readLW link

A pragmatic story about where we get our priors

Fiora from Rosebloom2 Jan 2025 10:16 UTC

13 points

6 comments3 min readLW link

Grammars, subgrammars, and combinatorics of generalization in transformers

Dmitry Vaintrob2 Jan 2025 9:37 UTC

36 points

0 comments17 min readLW link

[Question] 2025 Alignment Predictions

anaguma2 Jan 2025 5:37 UTC

3 points

3 comments1 min readLW link

Grading my 2024 AI predictions

Nikola Jurkovic2 Jan 2025 5:01 UTC

15 points

1 comment3 min readLW link

Practicing Bayesian Epistemology with “Two Boys” Probability Puzzles

Liron2 Jan 2025 4:42 UTC

42 points

13 comments6 min readLW link

Implications of Moral Realism on AI Safety

Myles H2 Jan 2025 2:58 UTC

7 points

1 comment3 min readLW link