The Problem with Reasoners by Aidan McLaughlin

t14n · 25 Nov 2024 20:24 UTC
7 points
1 comment · 1 min read · LW link
(aidanmclaughlin.notion.site)

Locally optimal psychology

Chipmonk · 25 Nov 2024 18:35 UTC
37 points
7 comments · 2 min read · LW link
(twitter.com)

a space habitat design

bhauth · 25 Nov 2024 17:28 UTC
53 points
13 comments · 9 min read · LW link
(bhauth.com)

Arthropod (non) sentience

Arturo Macias · 25 Nov 2024 16:01 UTC
9 points
8 comments · 4 min read · LW link

Crosspost: Developing the middle ground on polarized topics

juliawise · 25 Nov 2024 14:39 UTC
13 points
16 comments · 3 min read · LW link

Two flavors of computational functionalism

EuanMcLean · 25 Nov 2024 10:47 UTC
28 points
9 comments · 4 min read · LW link

Alignment is not intelligent

Donatas Lučiūnas · 25 Nov 2024 6:59 UTC
−17 points
18 comments · 5 min read · LW link

Zaragoza ACX/LW Meetup

Fernand0 · 25 Nov 2024 6:56 UTC
1 point
0 comments · 1 min read · LW link

A better “Statement on AI Risk?”

Knight Lee · 25 Nov 2024 4:50 UTC
4 points
4 comments · 3 min read · LW link

Reflections on ML4Good

james__p · 25 Nov 2024 2:40 UTC
12 points
0 comments · 1 min read · LW link

AI Specialized in ML Training Could Create ASI: AGI Is Unnecessary

satopi · 25 Nov 2024 2:31 UTC
−5 points
1 comment · 1 min read · LW link

I, Token

Ivan Vendrov · 25 Nov 2024 2:20 UTC
14 points
2 comments · 3 min read · LW link
(nothinghuman.substack.com)

Passages I Highlighted in The Letters of J.R.R. Tolkien

Ivan Vendrov · 25 Nov 2024 1:47 UTC
116 points
10 comments · 31 min read · LW link

Decorated pedestrian tunnels

dkl9 · 24 Nov 2024 22:16 UTC
0 points
3 comments · 1 min read · LW link
(dkl9.net)

Gothenburg LW/ACX meetup

Stefan · 24 Nov 2024 19:40 UTC
2 points
0 comments · 1 min read · LW link

[Question] Are You More Real If You’re Really Forgetful?

Thane Ruthenis · 24 Nov 2024 19:30 UTC
39 points
25 comments · 5 min read · LW link

Perils of Generalizing from One’s Social Group

localdeity · 24 Nov 2024 15:31 UTC
64 points
1 comment · 3 min read · LW link

Disentangling Representations through Multi-task Learning

Bogdan Ionut Cirstea · 24 Nov 2024 13:10 UTC
14 points
1 comment · 1 min read · LW link
(arxiv.org)

The U.S. National Security State is Here to Make AI Even Less Transparent and Accountable

Matrice Jacobine · 24 Nov 2024 9:36 UTC
0 points
0 comments · 2 min read · LW link
(www.eff.org)

Mechanistic Interpretability of Llama 3.2 with Sparse Autoencoders

PaulPauls · 24 Nov 2024 5:45 UTC
20 points
3 comments · 1 min read · LW link
(github.com)

SB-1047, ChatGPT and AI’s Game of Thrones

Rahul Chand · 24 Nov 2024 2:29 UTC
−3 points
1 comment · 13 min read · LW link

Beyond Gaussian: Language Model Representations and Distributions

Matt Levinson · 24 Nov 2024 1:53 UTC
5 points
1 comment · 5 min read · LW link

How Universal Basic Income Could Help Us Build a Brighter Future

Yanling Guo · 23 Nov 2024 22:03 UTC
−13 points
13 comments · 3 min read · LW link

Compute and size limits on AI are the actual danger

Shmi · 23 Nov 2024 21:29 UTC
31 points
5 comments · 2 min read · LW link

Paradigm Shifts—change everything… except almost everything

James Stephen Brown · 23 Nov 2024 18:34 UTC
1 point
0 comments · 3 min read · LW link
(nonzerosum.games)

A Sober Look at Steering Vectors for LLMs

23 Nov 2024 17:30 UTC
31 points
0 comments · 5 min read · LW link

Text Posts from the Kids Group: 2018

jefftk · 23 Nov 2024 12:50 UTC
20 points
0 comments · 24 min read · LW link
(www.jefftk.com)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types

Bogdan Ionut Cirstea · 23 Nov 2024 12:45 UTC
11 points
0 comments · 1 min read · LW link

On The Rationalist Megameetup

Screwtape · 23 Nov 2024 9:08 UTC
25 points
3 comments · 10 min read · LW link

[Question] Have we seen any “ReLU instead of sigmoid-type improvements” recently?

KvmanThinking · 23 Nov 2024 3:51 UTC
2 points
4 comments · 1 min read · LW link

A few questions about recent developments in EA

Peter Berggren · 23 Nov 2024 2:36 UTC
24 points
12 comments · 2 min read · LW link

Paraddictions: unreasonably compelling behaviors and their uses

Michael Cohn · 22 Nov 2024 20:53 UTC
13 points
0 comments · 6 min read · LW link

Literacy Rates Haven’t Fallen By 20% Since the Department of Education Was Created

Maxwell Tabarrok · 22 Nov 2024 20:53 UTC
44 points
0 comments · 3 min read · LW link
(www.maximum-progress.com)

Plausibly Factoring Conjectures

22 Nov 2024 20:11 UTC
22 points
1 comment · 10 min read · LW link

Optimizing Problem-Solving Strategies Through Prediction Markets

patrik-cihal · 22 Nov 2024 19:58 UTC
1 point
0 comments · 2 min read · LW link

Doing Research Part-Time is Great

casualphysicsenjoyer · 22 Nov 2024 19:01 UTC
37 points
7 comments · 5 min read · LW link

Rethinking Laplace’s Rule of Succession

Cleo Nardo · 22 Nov 2024 18:46 UTC
9 points
5 comments · 2 min read · LW link

(Salt) Water Gargling as an Antiviral

Elizabeth · 22 Nov 2024 18:00 UTC
88 points
6 comments · 5 min read · LW link
(acesounderglass.com)

The Manufactured Crisis: How Society Is Willingly Tying Its Own Noose

PROPHET · 22 Nov 2024 17:45 UTC
−2 points
2 comments · 8 min read · LW link

Sideloading: creating a model of a person via LLM with very large prompt

22 Nov 2024 16:41 UTC
12 points
4 comments · 35 min read · LW link

Neuroscience of human social instincts: a sketch

Steven Byrnes · 22 Nov 2024 16:16 UTC
55 points
0 comments · 31 min read · LW link

Rebutting Every Objection To Giving To The Shrimp Welfare Project

omnizoid · 22 Nov 2024 16:12 UTC
−2 points
0 comments · 8 min read · LW link

A very strange probability paradox

notfnofn · 22 Nov 2024 14:01 UTC
90 points
26 comments · 9 min read · LW link

The boat

RomanS · 22 Nov 2024 12:56 UTC
3 points
0 comments · 2 min read · LW link

[Question] Which things were you surprised to learn are metaphors?

Gordon Seidoh Worley · 22 Nov 2024 3:46 UTC
28 points
18 comments · 1 min read · LW link

LLM chatbots have ~half of the kinds of “consciousness” that humans believe in. Humans should avoid going crazy about that.

Andrew_Critch · 22 Nov 2024 3:26 UTC
77 points
53 comments · 5 min read · LW link

Reading RFK Jr so that you don’t have to

braces · 22 Nov 2024 0:59 UTC
56 points
1 comment · 8 min read · LW link

Don’t want Goodhart? — Specify the damn variables

Yan Lyutnev · 21 Nov 2024 22:45 UTC
−3 points
2 comments · 5 min read · LW link

Don’t want Goodhart? — Specify the variables more

YanLyutnev · 21 Nov 2024 22:43 UTC
3 points
2 comments · 5 min read · LW link

Aligning AI Safety Projects with a Republican Administration

Deric Cheng · 21 Nov 2024 22:12 UTC
29 points
1 comment · 8 min read · LW link