[Question] Karma votes: blind to or accounting for score?

cata

22 Jun 2024 21:40 UTC

19 points

4 comments · 1 min read · LW link

[Question] Should effective altruism be more “cool”?

jaredmantell

22 Jun 2024 20:42 UTC

3 points

3 comments · 1 min read · LW link

Meta Alignment: Communication Wack-a-Mole

Bridgett Kay

22 Jun 2024 20:12 UTC

16 points

2 comments · 5 min read · LW link

(dxmrevealed.wordpress.com)

AI as a computing platform: what to expect

Jonasb

22 Jun 2024 19:55 UTC

−3 points

0 comments · 7 min read · LW link

(www.denominations.io)

Expected number of tries

adios

22 Jun 2024 19:22 UTC

6 points

0 comments · 2 min read · LW link

Applying Force to the Wrong End of a Causal Chain

silentbob

22 Jun 2024 18:06 UTC

40 points

0 comments · 9 min read · LW link

Bed Time Quests & Dinner Games for 3-5 year olds

Gunnar_Zarncke and Shoshannah Tekofsky

22 Jun 2024 7:53 UTC

51 points

0 comments · 1 min read · LW link

(kidquest.substack.com)

Appraising aggregativism and utilitarianism

Cleo Nardo

21 Jun 2024 23:10 UTC

27 points

10 comments · 19 min read · LW link

Best-of-n with misaligned reward models for Math reasoning

Fabien Roger

21 Jun 2024 22:53 UTC

24 points

0 comments · 3 min read · LW link

No really, the Sticker Shortcut fallacy is indeed a fallacy

ymeskhout

21 Jun 2024 22:27 UTC

11 points

2 comments · 5 min read · LW link

(www.ymeskhout.com)

Sarajevo 1914: Black Swan Questions

JohnBuridan

21 Jun 2024 21:27 UTC

8 points

0 comments · 2 min read · LW link

Yudkowsky is too optimistic about how AI will treat humans.

ProfessorFalken

21 Jun 2024 19:01 UTC

0 points

1 comment · 1 min read · LW link

Juneberry Puffs

jefftk

21 Jun 2024 18:50 UTC

15 points

0 comments · 1 min read · LW link

(www.jefftk.com)

Let’s Design a School, Part 3.2 Costs

Sable

21 Jun 2024 17:58 UTC

8 points

0 comments · 5 min read · LW link

(affablyevil.substack.com)

2022 AI Alignment Course: 5→37% working on AI safety

Dewi

21 Jun 2024 17:45 UTC

7 points

3 comments · 3 min read · LW link

Some Thoughts on AI Alignment: Using AI to Control AI

eigenvalue

21 Jun 2024 17:44 UTC

1 point

1 comment · 1 min read · LW link

(github.com)

What distinguishes “early”, “mid” and “end” games?

Raemon

21 Jun 2024 17:41 UTC

47 points

22 comments · 1 min read · LW link

Nuclear War, Map and Territory, Values | Guild of the Rose Newsletter, May 2024

moridinamael

21 Jun 2024 17:39 UTC

18 points

0 comments · 4 min read · LW link

(guildoftherose.org)

AI governance needs a theory of victory

Corin Katzke and Justin Bullock

21 Jun 2024 16:15 UTC

34 points

6 comments · 1 min read · LW link

(www.convergenceanalysis.org)

Connecting the Dots: LLMs can Infer & Verbalize Latent Structure from Training Data

Johannes Treutlein and Owain_Evans

21 Jun 2024 15:54 UTC

160 points

13 comments · 8 min read · LW link

(arxiv.org)

On OpenAI’s Model Spec

Zvi

21 Jun 2024 13:00 UTC

46 points

3 comments · 30 min read · LW link

(thezvi.wordpress.com)

Attention Output SAEs Improve Circuit Analysis

Connor Kissane, robertzk, Arthur Conmy and Neel Nanda

21 Jun 2024 12:56 UTC

33 points

1 comment · 19 min read · LW link

“Newton’s laws” of finance

pchvykov

21 Jun 2024 9:41 UTC

9 points

3 comments · 10 min read · LW link

Capitalising On Trust—A Simulation

James Stephen Brown

21 Jun 2024 4:43 UTC

2 points

0 comments · 1 min read · LW link

(nonzerosum.games)

“… than average” is (almost) meaningless

jwfiredragon

21 Jun 2024 4:42 UTC

16 points

6 comments · 3 min read · LW link

The Kernel of Meaning in Property Rights

Abhimanyu Pallavi Sudhir

21 Jun 2024 1:12 UTC

7 points

6 comments · 2 min read · LW link

Enriched tab is now the default LW Frontpage experience for logged-in users

Ruby and RobertM

21 Jun 2024 0:09 UTC

46 points

27 comments · 3 min read · LW link

Debate, Oracles, and Obfuscated Arguments

Jonah Brown-Cohen and Geoffrey Irving

20 Jun 2024 23:14 UTC

40 points

2 comments · 21 min read · LW link

Evaporation of improvements

Viliam

20 Jun 2024 18:34 UTC

28 points

27 comments · 2 min read · LW link

Interpreting and Steering Features in Images

Gytis Daujotas

20 Jun 2024 18:33 UTC

65 points

6 comments · 5 min read · LW link

Claude 3.5 Sonnet

Zach Stein-Perlman

20 Jun 2024 18:00 UTC

75 points

41 comments · 1 min read · LW link

(www.anthropic.com)

[Question] What is going to happen in a case of an AGI era where humans are out of the game?

Cipolla

20 Jun 2024 17:44 UTC

−2 points

1 comment · 1 min read · LW link

Jailbreak steering generalization

Sarah Ball and Nina Panickssery

20 Jun 2024 17:25 UTC

41 points

4 comments · 2 min read · LW link

(arxiv.org)

Case studies on social-welfare-based standards in various industries

HoldenKarnofsky

20 Jun 2024 13:33 UTC

42 points

0 comments · 1 min read · LW link

AI #69: Nice

Zvi

20 Jun 2024 12:40 UTC

65 points

9 comments · 51 min read · LW link

(thezvi.wordpress.com)

Niche product design

Itay Dreyfus

20 Jun 2024 6:34 UTC

2 points

1 comment · 3 min read · LW link

(productidentity.co)

Data on AI

Robi Rahman, Jaime Sevilla Molina, Pablo Villalobos and Ben Cottier

20 Jun 2024 6:31 UTC

1 point

0 comments · 1 min read · LW link

(epochai.org)

Actually, Power Plants May Be an AI Training Bottleneck.

Lao Mein

20 Jun 2024 4:41 UTC

83 points

13 comments · 2 min read · LW link

Proposing the Post-Singularity Symbiotic Researches

Hiroshi Yamakawa

20 Jun 2024 4:05 UTC

5 points

0 comments · 12 min read · LW link

Week One of Studying Transformers Architecture

JustisMills

20 Jun 2024 3:47 UTC

3 points

0 comments · 15 min read · LW link

(justismills.substack.com)

[Question] What are things you’re allowed to do as a startup?

Elizabeth

20 Jun 2024 0:01 UTC

30 points

9 comments · 1 min read · LW link

LessWrong/ACX meetup Transilvanya tour—Alba Iulia

Marius Adrian Nicoară

19 Jun 2024 19:56 UTC

1 point

1 comment · 1 min read · LW link

Chronic perfectionism through the eyes of school reports

Stuart Johnson

19 Jun 2024 17:46 UTC

13 points

3 comments · 1 min read · LW link

Ilya Sutskever created a new AGI startup

harfe

19 Jun 2024 17:17 UTC

95 points

35 comments · 1 min read · LW link

(ssi.inc)

Beyond the Board: Exploring AI Robustness Through Go

AdamGleave

19 Jun 2024 16:40 UTC

41 points

2 comments · 1 min read · LW link

(far.ai)

A study on cults and non-cults—answer questions about a group and get a cult score

spencerg

19 Jun 2024 14:30 UTC

1 point

8 comments · 1 min read · LW link

(www.guidedtrack.com)

Workshop: data analysis for software engineers

Derek M. Jones

19 Jun 2024 14:20 UTC

2 points

0 comments · 1 min read · LW link

FLEXIBLE AND ADAPTABLE LLM’s WITH CONTINUOUS SELF TRAINING

Escaque 66

19 Jun 2024 14:17 UTC

−11 points

0 comments · 3 min read · LW link

Surviving Seveneves

Yair Halberstadt

19 Jun 2024 13:11 UTC

41 points

4 comments · 11 min read · LW link

Self responsibility

Elo

19 Jun 2024 10:17 UTC

17 points

3 comments · 2 min read · LW link