Seattle Winter Solstice · a7x · Dec 20, 2023, 8:30 PM · 6 points · 1 comment · 1 min read · LW link
How Would an Utopia-Maximizer Look Like? · Thane Ruthenis · Dec 20, 2023, 8:01 PM · 31 points · 23 comments · 10 min read · LW link
Succession · Richard_Ngo · Dec 20, 2023, 7:25 PM · 159 points · 48 comments · 11 min read · LW link (www.narrativeark.xyz)
Metaculus Introduces Multiple Choice Questions · ChristianWilliams · Dec 20, 2023, 7:00 PM · 4 points · 0 comments · 1 min read · LW link (www.metaculus.com)
Brighter Than Today Versions · jefftk · Dec 20, 2023, 6:20 PM · 16 points · 2 comments · 2 min read · LW link (www.jefftk.com)
Gaia Network: a practical, incremental pathway to Open Agency Architecture · Roman Leventov and Rafael Kaufmann Nedal · Dec 20, 2023, 5:11 PM · 22 points · 8 comments · 16 min read · LW link
On the future of language models · owencb · Dec 20, 2023, 4:58 PM · 105 points · 17 comments · 1 min read · LW link
[Valence series] Appendix A: Hedonic tone / (dis)pleasure / (dis)liking · Steven Byrnes · Dec 20, 2023, 3:54 PM · 18 points · 0 comments · 13 min read · LW link
Matrix completion prize results · paulfchristiano · Dec 20, 2023, 3:40 PM · 41 points · 0 comments · 2 min read · LW link (www.alignment.org)
[Question] What’s the minimal additive constant for Kolmogorov Complexity that a programming language can achieve? · Noosphere89 · Dec 20, 2023, 3:36 PM · 11 points · 15 comments · 1 min read · LW link
Legalize butanol? · bhauth · Dec 20, 2023, 2:24 PM · 39 points · 20 comments · 5 min read · LW link (www.bhauth.com)
A short dialogue on comparability of values · cousin_it · Dec 20, 2023, 2:08 PM · 27 points · 7 comments · 1 min read · LW link
Inside View, Outside View… And Opposing View · chaosmage · Dec 20, 2023, 12:35 PM · 21 points · 1 comment · 5 min read · LW link
Heuristics for preventing major life mistakes · SK2 · Dec 20, 2023, 8:01 AM · 28 points · 2 comments · 3 min read · LW link
What should be reified? · herschel · Dec 20, 2023, 4:52 AM · 4 points · 2 comments · 2 min read · LW link (brothernin.substack.com)
(In)appropriate (De)reification · herschel · Dec 20, 2023, 4:51 AM · 10 points · 1 comment · 4 min read · LW link (brothernin.substack.com)
Escaping Skeuomorphism · Stuart Johnson · Dec 20, 2023, 3:51 AM · 28 points · 0 comments · 8 min read · LW link
Ronny and Nate discuss what sorts of minds humanity is likely to find by Machine Learning · So8res and Ronny Fernandez · Dec 19, 2023, 11:39 PM · 40 points · 30 comments · 25 min read · LW link
[Question] What are the best Siderea posts? · mike_hawke · Dec 19, 2023, 11:07 PM · 17 points · 2 comments · 1 min read · LW link
Meaning & Agency · abramdemski · Dec 19, 2023, 10:27 PM · 91 points · 17 comments · 14 min read · LW link
s/acc: Safe Accelerationism Manifesto · lorepieri · Dec 19, 2023, 10:19 PM · −4 points · 5 comments · 2 min read · LW link (lorenzopieri.com)
Don’t Share Information Exfohazardous on Others’ AI-Risk Models · Thane Ruthenis · Dec 19, 2023, 8:09 PM · 66 points · 11 comments · 1 min read · LW link
Paper: Tell, Don’t Show- Declarative facts influence how LLMs generalize · Owain_Evans and AlexMeinke · Dec 19, 2023, 7:14 PM · 45 points · 4 comments · 6 min read · LW link (arxiv.org)
Interview: Applications w/ Alice Rigg · jacobhaimes · Dec 19, 2023, 7:03 PM · 12 points · 0 comments · 1 min read · LW link (into-ai-safety.github.io)
How does a toy 2 digit subtraction transformer predict the sign of the output? · Evan Anders · Dec 19, 2023, 6:56 PM · 14 points · 0 comments · 8 min read · LW link (evanhanders.blog)
Incremental AI Risks from Proxy-Simulations · kmenou · Dec 19, 2023, 6:56 PM · 2 points · 0 comments · 1 min read · LW link (individual.utoronto.ca)
A proposition for the modification of our epistemology · JacobBowden · Dec 19, 2023, 6:55 PM · −4 points · 2 comments · 4 min read · LW link
Goal-Completeness is like Turing-Completeness for AGI · Liron · Dec 19, 2023, 6:12 PM · 50 points · 26 comments · 3 min read · LW link
SociaLLM: proposal for a language model design for personalised apps, social science, and AI safety research · Roman Leventov · Dec 19, 2023, 4:49 PM · 17 points · 5 comments · 3 min read · LW link
Chording “The Next Right Thing” · jefftk · Dec 19, 2023, 3:40 PM · 11 points · 0 comments · 2 min read · LW link (www.jefftk.com)
Monthly Roundup #13: December 2023 · Zvi · Dec 19, 2023, 3:10 PM · 32 points · 5 comments · 26 min read · LW link (thezvi.wordpress.com)
Effective Aspersions: How the Nonlinear Investigation Went Wrong · TracingWoodgrains · Dec 19, 2023, 12:00 PM · 188 points · 171 comments · 1 min read · LW link · 1 review
A Universal Emergent Decomposition of Retrieval Tasks in Language Models · Alexandre Variengien and Eric Winsor · Dec 19, 2023, 11:52 AM · 84 points · 3 comments · 10 min read · LW link (arxiv.org)
Assessment of AI safety agendas: think about the downside risk · Roman Leventov · Dec 19, 2023, 9:00 AM · 13 points · 1 comment · 1 min read · LW link
Constellations are Younger than Continents · Jeffrey Heninger · Dec 19, 2023, 6:12 AM · 261 points · 22 comments · 2 min read · LW link
The Dark Arts · lsusr and Lyrongolem · Dec 19, 2023, 4:41 AM · 132 points · 49 comments · 9 min read · LW link
When scientists consider whether their research will end the world · Harlan · Dec 19, 2023, 3:47 AM · 30 points · 4 comments · 11 min read · LW link (blog.aiimpacts.org)
Is the far future inevitably zero sum? · Srdjan Miletic · Dec 19, 2023, 1:45 AM · 8 points · 2 comments · 2 min read · LW link (dissent.blog)
The ‘Neglected Approaches’ Approach: AE Studio’s Alignment Agenda · Cameron Berg, Judd Rosenblatt, AE Studio and Marc Carauleanu · Dec 18, 2023, 8:35 PM · 168 points · 21 comments · 12 min read · LW link
The Shortest Path Between Scylla and Charybdis · Thane Ruthenis · Dec 18, 2023, 8:08 PM · 50 points · 8 comments · 5 min read · LW link
OpenAI: Preparedness framework · Zach Stein-Perlman · Dec 18, 2023, 6:30 PM · 70 points · 23 comments · 4 min read · LW link (openai.com)
[Valence series] 5. “Valence Disorders” in Mental Health & Personality · Steven Byrnes · Dec 18, 2023, 3:26 PM · 43 points · 12 comments · 13 min read · LW link
Discussion: Challenges with Unsupervised LLM Knowledge Discovery · Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik and Rohin Shah · Dec 18, 2023, 11:58 AM · 147 points · 21 comments · 10 min read · LW link
Interpreting the Learning of Deceit · RogerDearnaley · Dec 18, 2023, 8:12 AM · 30 points · 14 comments · 9 min read · LW link
Talk: “AI Would Be A Lot Less Alarming If We Understood Agents” · johnswentworth · Dec 17, 2023, 11:46 PM · 58 points · 3 comments · 1 min read · LW link (www.youtube.com)
∀: a story · Richard_Ngo · Dec 17, 2023, 10:42 PM · 37 points · 1 comment · 8 min read · LW link (www.narrativeark.xyz)
Reviving a 2015 MacBook · jefftk · Dec 17, 2023, 9:00 PM · 11 points · 0 comments · 1 min read · LW link (www.jefftk.com)
A Common-Sense Case For Mutually-Misaligned AGIs Allying Against Humans · Thane Ruthenis · Dec 17, 2023, 8:28 PM · 29 points · 7 comments · 11 min read · LW link
The Limits of Artificial Consciousness: A Biology-Based Critique of Chalmers’ Fading Qualia Argument · Štěpán Los · Dec 17, 2023, 7:11 PM · −6 points · 9 comments · 17 min read · LW link
What makes teaching math special · Viliam · Dec 17, 2023, 2:15 PM · 41 points · 27 comments · 11 min read · LW link