All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

AGI & Consciousness—Joscha Bach

Rahul Chand8 Oct 2024 22:51 UTC

1 point

0 comments10 min readLW link

Video and transcript of presentation on Otherness and control in the age of AGI

Joe Carlsmith8 Oct 2024 22:30 UTC

35 points

1 comment27 min readLW link

From seeded complexity to consciousness—yes, it’s all the same.

eschatail8 Oct 2024 21:31 UTC

−23 points

0 comments2 min readLW link

Limits of safe and aligned AI

Shivam8 Oct 2024 21:30 UTC

2 points

0 comments4 min readLW link

[Question] What constitutes an infohazard?

K1r4d4rk.v18 Oct 2024 21:29 UTC

−4 points

8 comments1 min readLW link

[Question] What makes one a “rationalist”?

mathyouf8 Oct 2024 20:25 UTC

7 points

5 comments3 min readLW link

[Intuitive self-models] 4. Trance

Steven Byrnes8 Oct 2024 13:30 UTC

75 points

7 comments24 min readLW link

Schelling game evaluations for AI control

Olli Järviniemi8 Oct 2024 12:01 UTC

65 points

5 comments11 min readLW link

Thinking About a Pedalboard

jefftk8 Oct 2024 11:50 UTC

9 points

2 comments1 min readLW link

(www.jefftk.com)

Overview of strong human intelligence amplification methods

TsviBT8 Oct 2024 8:37 UTC

270 points

141 comments10 min readLW link

Near-death experiences

Declan Molony8 Oct 2024 6:34 UTC

3 points

1 comment3 min readLW link

The unreasonable effectiveness of plasmid sequencing as a service

Abhishaike Mahajan8 Oct 2024 2:02 UTC

23 points

2 comments13 min readLW link

(www.owlposting.com)

There is a globe in your LLM

jacob_drori8 Oct 2024 0:43 UTC

86 points

4 comments1 min readLW link

MATS AI Safety Strategy Curriculum v2

DanielFilan and Ryan Kidd

7 Oct 2024 22:44 UTC

42 points

6 comments13 min readLW link

2025 Color Trends

sarahconstantin7 Oct 2024 21:20 UTC

40 points

7 comments6 min readLW link

(sarahconstantin.substack.com)

Clarifying Alignment Fundamentals Through the Lens of Ontology

eternal/ephemera7 Oct 2024 20:57 UTC

12 points

4 comments24 min readLW link

Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox

MrFantastic7 Oct 2024 20:56 UTC

−12 points

0 comments1 min readLW link

Domain-specific SAEs

jacob_drori7 Oct 2024 20:15 UTC

27 points

0 comments5 min readLW link

Metaculus Is Open Source

ChristianWilliams7 Oct 2024 19:55 UTC

13 points

0 comments1 min readLW link

(www.metaculus.com)

Research update: Towards a Law of Iterated Expectations for Heuristic Estimators

Eric Neyman7 Oct 2024 19:29 UTC

87 points

2 comments22 min readLW link

AI Model Registries: A Foundational Tool for AI Governance

Elliot Mckernon, Deric Cheng and Gwyn Glasser

7 Oct 2024 19:27 UTC

20 points

1 comment4 min readLW link

(www.convergenceanalysis.org)

Evaluating the truth of statements in a world of ambiguous language.

Hastings7 Oct 2024 18:08 UTC

48 points

19 comments2 min readLW link

Advice for journalists

Nathan Young7 Oct 2024 16:46 UTC

100 points

53 comments9 min readLW link

(nathanpmyoung.substack.com)

Time Efficient Resistance Training

romeostevensit7 Oct 2024 15:15 UTC

42 points

10 comments3 min readLW link

A Narrow Path: a plan to deal with AI extinction risk

Andrea_Miotti, davekasten and Tolga

7 Oct 2024 13:02 UTC

73 points

11 comments2 min readLW link

(www.narrowpath.co)

Toy Models of Feature Absorption in SAEs

chanind, hrdkbhatnagar, TomasD and Joseph Bloom

7 Oct 2024 9:56 UTC

49 points

8 comments10 min readLW link

An argument that consequentialism is incomplete

cousin_it7 Oct 2024 9:45 UTC

32 points

27 comments1 min readLW link

An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

hugofry, Ahmed Abdulaal, NMontanaBrown and a-ijishakin

7 Oct 2024 8:53 UTC

38 points

0 comments5 min readLW link

(arxiv.org)

Compelling Villains and Coherent Values

Cole Wyeth6 Oct 2024 19:53 UTC

38 points

4 comments4 min readLW link

To Be Born in a Bag

Niko_McCarty6 Oct 2024 17:21 UTC

19 points

1 comment16 min readLW link

(www.asimov.press)

Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways

Pug stanky6 Oct 2024 16:37 UTC

1 point

2 comments4 min readLW link

Why I’m not a Bayesian

Richard_Ngo6 Oct 2024 15:22 UTC

187 points

92 comments10 min readLW link

(www.mindthefuture.info)

European Progress Conference

Martin Sustrik6 Oct 2024 11:10 UTC

27 points

11 comments3 min readLW link

(250bpm.substack.com)

Open Thread Fall 2024

habryka5 Oct 2024 22:28 UTC

45 points

171 comments1 min readLW link

[Question] Seeking AI Alignment Tutor/Advisor: $100–150/hr

MrThink5 Oct 2024 21:28 UTC

26 points

3 comments2 min readLW link

Interpretability of SAE Features Representing Check in ChessGPT

Jonathan Kutasov5 Oct 2024 20:43 UTC

27 points

2 comments8 min readLW link

2024 Election Forecasting Contest

mike207315 Oct 2024 20:43 UTC

4 points

0 comments1 min readLW link

(www.mikesblog.net)

5 ways to improve CoT faithfulness

CBiddulph5 Oct 2024 20:17 UTC

38 points

39 comments6 min readLW link

Consciousness As Recursive Reflections

Gunnar_Zarncke5 Oct 2024 20:00 UTC

7 points

3 comments1 min readLW link

(www.astralcodexten.com)

What is it like to be psychologically healthy? Podcast ft. DaystarEld

Chipmonk and DaystarEld

5 Oct 2024 19:14 UTC

31 points

8 comments2 min readLW link

(chrislakin.blog)

Musings on Text Data Wall (Oct 2024)

Vladimir_Nesov5 Oct 2024 19:00 UTC

20 points

2 comments5 min readLW link

Apply to the Cooperative AI PhD Fellowship by October 14th!

Lewis Hammond5 Oct 2024 12:41 UTC

23 points

0 comments1 min readLW link

AISafety.info: What is the “natural abstractions hypothesis”?

Algon5 Oct 2024 12:31 UTC

38 points

2 comments3 min readLW link

(aisafety.info)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

25Hour and submarat

5 Oct 2024 11:30 UTC

34 points

2 comments8 min readLW link

Exploring SAE features in LLMs with definition trees and token lists

mwatkins4 Oct 2024 22:15 UTC

37 points

5 comments6 min readLW link

AXRP Episode 37 - Jaime Sevilla on Forecasting AI

DanielFilan4 Oct 2024 21:00 UTC

21 points

3 comments56 min readLW link

[Question] Seeking Solutions for Aggregating Classifier Outputs

Saeid Ghafouri4 Oct 2024 17:39 UTC

−1 points

0 comments1 min readLW link

Amoeba roles in tech

Sindhu Shivaprasad4 Oct 2024 17:25 UTC

12 points

0 comments4 min readLW link

LASR Labs Spring 2025 applications are open!

Erin Robertson, charlie_griffin, joehardie and Justin Olive

4 Oct 2024 13:44 UTC

37 points

0 comments4 min readLW link

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

Sodium3 Oct 2024 19:11 UTC

34 points

17 comments17 min readLW link