
Natural Abstraction

Last edit: Oct 10, 2022, 5:45 PM by Raemon

The Natural Abstraction hypothesis says that:

Our physical world abstracts well: for most systems, the information relevant “far away” from the system (in various senses) is much lower-dimensional than the system itself. These low-dimensional summaries are exactly the high-level abstract objects/concepts typically used by humans.

These abstractions are “natural”: a wide variety of cognitive architectures will learn to use approximately the same high-level abstract objects/concepts to reason about the world.

(from “Testing the Natural Abstraction Hypothesis”)
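The first claim is often motivated with an ideal-gas example, and a toy version makes it concrete. Below is a minimal sketch (our own illustration, not from the posts listed here; gas_microstate and far_away_summary are hypothetical names, and mean squared velocity as the far-away observable is an assumption chosen for the demo): two gases with completely different microstates at the same temperature are indistinguishable to an observer who only sees a far-away, low-dimensional summary.

```python
import numpy as np

# Toy illustration (not from the posts above): "information relevant far
# away is much lower-dimensional than the system itself".
rng = np.random.default_rng(0)

def gas_microstate(temperature, n=100_000):
    """High-dimensional system state: one velocity per particle."""
    return rng.normal(0.0, np.sqrt(temperature), size=n)

def far_away_summary(velocities):
    """The far-away observable assumed for this demo: mean squared
    velocity, a single number that tracks temperature."""
    return float(np.mean(velocities ** 2))

a = gas_microstate(temperature=2.0)
b = gas_microstate(temperature=2.0)

# The two microstates differ in ~100,000 coordinates...
print(np.max(np.abs(a - b)))                     # large: unrelated states
# ...but both collapse to the same one-dimensional summary.
print(far_away_summary(a), far_away_summary(b))  # both ~2.0
```

Any observer that interacts with the gas only through such channels has reason to learn “temperature” rather than the full microstate, which is the sense in which the second claim expects a wide variety of cognitive architectures to converge on the same concepts.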

Natural Abstractions: Key claims, Theorems, and Critiques

Mar 16, 2023, 4:37 PM
239 points
23 comments · 45 min read · LW link · 3 reviews

Natural Latents: The Concepts

Mar 20, 2024, 6:21 PM
90 points
18 comments · 19 min read · LW link

Natural Latents: The Math

Dec 27, 2023, 7:03 PM
123 points
40 comments · 12 min read · LW link · 2 reviews

Alignment By Default

johnswentworth · Aug 12, 2020, 6:54 PM
174 points
96 comments · 11 min read · LW link · 2 reviews

Testing The Natural Abstraction Hypothesis: Project Intro

johnswentworth · Apr 6, 2021, 9:24 PM
168 points
41 comments · 6 min read · LW link · 1 review

The Natural Abstraction Hypothesis: Implications and Evidence

CallumMcDougall · Dec 14, 2021, 11:14 PM
39 points
9 comments · 19 min read · LW link

What is a Tool?

Jun 25, 2024, 11:40 PM
62 points
4 comments · 6 min read · LW link

Contrapositive Natural Abstraction—Project Intro

Elliot Callender · Jun 24, 2024, 6:37 PM
4 points
5 comments · 2 min read · LW link

Public Static: What is Abstraction?

johnswentworth · Jun 9, 2020, 6:36 PM
97 points
18 comments · 11 min read · LW link

Agency As a Natural Abstraction

Thane Ruthenis · May 13, 2022, 6:02 PM
55 points
9 comments · 13 min read · LW link

Testing The Natural Abstraction Hypothesis: Project Update

johnswentworth · Sep 20, 2021, 3:44 AM
88 points
17 comments · 8 min read · LW link · 1 review

Relevant to natural abstractions: Euclidean Symmetry Equivariant Machine Learning—Overview, Applications, and Open Questions

the gears to ascension · Dec 8, 2022, 6:01 PM
8 points
0 comments · 1 min read · LW link
(youtu.be)

[ASoT] Natural abstractions and AlphaZero

Ulisse Mini · Dec 10, 2022, 5:53 PM
33 points
1 comment · 1 min read · LW link
(arxiv.org)

[Hebbian Natural Abstractions] Mathematical Foundations

Dec 25, 2022, 8:58 PM
15 points
2 comments · 6 min read · LW link
(www.snellessen.com)

Natural Abstraction: Convergent Preferences Over Information Structures

paulom · Oct 14, 2023, 6:34 PM
13 points
1 comment · 36 min read · LW link

AISafety.info: What is the “natural abstractions hypothesis”?

Algon · Oct 5, 2024, 12:31 PM
38 points
2 comments · 3 min read · LW link
(aisafety.info)

Towards the Operationalization of Philosophy & Wisdom

Thane Ruthenis · Oct 28, 2024, 7:45 PM
20 points
2 comments · 33 min read · LW link
(aiimpacts.org)

Minimal Motivation of Natural Latents

Oct 14, 2024, 10:51 PM
46 points
14 comments · 3 min read · LW link

Disentangling Representations through Multi-task Learning

Bogdan Ionut Cirstea · Nov 24, 2024, 1:10 PM
14 points
1 comment · 1 min read · LW link
(arxiv.org)

Natural abstractions are observer-dependent: a conversation with John Wentworth

Martín Soto · Feb 12, 2024, 5:28 PM
39 points
13 comments · 7 min read · LW link

AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them

Roman Leventov · Dec 27, 2023, 2:51 PM
33 points
9 comments · 4 min read · LW link

The Plan - 2023 Version

johnswentworth · Dec 29, 2023, 11:34 PM
151 points
40 comments · 31 min read · LW link · 1 review

From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models

Roman Leventov · Feb 6, 2024, 10:18 AM
8 points
1 comment · 4 min read · LW link
(arxiv.org)

Abstract Mathematical Concepts vs. Abstractions Over Real-World Systems

Thane Ruthenis · Feb 18, 2025, 6:04 PM
11 points
1 comment · 4 min read · LW link

Natural Latents Are Not Robust To Tiny Mixtures

Jun 7, 2024, 6:53 PM
61 points
8 comments · 5 min read · LW link

AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

Bogdan Ionut Cirstea · Sep 14, 2024, 11:23 PM
17 points
1 comment · 1 min read · LW link
(arxiv.org)

Validating / finding alignment-relevant concepts using neural data

Bogdan Ionut Cirstea · Sep 20, 2024, 9:12 PM
7 points
0 comments · 1 min read · LW link
(docs.google.com)

Idealized Agents Are Approximate Causal Mirrors (+ Radical Optimism on Agent Foundations)

Thane Ruthenis · Dec 22, 2023, 8:19 PM
74 points
14 comments · 6 min read · LW link

What Does The Natural Abstraction Framework Say About ELK?

johnswentworth · Feb 15, 2022, 2:27 AM
35 points
0 comments · 6 min read · LW link

[Hebbian Natural Abstractions] Introduction

Nov 21, 2022, 8:34 PM
34 points
3 comments · 4 min read · LW link
(www.snellessen.com)

Select Agent Specifications as Natural Abstractions

lukemarks · Apr 7, 2023, 11:16 PM
19 points
3 comments · 5 min read · LW link

A rough and incomplete review of some of John Wentworth’s research

So8res · Mar 28, 2023, 6:52 PM
175 points
18 comments · 18 min read · LW link

The Lightcone Theorem: A Better Foundation For Natural Abstraction?

johnswentworth · May 15, 2023, 2:24 AM
69 points
25 comments · 6 min read · LW link

$500 Bounty/Prize Problem: Channel Capacity Using “Insensitive” Functions

johnswentworth · May 16, 2023, 9:31 PM
40 points
11 comments · 2 min read · LW link

Abstraction is Bigger than Natural Abstraction

Nicholas / Heather Kross · May 31, 2023, 12:00 AM
18 points
0 comments · 5 min read · LW link
(www.thinkingmuchbetter.com)

Natural Categories Update

Logan Zoellner · Oct 10, 2022, 3:19 PM
33 points
6 comments · 2 min read · LW link

Computing Natural Abstractions: Linear Approximation

johnswentworth · Apr 15, 2021, 5:47 PM
41 points
22 comments · 7 min read · LW link

AXRP Episode 15 - Natural Abstractions with John Wentworth

DanielFilan · May 23, 2022, 5:40 AM
34 points
1 comment · 58 min read · LW link

The Core of the Alignment Problem is...

Aug 17, 2022, 8:07 PM
76 points
10 comments · 9 min read · LW link

Causal Abstraction Toy Model: Medical Sensor

johnswentworth · Dec 11, 2019, 9:12 PM
34 points
6 comments · 6 min read · LW link

The Plan - 2022 Update

johnswentworth · Dec 1, 2022, 8:43 PM
239 points
37 comments · 8 min read · LW link · 1 review

Take 4: One problem with natural abstractions is there’s too many of them.

Charlie Steiner · Dec 5, 2022, 10:39 AM
37 points
4 comments · 1 min read · LW link

Take 5: Another problem for natural abstractions is laziness.

Charlie Steiner · Dec 6, 2022, 7:00 AM
31 points
4 comments · 3 min read · LW link

If Wentworth is right about natural abstractions, it would be bad for alignment

Wuschel Schulz · Dec 8, 2022, 3:19 PM
29 points
5 comments · 4 min read · LW link

The “Minimal Latents” Approach to Natural Abstractions

johnswentworth · Dec 20, 2022, 1:22 AM
53 points
24 comments · 12 min read · LW link

Causal abstractions vs infradistributions

Pablo Villalobos · Dec 26, 2022, 12:21 AM
24 points
0 comments · 6 min read · LW link

Simulacra are Things

janus · Jan 8, 2023, 11:03 PM
63 points
7 comments · 2 min read · LW link

World-Model Interpretability Is All We Need

Thane Ruthenis · Jan 14, 2023, 7:37 PM
35 points
22 comments · 21 min read · LW link

Why I’m not working on {debate, RRM, ELK, natural abstractions}

Steven Byrnes · Feb 10, 2023, 7:22 PM
71 points
19 comments · 9 min read · LW link

The conceptual Doppelgänger problem

TsviBT · Feb 12, 2023, 5:23 PM
12 points
5 comments · 4 min read · LW link

[Question] Is InstructGPT Following Instructions in Other Languages Surprising?

DragonGod · Feb 13, 2023, 11:26 PM
39 points
15 comments · 1 min read · LW link

[Appendix] Natural Abstractions: Key Claims, Theorems, and Critiques

Mar 16, 2023, 4:38 PM
48 points
0 comments · 13 min read · LW link

Wittgenstein’s Language Games and the Critique of the Natural Abstraction Hypothesis

Chris_Leong · Mar 16, 2023, 7:56 AM
16 points
19 comments · 2 min read · LW link

[Question] [DISC] Are Values Robust?

DragonGod · Dec 21, 2022, 1:00 AM
12 points
9 comments · 2 min read · LW link

Jonothan Gorard: The territory is isomorphic to an equivalence class of its maps

Daniel C · Sep 7, 2024, 10:04 AM
19 points
18 comments · 2 min read · LW link
(x.com)

My AI Model Delta Compared To Yudkowsky

johnswentworth · Jun 10, 2024, 4:12 PM
277 points
103 comments · 4 min read · LW link

Contra Steiner on Too Many Natural Abstractions

DragonGod · Dec 24, 2022, 5:42 PM
10 points
6 comments · 1 min read · LW link

Alignment Targets and The Natural Abstraction Hypothesis

Stephen Fowler · Mar 8, 2023, 11:45 AM
10 points
0 comments · 3 min read · LW link

[Linkpost] Concept Alignment as a Prerequisite for Value Alignment

Bogdan Ionut Cirstea · Nov 4, 2023, 5:34 PM
27 points
0 comments · 1 min read · LW link
(arxiv.org)

Simulators Increase the Likelihood of Alignment by Default

Wuschel Schulz · Apr 30, 2023, 4:32 PM
13 points
1 comment · 5 min read · LW link

«Boundaries/Membranes» and AI safety compilation

Chipmonk · May 3, 2023, 9:41 PM
57 points
17 comments · 8 min read · LW link

[Linkpost] MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Bogdan Ionut Cirstea · Mar 10, 2024, 1:30 AM
10 points
0 comments · 1 min read · LW link
(openreview.net)

[Question] Does the Telephone Theorem give us a free lunch?

Numendil · Feb 15, 2023, 2:13 AM
11 points
2 comments · 1 min read · LW link

Abstraction As Symmetry and Other Thoughts

Numendil · Feb 1, 2023, 6:25 AM
28 points
9 comments · 2 min read · LW link

Nature < Nurture for AIs

scottviteri · Jun 4, 2023, 8:38 PM
14 points
22 comments · 7 min read · LW link

[Linkpost] Large Language Models Converge on Brain-Like Word Representations

Bogdan Ionut Cirstea · Jun 11, 2023, 11:20 AM
36 points
12 comments · 1 min read · LW link

[Linkpost] Scaling laws for language encoding models in fMRI

Bogdan Ionut Cirstea · Jun 8, 2023, 10:52 AM
30 points
0 comments · 1 min read · LW link

[Linkpost] Mapping Brains with Language Models: A Survey

Bogdan Ionut Cirstea · Jun 16, 2023, 9:49 AM
5 points
0 comments · 1 min read · LW link

[Linkpost] Rosetta Neurons: Mining the Common Units in a Model Zoo

Bogdan Ionut Cirstea · Jun 17, 2023, 4:38 PM
12 points
0 comments · 1 min read · LW link

[Linkpost] A shared linguistic space for transmitting our thoughts from brain to brain in natural conversations

Bogdan Ionut Cirstea · Jul 1, 2023, 1:57 PM
17 points
2 comments · 1 min read · LW link

[Linkpost] Large language models converge toward human-like concept organization

Bogdan Ionut Cirstea · Sep 2, 2023, 6:00 AM
22 points
1 comment · 1 min read · LW link

An embedding decoder model, trained with a different objective on a different dataset, can decode another model’s embeddings surprisingly accurately

Logan Zoellner · Sep 3, 2023, 11:34 AM
20 points
1 comment · 1 min read · LW link

The utility of humans within a Super Artificial Intelligence realm.

Marc Monroy · Oct 11, 2023, 5:30 PM
1 point
0 comments · 7 min read · LW link

[Linkpost] Generalization in diffusion models arises from geometry-adaptive harmonic representation

Bogdan Ionut Cirstea · Oct 11, 2023, 5:48 PM
4 points
3 comments · 1 min read · LW link

Towards building blocks of ontologies

Feb 8, 2025, 4:03 PM
27 points
0 comments · 26 min read · LW link

Universal dimensions of visual representation

Bogdan Ionut Cirstea · Aug 28, 2024, 10:38 AM
10 points
0 comments · 1 min read · LW link
(arxiv.org)

Abstractions are not Natural

Alfred Harwood · Nov 4, 2024, 11:10 AM
25 points
21 comments · 11 min read · LW link