Singular Learning Theory

TagLast edit: Jun 20, 2023, 11:25 PM by DanielFilan

Singluar learning theory is a theory that applies algebraic geometry to statistical learning theory, developed by Sumio Watanabe. Reference textbooks are “the grey book”, Algebraic Geometry and Statistical Learning Theory, and “the green book”, Mathematical Theory of Bayesian Statistics.

DSLT 2. Why Neural Networks obey Occam’s Razor

Liam CarrollJun 18, 2023, 12:23 AM

24 points

14 comments17 min readLW link

DSLT 0. Distilling Singular Learning Theory

Liam CarrollJun 16, 2023, 9:50 AM

80 points

7 comments5 min readLW link

DSLT 3. Neural Networks are Singular

Liam CarrollJun 20, 2023, 8:20 AM

29 points

5 comments19 min readLW link

DSLT 1. The RLCT Measures the Effective Dimension of Neural Networks

Liam CarrollJun 16, 2023, 9:50 AM

54 points

10 comments13 min readLW link

Neural networks generalize because of this one weird trick

Jesse HooglandJan 18, 2023, 12:10 AM

181 points

34 comments15 min readLW link 1 review

(www.jessehoogland.com)

Singular learning theory: exercises

Zach FurmanAug 30, 2024, 8:00 PM

90 points

5 comments14 min readLW link

Announcing Timaeus

Jesse Hoogland, Daniel Murfet, Alexander Gietelink Oldenziel and Stan van Wingerden

Oct 22, 2023, 11:59 AM

188 points

15 comments4 min readLW link

Growth and Form in a Toy Model of Superposition

Liam Carroll and Edmund Lau

Nov 8, 2023, 11:08 AM

89 points

7 comments14 min readLW link

DSLT 4. Phase Transitions in Neural Networks

Liam CarrollJun 24, 2023, 5:22 PM

30 points

3 comments16 min readLW link

Investigating the learning coefficient of modular addition: hackathon project

Nina Panickssery and Dmitry Vaintrob

Oct 17, 2023, 7:51 PM

94 points

5 comments12 min readLW link

Gradient surfing: the hidden role of regularization

Jesse HooglandFeb 6, 2023, 3:50 AM

37 points

9 comments14 min readLW link

(www.jessehoogland.com)

Timaeus’s First Four Months

Jesse Hoogland, Daniel Murfet, Stan van Wingerden and Alexander Gietelink Oldenziel

Feb 28, 2024, 5:01 PM

173 points

6 comments6 min readLW link

Spooky action at a distance in the loss landscape

Jesse Hoogland and Filip Sondej

Jan 28, 2023, 12:22 AM

61 points

4 comments7 min readLW link

(www.jessehoogland.com)

Generalization, from thermodynamics to statistical physics

Jesse HooglandNov 30, 2023, 9:28 PM

64 points

9 comments28 min readLW link

Dialogue introduction to Singular Learning Theory

Olli JärviniemiJul 8, 2024, 4:58 PM

101 points

15 comments8 min readLW link

Proof idea: SLT to AIT

Lucius BushnaqFeb 10, 2025, 11:14 PM

40 points

15 comments6 min readLW link

Simple versus Short: Higher-order degeneracy and error-correction

Daniel MurfetMar 11, 2024, 7:52 AM

110 points

8 comments13 min readLW link

You’re Measuring Model Complexity Wrong

Jesse Hoogland and Stan van Wingerden

Oct 11, 2023, 11:46 AM

93 points

17 comments13 min readLW link

Towards Developmental Interpretability

Jesse Hoogland, Alexander Gietelink Oldenziel, Daniel Murfet and Stan van Wingerden

Jul 12, 2023, 7:33 PM

192 points

10 comments9 min readLW link 1 review

The generalization phase diagram

Dmitry VaintrobJan 26, 2025, 8:30 PM

26 points

2 comments16 min readLW link

Stagewise Development in Neural Networks

Jesse Hoogland, Liam Carroll and Daniel Murfet

Mar 20, 2024, 7:54 PM

96 points

1 comment11 min readLW link

Apply for the 2023 Developmental Interpretability Conference!

Stan van Wingerden, Alexander Gietelink Oldenziel, Jesse Hoogland and Daniel Murfet

Aug 25, 2023, 7:12 AM

33 points

0 comments2 min readLW link

[Question] A few Alignment questions: utility optimizers, SLT, sharp left turn and identifiability

Igor TimofeevSep 26, 2023, 12:27 AM

6 points

1 comment2 min readLW link

Timaeus is hiring!

Jesse Hoogland, Stan van Wingerden, Alexander Gietelink Oldenziel and Daniel Murfet

Jul 12, 2024, 11:42 PM

67 points

6 comments2 min readLW link

Deep Learning is cheap Solomonoff induction?

Lucius Bushnaq, Kaarel and Dmitry Vaintrob

Dec 7, 2024, 11:00 AM

45 points

1 comment17 min readLW link

Epoch wise critical periods, and singular learning theory

Garrett BakerDec 14, 2023, 8:55 PM

16 points

1 comment5 min readLW link

AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory

DanielFilanNov 27, 2024, 6:30 AM

34 points

0 comments10 min readLW link

AXRP Episode 31 - Singular Learning Theory with Daniel Murfet

DanielFilanMay 7, 2024, 3:50 AM

72 points

4 comments71 min readLW link

Interpreting Complexity

Maxwell AdamMar 14, 2025, 4:52 AM

53 points

8 comments26 min readLW link

Open Call for Research Assistants in Developmental Interpretability

Jesse Hoogland, Daniel Murfet, Alexander Gietelink Oldenziel and Stan van Wingerden

Aug 30, 2023, 9:02 AM

55 points

11 comments4 min readLW link

Estimating the Probability of Sampling a Trained Neural Network at Random

Adam Scherlis and Nora Belrose

Mar 1, 2025, 2:11 AM

32 points

10 comments1 min readLW link

(arxiv.org)

The Hessian rank bounds the learning coefficient

Lucius BushnaqAug 8, 2024, 8:55 PM

68 points

10 comments4 min readLW link

A short ‘derivation’ of Watanabe’s Free Energy Formula

Wuschel SchulzJan 29, 2024, 11:41 PM

13 points

6 comments7 min readLW link

My impression of singular learning theory

Ege ErdilJun 18, 2023, 3:34 PM

47 points

30 comments2 min readLW link

rough draft on what happens in the brain when you have an insight

EmrikMay 21, 2024, 6:02 PM

11 points

2 comments1 min readLW link

Learning coefficient estimation: the details

Zach FurmanNov 16, 2023, 3:19 AM

36 points

0 comments2 min readLW link

(colab.research.google.com)

Interview Daniel Murfet on Universal Phenomena in Learning Machines

Alexander Gietelink OldenzielFeb 6, 2023, 12:00 AM

50 points

1 comment16 min readLW link

My Criticism of Singular Learning Theory

Joar SkalseNov 19, 2023, 3:19 PM

83 points

56 comments12 min readLW link

Estimating effective dimensionality of MNIST models

Arjun PanicksseryNov 2, 2023, 2:13 PM

41 points

3 comments1 min readLW link

Singularities against the Singularity: Announcing Workshop on Singular Learning Theory and Alignment

Jesse Hoogland, Alexander Gietelink Oldenziel and Daniel Murfet

Apr 1, 2023, 9:58 AM

87 points

0 comments1 min readLW link

(singularlearningtheory.com)

My hopes for alignment: Singular learning theory and whole brain emulation

Garrett BakerOct 25, 2023, 6:31 PM

61 points

5 comments12 min readLW link

Degeneracies are sticky for SGD

Guillaume Corlouer and Nicolas Macé

Jun 16, 2024, 9:19 PM

56 points

1 comment16 min readLW link

Singular Learning Theory for Dummies

Rahul ChandOct 15, 2024, 9:13 PM

1 point

0 comments8 min readLW link

Minor interpretability exploration #4: LayerNorm and the learning coefficient

Rareș BaronMar 20, 2025, 4:18 PM

2 points

0 comments1 min readLW link

Feature Targeted LLC Estimation Distinguishes SAE Features from Random Directions

Lidor Banuel Dabbah and Aviel Boag

Jul 19, 2024, 8:32 PM

59 points

6 comments16 min readLW link

Minor interpretability exploration #3: Extending superposition to different activation functions (loss landscape)

Rareș BaronMar 14, 2025, 3:45 PM

3 points

0 comments3 min readLW link

Jesse Hoogland on Developmental Interpretability and Singular Learning Theory

Michaël TrazziJul 6, 2023, 3:46 PM

42 points

2 comments4 min readLW link

(theinsideview.ai)

Singular learning theory and bridging from ML to brain emulations

kave and Garrett Baker

Nov 1, 2023, 9:31 PM

26 points

16 comments29 min readLW link

The Theory Behind Loss Curves

James CamachoMay 6, 2025, 10:22 PM

16 points

3 comments4 min readLW link

(github.com)

No comments.

Sin­gu­lar Learn­ing Theory

Singular Learning Theory