Solomonoff Induction

TagLast edit: 21 Apr 2022 23:14 UTC by Alex_Altair

Solomonoff induction is an inference system defined by Ray Solomonoff that will learn to correctly predict any computable sequence with only the absolute minimum amount of data. This system, in a certain sense, is the perfect universal prediction algorithm.

To summarize it very informally, Solomonoff induction works by:

Starting with all possible hypotheses (sequences) as represented by computer programs (that generate those sequences), weighted by their simplicity (2^-ⁿ, where n is the program length);
Discarding those hypotheses that are inconsistent with the data.

Weighting hypotheses by simplicity, the system automatically incorporates a form of Occam’s razor, which is why it has been playfully referred to as Solomonoff’s lightsaber.

Solomonoff induction gets off the ground with a solution to the “problem of the priors”. Suppose that you stand before a universal prefix Turing machine U. You are interested in a certain finite output string y₀. In particular, you want to know the probability that U will produce the output y₀ given a random input tape. This probability is the Solomonoff a priori probability of y₀.

More precisely, suppose that a particular infinite input string x₀ is about to be fed into U. However, you know nothing about x₀ other than that each term of the string is either 0 or 1. As far as your state of knowledge is concerned, the ith digit of x₀ is as likely to be 0 as it is to be 1, for all i = 1, 2, …. You want to find the a priori probability m(y₀) of the following proposition:

(*) If U takes in x₀ as input, then U will produce output y₀ and then halt.

Unfortunately, computing the exact value of m(y₀) would require solving the halting problem, which is undecidable. Nonetheless, it is easy to derive an expression for m(y₀). If U halts on an infinite input string x, then U must read only a finite initial segment of x, after which U immediately halts. We call a finite string p a self-delimiting program if and only if there exists an infinite input string x beginning with p such that U halts on x immediately after reading to the end of p. The set 𝒫 of self-delimiting programs is the prefix code for U. It is the determination of the elements of 𝒫 that requires a solution to the halting problem.

Given p ∈ 𝒫, we write “prog (x₀) = p” to express the proposition that x₀ begins with p, and we write “U(p) = y₀″ to express the proposition that U produces output y₀, and then halts, when fed any input beginning with p. Proposition (*) is then equivalent to the exclusive disjunction

⋁_p_∈ 𝒫:_U₍_p_) =_y₀(prog (x₀) = p).
Since x₀ was chosen at random from {0, 1}^ω, we take the probability of prog (x₀) = p to be 2^− ℓ(^p⁾, where ℓ(p) is the length of p as a bit string. Hence, the probability of (*) is

m(y₀) := ∑_p_∈ 𝒫:_U₍_p_) =_y₀2^− ℓ(^p⁾.

See also

References

Algorithmic probability on Scholarpedia

The Solomonoff Prior is Malign

Mark Xu14 Oct 2020 1:33 UTC

171 points

52 comments16 min readLW link 3 reviews

An Intuitive Explanation of Solomonoff Induction

Alex_Altair11 Jul 2012 8:05 UTC

159 points

225 comments24 min readLW link

A Semitechnical Introductory Dialogue on Solomonoff Induction

Eliezer Yudkowsky4 Mar 2021 17:27 UTC

142 points

33 comments54 min readLW link

Open Problems Related to Solomonoff Induction

Wei Dai6 Jun 2012 0:26 UTC

44 points

104 comments2 min readLW link

Solomonoff induction still works if the universe is uncomputable, and its usefulness doesn’t require knowing Occam’s razor

Christopher King18 Jun 2023 1:52 UTC

38 points

28 comments4 min readLW link

When does rationality-as-search have nontrivial implications?

nostalgebraist4 Nov 2018 22:42 UTC

72 points

12 comments3 min readLW link

[Question] How is Solomonoff induction calculated in practice?

Bucky4 Jun 2019 10:11 UTC

33 points

13 comments1 min readLW link

The Problem of the Criterion

Gordon Seidoh Worley21 Jan 2021 15:05 UTC

61 points

63 comments10 min readLW link

K-complexity is silly; use cross-entropy instead

So8res20 Dec 2022 23:06 UTC

145 points

54 comments4 min readLW link 2 reviews

Solomonoff Cartesianism

Rob Bensinger2 Mar 2014 17:56 UTC

51 points

51 comments25 min readLW link

[Question] Is the human brain a valid choice for the Universal Turing Machine in Solomonoff Induction?

habryka8 Dec 2018 1:49 UTC

22 points

13 comments1 min readLW link

Clarifying Consequentialists in the Solomonoff Prior

Vlad Mikulik11 Jul 2018 2:35 UTC

20 points

16 comments6 min readLW link

A potential problem with using Solomonoff induction as a prior

JoshuaZ7 Apr 2011 19:27 UTC

18 points

18 comments1 min readLW link

Clarifying The Malignity of the Universal Prior: The Lexical Update

interstice15 Jan 2020 0:00 UTC

20 points

2 comments3 min readLW link

[Question] Questions about Solomonoff induction

mukashi10 Jan 2024 1:16 UTC

7 points

11 comments1 min readLW link

Reflective AIXI and Anthropics

Diffractor24 Sep 2018 2:15 UTC

18 points

14 comments8 min readLW link

Asymptotic Logical Uncertainty: Concrete Failure of the Solomonoff Approach

Scott Garrabrant22 Jul 2015 19:27 UTC

13 points

0 comments1 min readLW link

My impression of singular learning theory

Ege Erdil18 Jun 2023 15:34 UTC

47 points

30 comments2 min readLW link

Mathematical Inconsistency in Solomonoff Induction?

curi25 Aug 2020 17:09 UTC

7 points

15 comments2 min readLW link

Why you can’t treat decidability and complexity as a constant (Post #1)

Noosphere8926 Jul 2023 17:54 UTC

6 points

13 comments5 min readLW link

Limited agents need approximate induction

Manfred24 Apr 2015 7:42 UTC

16 points

10 comments8 min readLW link

Solomonoff Induction and Sleeping Beauty

ike17 Nov 2020 2:28 UTC

7 points

0 comments2 min readLW link

Solomonoff Induction explained via dialog.

panickedapricott21 Sep 2017 5:27 UTC

3 points

0 comments1 min readLW link

(arbital.com)

Occam’s Razor and the Universal Prior

Peter Chatain3 Oct 2021 3:23 UTC

28 points

5 comments21 min readLW link

What is the advantage of the Kolmogorov complexity prior?

skepsci16 Feb 2012 1:51 UTC

18 points

29 comments2 min readLW link

Are the fundamental physical constants computable?

Yair Halberstadt5 Apr 2022 15:05 UTC

15 points

6 comments2 min readLW link

From the “weird math questions” department...

CronoDAS9 Aug 2012 7:19 UTC

7 points

50 comments1 min readLW link

Pascal’s Mugging: Tiny Probabilities of Vast Utilities

Eliezer Yudkowsky19 Oct 2007 23:37 UTC

112 points

353 comments4 min readLW link

Multiple Worlds, One Universal Wave Function

evhub4 Nov 2020 22:28 UTC

60 points

76 comments61 min readLW link

Computational Model: Causal Diagrams with Symmetry

johnswentworth22 Aug 2019 17:54 UTC

53 points

29 comments4 min readLW link

An additional problem with Solomonoff induction

gedymin22 Jan 2014 23:34 UTC

3 points

51 comments4 min readLW link

Remarks 1–18 on GPT (compressed)

Cleo Nardo20 Mar 2023 22:27 UTC

146 points

35 comments31 min readLW link

[Question] Why would code/English or low-abstraction/high-abstraction simplicity or brevity correspond?

curi4 Sep 2020 19:46 UTC

2 points

15 comments1 min readLW link

Approximating Solomonoff Induction

Houshalter29 May 2015 12:23 UTC

13 points

45 comments3 min readLW link

The power of finite and the weakness of infinite binary point numbers

AxiomWriter20 Apr 2024 6:03 UTC

−3 points

6 comments2 min readLW link

Prediction can be Outer Aligned at Optimum

Lukas Finnveden10 Jan 2021 18:48 UTC

15 points

12 comments11 min readLW link

Excerpt from Arbital Solomonoff induction dialogue

Richard_Ngo17 Jan 2021 3:49 UTC

36 points

6 comments5 min readLW link

(arbital.com)

What program structures enable efficient induction?

Daniel C5 Sep 2024 10:12 UTC

21 points

5 comments3 min readLW link

Response to “What does the universal prior actually look like?”

michaelcohen20 May 2021 16:12 UTC

36 points

33 comments18 min readLW link

An attempt to break circularity in science

fryolysis15 Jul 2022 18:32 UTC

3 points

5 comments1 min readLW link

Does Solomonoff always win?

cousin_it23 Feb 2011 20:42 UTC

14 points

56 comments2 min readLW link

Summary of the Acausal Attack Issue for AIXI

Diffractor13 Dec 2021 8:16 UTC

12 points

6 comments4 min readLW link

[Question] Generalization of the Solomonoff Induction to Accuracy—Is it possible? Would it be useful?

PeterL20 Feb 2022 19:29 UTC

2 points

1 comment1 min readLW link

Commensurable Scientific Paradigms; or, computable induction

samshap13 Apr 2022 0:01 UTC

14 points

0 comments5 min readLW link

The Solomonoff prior is malign. It’s not a big deal.

Charlie Steiner25 Aug 2022 8:25 UTC

41 points

9 comments7 min readLW link

A Brief Introduction to Algorithmic Common Intelligence, ACI . 1

Akira Pyinya5 Apr 2023 5:43 UTC

−2 points

1 comment2 min readLW link

A Brief Introduction to ACI, 2: An Event-Centric View

Akira Pyinya12 Apr 2023 3:23 UTC

3 points

0 comments2 min readLW link

Prosaic misalignment from the Solomonoff Predictor

Cleo Nardo9 Dec 2022 17:53 UTC

42 points

3 comments5 min readLW link

Solomonoff Induction, by Shane Legg

cousin_it21 Feb 2011 0:32 UTC

21 points

8 comments1 min readLW link

all claw, no world — and other thoughts on the universal distribution

Tamsin Leake14 Dec 2022 18:55 UTC

15 points

0 comments7 min readLW link

(carado.moe)

(A Failed Approach) From Precedent to Utility Function

Akira Pyinya29 Apr 2023 21:55 UTC

0 points

2 comments4 min readLW link

Intuitive Explanation of Solomonoff Induction

lukeprog1 Dec 2011 6:56 UTC

14 points

31 comments10 min readLW link

Beyond Rewards and Values: A Non-dualistic Approach to Universal Intelligence

Akira Pyinya30 Dec 2022 19:05 UTC

10 points

4 comments14 min readLW link

Solomonoff’s solipsism

Mergimio H. Doefevmil8 May 2023 6:55 UTC

−13 points

9 comments1 min readLW link

Belief in the Implied Invisible

Eliezer Yudkowsky8 Apr 2008 7:40 UTC

65 points

34 comments6 min readLW link

ACI #3: The Origin of Goals and Utility

Akira Pyinya17 May 2023 20:47 UTC

1 point

0 comments6 min readLW link

Weak arguments against the universal prior being malign

X4vier14 Jun 2018 17:11 UTC

50 points

23 comments3 min readLW link

Decoherence is Simple

Eliezer Yudkowsky6 May 2008 7:44 UTC

72 points

62 comments11 min readLW link

This Territory Does Not Exist

ike13 Aug 2020 0:30 UTC

7 points

197 comments7 min readLW link

The Ethics of ACI

Akira Pyinya16 Feb 2023 23:51 UTC

−8 points

0 comments3 min readLW link

“The Solomonoff Prior is Malign” is a special case of a simpler argument

David Matolcsi17 Nov 2024 21:32 UTC

75 points

10 comments12 min readLW link

How do low level hypotheses constrain high level ones? The mystery of the disappearing diamond.

Christopher King11 Jul 2023 19:27 UTC

17 points

11 comments2 min readLW link

The prior of a hypothesis does not depend on its complexity

cousin_it26 Aug 2010 13:20 UTC

34 points

69 comments1 min readLW link

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Roger Dearnaley21 Feb 2023 9:05 UTC

10 points

1 comment23 min readLW link

No comments.