I went through the maths in OP and it seems to check out. I think the core inconsistency is that Solomonoff Induction implies l(X∪Y)=l(X) which is obviously wrong. I’m going to redo the maths below (breaking it down step-by-step more). curi has 2l(X)=l(X) which is the same inconsistency given his substitution. I’m not sure we can make that substitution but I also don’t think we need to.
Let X and Y be independent hypotheses for Solomonoff induction.
According to the prior, the non-normalized probability of X (and similarly for Y) is: (1)
curi has slightly different logic and argues l(X∪Y)≃2l(X) which I think is reasonable. His argument means we get l(X)≃2l(X). I don’t think those steps are necessary but they are worth mentioning as a difference. I think Equation (8) is enough.
I was curious about what happens when l(X)≠l(Y). Let’s assume the following: (9)
I went through the maths in OP and it seems to check out. I think the core inconsistency is that Solomonoff Induction implies l(X∪Y)=l(X) which is obviously wrong. I’m going to redo the maths below (breaking it down step-by-step more). curi has 2l(X)=l(X) which is the same inconsistency given his substitution. I’m not sure we can make that substitution but I also don’t think we need to.
Let X and Y be independent hypotheses for Solomonoff induction.
According to the prior, the non-normalized probability of X (and similarly for Y) is: (1)
P(X)=12l(X)
what is the probability of X∪Y? (2)
P(X∪Y)=P(X)+P(Y)−P(X∩Y)=12l(X)+12l(Y)−12l(X)⋅12l(Y)=12l(X)+12l(Y)−12l(X)⋅2l(Y)=12l(X)+12l(Y)−12l(X)+l(Y)
However, by Equation (1) we have: (3)
P(X∪Y)=12l(X∪Y)
thus (4)
12l(X∪Y)=12l(X)+12l(Y)−12l(X)+l(Y)
This must hold for any and all X and Y.
curi considers the case where X and Y are the same length, starting with Equation (4), we get (5):
12l(X∪Y)=12l(X)+12l(Y)−12l(X)+l(Y)=12l(X)+12l(X)−12l(X)+l(X)=22l(X)−122l(X)=12l(X)−1−122l(X)
but (6)
12l(X)−1≫122l(X)
and (7)
0≈122l(X)
so: (8)
12l(X∪Y)≃12l(X)−1∴l(X∪Y)≃l(X)−1□
curi has slightly different logic and argues l(X∪Y)≃2l(X) which I think is reasonable. His argument means we get l(X)≃2l(X). I don’t think those steps are necessary but they are worth mentioning as a difference. I think Equation (8) is enough.
I was curious about what happens when l(X)≠l(Y). Let’s assume the following: (9)
l(X)<l(Y)∴12l(X)≫12l(Y)
so, from Equation (2): (10)
P(X∪Y)=12l(X)+12l(Y)−12l(X)+l(Y)liml(Y)→∞P(X∪Y)=12l(X)+12l(Y)0−12l(X)+l(Y)0∴P(X∪Y)≃12l(X)
by Equation (3) and Equation (10): (11)
12l(X∪Y)≃12l(X)∴l(X∪Y)≃l(X)⇒l(Y)≃0
but Equation (9) says l(X)<l(Y) --- this contradicts Equation (11).
So there’s an inconsistency regardless of whether l(X)=l(Y) or not.