Morendil comments on Book Club Update, Chapter 2 of Probability Theory

Morendil 29 Jun 2010 8:13 UTC
3 points
OK, thanks.

I’m able to follow a fair bit of what’s going on here; the hard portions for me are when Jaynes gets some result without saying which rule or operation justifies it—I suppose it’s obvious to someone familiar with calculus, but when you lack these background assumptions it can be very hard to infer what rules are being used, so I can’t even find out how I might plug the gaps in my knowledge. (Definitely “deadly unk-unk” territory for me.)

(Of course “follow” isn’t the same thing at all as “would be able to get similar results on a different but related problem”. I grok the notion of a functional equation, and I can verify intermediate steps using a symbolic math package, but Jaynes’ overall strategy is obscure to me. Is this a common pattern, taking the derivative of a functional equation then integrating back?)

The next bit where I lose track is 2.22. What’s going on here, is this a total derivative?
What links here?
- Book Club Update, Chapter 2 of Probability Theory by Morendil (29 Jun 2010 0:46 UTC; 10 points)
- taiyo 29 Jun 2010 8:56 UTC
  2 points
  Parent
  Yeah. A total derivative. The way I think about it is the dv thing there (jargon: a differential 1-form) eats a tangent vector in the y-z plane. It spits out the rate of change of the function in the direction of the vector (scaled appropriately with the magnitude of the vector). It does this by looking at the rate of change in the y-direction (the dy stuff) and in the z-direction (the dz stuff) and adding those together (since after taking derivatives, things get nice and linear).
  
  I’m not too familiar with the functional equation business either. I’m currently trying to figure out what the heck is happening on the bottom half of page 32. Figuring out the top half took me a really long while (esp. 2.50).
  
  I’m convinced that the inequality in eqn 2.52 shouldn’t be there. In particular, when you stick in the solution S(x) = 1 - x, it’s false. I can’t figure out if anything below it depends on that because I don’t understand much below it.
  - Soki 1 Jul 2010 15:23 UTC
    4 points
    Parent
    I could not figure out why alpha > 0 neither and it seems wrong to me too. But this does not look like a problem.
    
    We know that J is an increasing function because of 2-49. So in 2-53, alpha and log(x/S(x)) must have the same sign, since the remaining of the right member tends toward 0 when q tends toward + infinity.
    
    Then b is positive and I think it is all that matters.
    
    However, if alpha = 0, b is not defined. But if alpha=0 then log(x/S(x))=0 as a consequence of 2-53, so x/S(x)=1. There is only one x that gives us this since S is strictly decreasing. And by continuity we can still get 2-56.
    - taiyo 1 Jul 2010 20:53 UTC
      0 points
      Parent
      Lovely. Thanks.
  - Morendil 1 Jul 2010 16:17 UTC
    0 points
    Parent
    I’m totally stuck on getting 2.50 from 2.48, would appreciate a hint.
    What links here?
    Book Club Update, Chapter 2 of Probability Theory by Morendil (29 Jun 2010 0:46 UTC; 10 points)
    - taiyo 1 Jul 2010 17:44 UTC
      3 points
      Parent
      K. S. Van Horn gives a few lines describing the derivation in his PT:TLoS errata. I don’t understand why he does step 4 there—it seems to me to be irrelevant. The two main facts which are needed are step 2-3 and step 5, the sum of a geometric series and the Taylor series expansion around y = S(x). Hopefully that is a good hint.
      
      Nitpicking with his errata, 1/(1-z) = 1 + z + O(z^2) for all z is wrong since the interval of convergence for the RHS is (-1,1). This is not important to the problem since the z here will be z = exp(-q) which is less than 1 since q is positive.
      - Soki 1 Jul 2010 21:54 UTC
        3 points
        Parent
        It is not very important, but since you mentioned it :
        
        The interval of convergence of the Taylor series of 1/(1-z) at z=0 is indeed (-1,1).
        
        But “1/(1-z) = 1 + z + O(z^2) for all z” does not make sense to me.
        
        1/(1-z) = 1 + z + O(z^2) means that there is an M such as |1/(1-z) - (1 + z)| is no greater that M*z^2 for every z close enough to 0. It is about the behavior of 1/(1-z) - (1 + z) when z tends toward 0, not when z belongs to (-1,1).
      - Morendil 4 Jul 2010 23:15 UTC
        0 points
        Parent
        Is there anything more to getting 2.53 than just rearranging things around? I’m not sure I really understand where we get the left-hand side from.
      - Morendil 1 Jul 2010 20:15 UTC
        0 points
        Parent
        
        Hopefully that is a good hint.
        
        Indeed, thanks!