Correct me if I’m wrong, but would the actual measure of the connection between A and B be more accurately summarized as K(A + B) < K(A) + K(B), then?
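If it helps to make that inequality concrete, here’s a rough sketch that uses zlib’s compressed length as a crude, computable stand-in for Kolmogorov complexity K (the sample strings are arbitrary choices for the example): when the two strings share structure, the concatenation compresses to well under the sum of the parts.

```python
import zlib

def c(data: bytes) -> int:
    """Compressed length in bytes -- a crude, computable proxy for K."""
    return len(zlib.compress(data, 9))

# Two related strings: b is a near-copy of a, so they share most of their structure.
a = b"the quick brown fox jumps over the lazy dog. " * 50
b = a.replace(b"fox", b"cat")

print("C(a) + C(b):", c(a) + c(b))
print("C(a + b):   ", c(a + b))  # noticeably smaller: the shared structure only gets paid for once
```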
I believe that’s an equivalent way to express “H(X) - H(X|Y) > 0” (i.e., positive mutual information, I(X;Y) > 0) and “P(A ∩ B) ≠ P(A) * P(B)”. Or at least, any one of the three can be derived from any of the others.
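For concreteness, a minimal sketch (the two 2×2 joint distributions are made up for the example) showing that H(X) - H(X|Y) is zero when the joint factors as P(x)·P(y) and positive when it doesn’t:

```python
import math

def entropy(ps):
    return -sum(p * math.log2(p) for p in ps if p > 0)

def mutual_information(joint):
    """I(X;Y) = H(X) - H(X|Y) for a joint distribution given as a 2D table."""
    px = [sum(row) for row in joint]
    py = [sum(col) for col in zip(*joint)]
    h_x = entropy(px)
    # H(X|Y) = sum over y of P(y) * H(X | Y=y)
    h_x_given_y = sum(
        py[j] * entropy([joint[i][j] / py[j] for i in range(len(joint))])
        for j in range(len(py)) if py[j] > 0
    )
    return h_x - h_x_given_y

independent = [[0.3 * 0.6, 0.3 * 0.4],
               [0.7 * 0.6, 0.7 * 0.4]]   # P(x, y) = P(x) * P(y) by construction
dependent   = [[0.4, 0.1],
               [0.1, 0.4]]               # P(x, y) != P(x) * P(y)

print(mutual_information(independent))  # ~0.0
print(mutual_information(dependent))    # > 0 (about 0.28 bits)
```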
Note that the Kullback-Leibler divergence (also known as relative entropy) between the distributions of X and Y is the expected number of extra bits required to code data sampled from X when your compression algorithm is optimized for Y, which shows how these all relate.
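And a sketch of that coding interpretation, with two made-up distributions p (the true source) and q (what the code was optimized for): the overhead of coding p-samples with q’s code, i.e. cross-entropy minus entropy, comes out exactly equal to D_KL(p || q).

```python
import math

p = [0.5, 0.25, 0.25]   # true source distribution (arbitrary example values)
q = [0.8, 0.1, 0.1]     # distribution the code was optimized for

entropy_p     = -sum(pi * math.log2(pi) for pi in p)              # optimal bits per symbol
cross_entropy = -sum(pi * math.log2(qi) for pi, qi in zip(p, q))  # actual bits per symbol using q's code
kl_divergence =  sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q))

print(cross_entropy - entropy_p)  # extra bits per symbol...
print(kl_divergence)              # ...equals D_KL(p || q)
```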