V_V comments on [link] Baidu cheats in an AI contest in order to gain a 0.24% advantage

V_V 14 Jun 2015 1:19 UTC
0 points

By your definition of meaningful information, it’s not actually clear that a strong lossless compressor wouldn’t discover and encode that meaningful information.

It could, but also it could not. My point is that compression ratio (that is, average log-likelihood of the data under the model) is not a good proxy for “understanding” since it can be optimized to a very large extent without modeling “meaningful” information.
- Daniel_Burfoot 14 Jun 2015 15:09 UTC
  0 points
  Parent
  Yes, good compression can be achieved without deep understanding. But a compressor with deep understanding will ultimately achieve better compression. For example, you can get good text compression results with a simple bigram or trigram model, but eventually a sophisticated grammar-based model will outperform the Ngram approach.