timtyler comments on Which are the useful areas of AI study?

timtyler 16 Jan 2011 10:16 UTC
0 points
How to compress is one question. How to tree-prune is a second, and what values machines should have is a third.

What I didn’t realise, until fairly recently, is the extent to which the first problem is key. It appears that it can usefully be factored out and solved independently.

General-purpose stream compression is a fairly specific computer science problem. However, it is true that we still need to figure out how to do it well.
- endoself 16 Jan 2011 18:51 UTC
  1 point
  Parent
  Having a very detailed understanding of general patterns in English gives you almost as much compression of a textbook as the additional ability to fully understand the subject matter. How does progress on the first ability get us closer to GAI?
  - timtyler 16 Jan 2011 19:27 UTC
    0 points
    Parent
    You need general purpose compression—not just text compression—if you are being general.
    
    Matt explains why compression is significant here.
    
    Obviously the quality of the model depends on the degree of compression. Poorer compression results in reduced understanding of the material.
    - endoself 16 Jan 2011 21:13 UTC
      1 point
      Parent
      In the link, he proves that optimal compression is equivalent to AI. He gives no reason why improving compression is the best approach to developing AI. The OP asked what areas of AI study are useful, not which ones would work in principle. You may have evidence that this is the best approach in practice, but you have not presented it yet.
      - timtyler 16 Jan 2011 22:07 UTC
        0 points
        Parent
        
        In the link, he proves that optimal compression is equivalent to AI.
        
        I don’t actually think that compression is equivalent to intelligence. However, it is pretty close!
        
        The OP asked what areas of AI study are useful, not which ones would work in principle. You may have evidence that this is the best approach in practice, but you have not presented it yet.
        
        My main presentation in that area is in the first of the links I gave:
        
        http://timtyler.org/machine_forecasting/
        
        Brief summary:
        
        Compression allows an easy way of measuring progress—an area which has been explored by Shane Legg. Also, it successfully breaks a challenging problem down into sub-components—often an important step on the way to solving the problem. Lastly, but perhaps most significantly, developing good quality stream compression engines looks like an easier problem than machine intelligence—and it is one which immediately suggests possible ways to solve it.
        
        I don’t know that it is the best approach in practice—just that it looks like a pretty promising one.
        endoself 17 Jan 2011 0:13 UTC
        0 points
        Parent
        It is not clear to me that what we are measuring is progress. We are definitely improving something, but that does not necessarily get us closer to GAI. Different algorithms could have very different compression efficiencies on different types of data. Some of these may require real progress toward AI, but many types of data can be compressed significantly with little intelligence. A program that could compress any type of non-random data could be improved significantly just by focusing on things that are easy to predict.
        timtyler 17 Jan 2011 7:23 UTC
        1 point
        Parent
        It is not clear to me that what we are measuring is progress.
        
        What is being measured is compression ratio on general Occamian data, defined w.r.t. a small reference machine. If in doubt, refer to Shane Legg: http://www.vetta.org/publications/
        
        A program that could compress any type of non-random data could be improved significantly just by focusing on things that are easy to predict.
        
        Not by very much usually, if there is lots of data. Powerful general purpose compressors would actually work well.
        
        Anyway, that is the idea—to build a general purpose Occamian system—not one with lots of preconceptions. Non-Occamian preconceptions don’t buy that much and are not really needed—if you are sufficiently smart and have a little while to look at the world and see what it is like.
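
        As a rough illustration of using compression ratio as a score, here is a minimal sketch. It uses Python's standard-library compressors on a fixed byte string, which is a simplification: it does not implement the "general Occamian data w.r.t. a small reference machine" measure mentioned above, and the function names are made up for this example.

```python
import bz2
import lzma
import zlib


def compression_ratio(data: bytes, compress) -> float:
    """Return original size divided by compressed size (higher is better)."""
    return len(data) / len(compress(data))


def score_compressors(data: bytes) -> dict:
    """Score a few standard-library compressors on one byte string."""
    compressors = {
        "zlib": lambda d: zlib.compress(d, 9),
        "bz2": lambda d: bz2.compress(d, 9),
        "lzma": lzma.compress,
    }
    return {name: compression_ratio(data, fn) for name, fn in compressors.items()}


if __name__ == "__main__":
    # Highly repetitive sample, so every compressor should score well above 1.
    sample = b"the quick brown fox jumps over the lazy dog. " * 2000
    for name, ratio in sorted(score_compressors(sample).items()):
        print(f"{name}: {ratio:.1f}x")
```

        A single corpus like this only ranks compressors on that corpus; the proposal above is about scoring over a broad range of data.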
        endoself 17 Jan 2011 10:13 UTC
        1 point
        Parent
        
        A program that could compress any type of non-random data could be improved significantly just by focusing on things that are easy to predict.
        
        Not by very much usually, if there is lots of data. Powerful general purpose compressors would actually work well.
        
        Huffman encoding also works very well in many cases. Obviously you cannot compress arbitrary compressible data without GAI, and no existing program can do so. Techniques like Huffman encoding do make numerical progress in compression, but they do not represent any real conceptual progress. Do you know of any compression programs that make conceptual progress toward GAI? If not, then why do you think that focusing on the compression aspect is likely to provide such progress?
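
        To make the Huffman point concrete, here is a minimal sketch of computing Huffman code lengths. It is a textbook-style construction, not any particular tool's implementation, and the function names are made up for this example. The coded size depends only on symbol frequencies, so reordering the input changes nothing.

```python
import heapq
from collections import Counter


def huffman_code_lengths(data: bytes) -> dict:
    """Return {byte value: code length in bits} for a Huffman code built from data."""
    freq = Counter(data)
    if not freq:
        return {}
    if len(freq) == 1:
        # A lone symbol still needs one bit per occurrence.
        return {next(iter(freq)): 1}
    # Heap entries are (weight, tiebreak, {symbol: depth}); the unique
    # tiebreak keeps Python from ever comparing the dicts.
    heap = [(w, i, {sym: 0}) for i, (sym, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tiebreak = len(heap)
    while len(heap) > 1:
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every symbol in them one level deeper.
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (w1 + w2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]


def huffman_bits(data: bytes) -> int:
    """Total payload bits when data is coded with its own Huffman code."""
    lengths = huffman_code_lengths(data)
    return sum(lengths[b] for b in data)
```

        For instance, `huffman_bits(b"abab" * 10)` equals `huffman_bits(b"aabb" * 10)`, even though the two inputs have different structure.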
        timtyler 17 Jan 2011 18:29 UTC
        2 points
        Parent
        Huffman coding dates back to 1952 - it has been replaced by other schemes in size-sensitive applications.
        
        Do you know of any compression programs that make conceptual progress toward GAI?
        
        Alexander Ratushnyak seems to be making good progress—see: http://prize.hutter1.net/
        endoself 18 Jan 2011 18:08 UTC
        1 point
        Parent
        I realize that Huffman coding is outdated; I was just trying to make the point that compression progress is possible without AI progress.
        
        Do you have Alexander Ratushnyak’s source code? What techniques does he use that teach us something that could be useful for GAI?
        timtyler 18 Jan 2011 19:04 UTC
        0 points
        Parent
        
        I was just trying to make the point that compression progress is possible without AI progress.
        
        If you have a compressor that isn’t part of a machine intelligence, or is part of a non-state of the art machine intelligence, then that is likely to be true.
        
        However, if your compressor is in a state of the art machine intelligence—and it is the primary tool being used to predict the consequences of its actions, then compression progress (smaller or faster), would translate into increased intelligence. It would help the machine to better predict the consequences of its possible actions, and/or to act more quickly.
        
        Do you have Alexander Ratushnyak’s source code?
        
        That is available here.
        
        What techniques does he use that teach us something that could be useful for GAI?
        
        Alas, delving in is beyond the scope of this comment.
        endoself 19 Jan 2011 3:42 UTC
        1 point
        Parent
        
        However, if your compressor is in a state of the art machine intelligence—and it is the primary tool being used to predict the consequences of its actions, then compression progress (smaller or faster), would translate into increased intelligence. It would help the machine to better predict the consequences of its possible actions, and/or to act more quickly.
        
        What techniques does he use that teach us something that could be useful for GAI?
        
        Alas, delving in is beyond the scope of this comment.
        
        Your arguments seem to be more inside-view than mine, so I will update my estimates in favour of your point.
        timtyler 19 Jan 2011 9:37 UTC
        1 point
        Parent
        I got something from you too. One of the problems with a compression-based approach to machine intelligence is that so far, it hasn’t been very popular. There just aren’t very many people working on it.
        
        Compression is a tough traditional software engineering problem. It seems relatively unglamorous, and there are some barriers to entry, in the form of a big mountain of existing work. Building on that work might not be the most direct way towards the goal, but unless you do that, you can’t easily make competitive products in the field.
        
        Sequence prediction (via stream compression) still seems like the number 1 driving problem to me—and a likely path towards the goal—but some of the above points do seem to count against it.
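
        The link between sequence prediction and stream compression can be sketched as follows. An ideal arithmetic coder spends about -log2(p) bits on a symbol that the model predicted with probability p, so a better predictor means a smaller output. The model below (adaptive byte counts with add-one smoothing over a 256-symbol alphabet) is an assumption chosen for brevity, not the method used by any contest entry, and the function name is made up for this example.

```python
import math
from collections import defaultdict


def ideal_code_length_bits(data: bytes, order: int) -> float:
    """Bits an ideal arithmetic coder would need when driven by an adaptive
    order-`order` byte model with add-one smoothing."""
    counts = defaultdict(lambda: defaultdict(int))  # context -> symbol -> count
    totals = defaultdict(int)                       # context -> symbols seen
    bits = 0.0
    for i, symbol in enumerate(data):
        # The context is the previous `order` bytes (shorter at the start).
        context = data[max(0, i - order):i]
        p = (counts[context][symbol] + 1) / (totals[context] + 256)
        bits += -math.log2(p)
        # Update the model only after coding, as a decoder would have to.
        counts[context][symbol] += 1
        totals[context] += 1
    return bits


if __name__ == "__main__":
    text = b"abababab" * 100
    for order in (0, 1):
        print(order, round(ideal_code_length_bits(text, order)))
```

        On the alternating sample, the order-1 model predicts each next byte better than the order-0 model and so needs fewer bits, which is the sense in which prediction quality and compression move together.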