Tags: compression, lossless-compression, text-compression

What is the current state of text-only compression algorithms?


In honor of the Hutter Prize, what are the top algorithms (and a quick description of each) for text compression?

Note: The intent of this question is to get a description of compression algorithms, not of compression programs.


Solution

  • The boundary-pushing compressors combine algorithms for insane results. Common algorithms include:

    Maximum Compression is a pretty cool benchmark site covering both text and general compression. Matt Mahoney publishes another benchmark, which may be of particular interest because it lists the primary algorithm used for each entry.
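
If you want a rough feel for how much the choice of algorithm family matters before digging into those benchmarks, here is a minimal sketch using only the codecs in Python's standard library. These are general-purpose compressors, not the boundary-pushing ones the benchmarks rank, and the `compressed_sizes` helper and the repeated sample sentence are made up for illustration. The labels reflect each codec's main building blocks: zlib (DEFLATE) pairs LZ77 with Huffman coding, bz2 is built around the Burrows-Wheeler transform, and lzma pairs an LZ77-style matcher with range coding.

```python
import bz2
import lzma
import zlib


def compressed_sizes(data: bytes) -> dict:
    """Return the compressed size in bytes for each stdlib codec.

    Hypothetical helper for illustration only; it is not taken from
    any of the benchmark sites mentioned above.
    """
    return {
        "zlib (LZ77 + Huffman)": len(zlib.compress(data, 9)),
        "bz2 (BWT + Huffman)": len(bz2.compress(data, 9)),
        "lzma (LZ77 + range coding)": len(lzma.compress(data, preset=9)),
    }


if __name__ == "__main__":
    # Made-up, highly repetitive sample text; real corpora compress far less.
    sample = ("the quick brown fox jumps over the lazy dog. " * 2000).encode("utf-8")
    for name, size in compressed_sizes(sample).items():
        print(f"{name}: {len(sample)} -> {size} bytes")
```

To get numbers that mean anything, swap the sample for a real text file (for example, `open(path, "rb").read()`), since a single repeated sentence flatters every codec.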