Sadly, the description of arithmetic coding bears almost no resemblance to the actual algorithm (it roughly describes the equal probability case, but that misses most of the point). The description of Shannon-Fano as "bottom up" and Huffman as "top-down" is also exactly backwards (the actual descriptions of those algorithms are accurate, but the labeling is confused).

The article contains a lot of terms you can search for if you are interested in these things, but sadly it is not very informative on its own.
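To spell out the distinction: Huffman construction is bottom-up because it starts from the individual symbols and repeatedly merges the two least-frequent subtrees, while Shannon-Fano splits the whole symbol set top-down. A minimal Python sketch of the Huffman side (the frequencies below are just example values):

    import heapq

    def huffman_codes(freqs):
        # Bottom-up: start with one leaf per symbol and repeatedly merge
        # the two least-frequent subtrees until a single tree remains.
        heap = [(f, i, sym) for i, (sym, f) in enumerate(freqs.items())]
        heapq.heapify(heap)
        tiebreak = len(heap)
        while len(heap) > 1:
            f1, _, left = heapq.heappop(heap)
            f2, _, right = heapq.heappop(heap)
            heapq.heappush(heap, (f1 + f2, tiebreak, (left, right)))
            tiebreak += 1
        codes = {}
        def walk(node, prefix):
            if isinstance(node, tuple):   # internal node: recurse into children
                walk(node[0], prefix + "0")
                walk(node[1], prefix + "1")
            else:                         # leaf: record the code for this symbol
                codes[node] = prefix or "0"
        walk(heap[0][2], "")
        return codes

    # Frequent symbols get short codes, rare ones long codes.
    print(huffman_codes({"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5}))

Shannon-Fano would instead take the full sorted symbol list and recursively split it into two halves of roughly equal total probability, assigning a bit at each split.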
An interesting read; I will need to go through it with more care when I get home. A couple of personal interests: I studied with both David Huffman (though not for compression) and Glen Langdon (one of the arithmetic coding pioneers), so I need to see how this article compares with my notes.

I also need to see about getting some of my old (15+ years) course notes online.

In just those two, much history and memory lost, and a real hit for UC Santa Cruz's computer engineering and science departments (Huffman passed away a number of years back, Glen this year after a period of retirement).

Edit: the scope of the article and the title provided... a serious disconnect in terms of breadth. But better than nothing.
If anyone is interested in the PAQ code, I have put one of the stronger but still open-source variants on GitHub, here: https://github.com/robertseaton/paq8pxd
This article is very good, but it says very little about gzip. Even if it introduced no technical innovation, I think gzip was really important for the development of compression software that is not patent-encumbered.
Seems like even though people went after this, there hasn't been much innovation in the last few decades. Most of the new stuff looks quite incremental. It almost seems like we are losing our edge in our ability to build from the ground up. This is because we are part of a system that keeps an eagle eye on new ideas. We need to take some of that pressure off so that we can think a bit outside the box.
Site seems to be down. Cached version: http://webcache.googleusercontent.com/search?q=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms&oq=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms
One interesting domain-specific class of compression algorithms not mentioned here is for lossless audio compression, which tends to use a different (though also pretty simple) technique, somewhat related to PPM. A common approach is to predict the waveform using linear predictive coding (LPC), and then entropy-code the residuals. FLAC does that, for example.
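To make the idea concrete, here is a minimal Python sketch of that pipeline, not FLAC's actual format: a fixed order-2 linear predictor followed by Rice coding of the residuals. The test signal, the predictor order, and the Rice parameter k are just illustrative choices.

    import math

    def fixed_predict_residuals(samples):
        # Fixed order-2 linear predictor (in the spirit of FLAC's fixed
        # predictors): predict x[n] as 2*x[n-1] - x[n-2].
        res = list(samples[:2])                  # warm-up samples kept verbatim
        for n in range(2, len(samples)):
            pred = 2 * samples[n - 1] - samples[n - 2]
            res.append(samples[n] - pred)
        return res

    def rice_encode(residuals, k=4):
        # Rice coding: small residuals get short codes, which is where the
        # compression comes from once the predictor has done its job.
        bits = []
        for r in residuals:
            u = 2 * r if r >= 0 else -2 * r - 1  # zigzag-map signed -> unsigned
            q, rem = u >> k, u & ((1 << k) - 1)
            bits.append("1" * q + "0" + format(rem, "0{}b".format(k)))
        return "".join(bits)

    # A smooth test signal: the predictor leaves only tiny residuals behind.
    samples = [int(2000 * math.sin(0.05 * n)) for n in range(1000)]
    coded = rice_encode(fixed_predict_residuals(samples))
    print(len(coded) // 8, "coded bytes vs", 2 * len(samples), "raw 16-bit bytes")

The same structure (predict, then entropy-code what the predictor missed) shows up across lossless audio codecs; they mostly differ in how the predictor coefficients are chosen and how the residual coder adapts.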
Are there compression techniques that are only "mostly" lossless? I was thinking something along the lines of: for any delta > 0 and N > 0, the probability that compressing and then decompressing a random stream of N bytes results in loss is less than delta?