TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

History of Lossless Data Compression Algorithms

146 pointsby _nullandnull_almost 11 years ago

17 comments

derf_almost 11 years ago
Sadly, the description of arithmetic coding bears almost no resemblance to the actual algorithm (it roughly describes the equal probability case, but that misses most of the point). The description of Shannon-Fano as &quot;bottom up&quot; and Huffman as &quot;top-down&quot; is also exactly backwards (the actual descriptions of those algorithms are accurate, but the labeling is confused).<p>The article contains a lot of terms you can search for if you are interested in these things, but sadly is not very informative on its own.
jmspringalmost 11 years ago
An interesting read, I will need to go through it with more care when I get home. A couple of personal interests are having studied with both David Huffman (though not for compression) and Glen Langdon (one of the arithmetic coding pioneers), I need to see how this article compares with my notes.<p>I also need to see about getting some of my old (15+ years) course notes online.<p>Just in those two, much history and memory lost and a hit for UC Santa Cruz&#x27;s computer engineering and science departments (Huffman passed a number of years back, Glen this year after a period of retirement).<p>Edit - the scope of the article and the title provided...a serious disconnect in terms of breadth. But, better than nothing.
评论 #8089015 未加载
adbgealmost 11 years ago
If anyone is interested in the PAQ code, I have put one of the stronger-but-open-source variants on GitHub, here: <a href="https://github.com/robertseaton/paq8pxd" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;robertseaton&#x2F;paq8pxd</a>
webreacalmost 11 years ago
This article is very good, but it is very short about gzip. Event if there was no technical innovation, I think that gzip was really important for development of compression softwares that are not patent-encumbered.
khitchdeealmost 11 years ago
Seems like even though people went after this, there&#x27;s not been much innovation, last few decades. Most of the new stuff looks quite incremental. It almost seems like we are losing our edge in our ability to build from the ground up. This is because we are part of a system that maintains an eagle&#x27;s eye on new ideas. We need to take some of that pressure off us so that we can think a bit outside the box.
评论 #8089847 未加载
评论 #8090707 未加载
评论 #8089899 未加载
评论 #8090132 未加载
评论 #8095734 未加载
heyalexejalmost 11 years ago
Site seems to be down. Cached version: <a href="http://webcache.googleusercontent.com/search?q=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms&amp;oq=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms" rel="nofollow">http:&#x2F;&#x2F;webcache.googleusercontent.com&#x2F;search?q=cache%3Aieeeg...</a>
_deliriumalmost 11 years ago
One interesting domain-specific class of compression algorithms not mentioned here is for lossless audio compression, which tends to use a different (though also pretty simple) technique, somewhat related to PPM. A common approach is to predict the waveform using linear predictive coding (LPC), and then entropy-code the residuals. FLAC does that, for example.
评论 #8090539 未加载
mistercowalmost 11 years ago
&gt; Arithmetic coding is arguably the most optimal entropy coding technique<p>Provably optimal, even (assuming infinite precision).
ellipticalmost 11 years ago
Are there compression techniques that are only &quot;mostly&quot; lossless? I was thinking something along the lines of: for delta, N &gt; 0 the probability that the compression &amp; decompression of a random stream of N bytes will result in loss is less than delta?
oakwhizalmost 11 years ago
I&#x27;m surprised that LZ4 is not in the list. It&#x27;s based on LZ77 like many of the others.
mariuoloalmost 11 years ago
They left out .zoo (LZW) and .lzh&#x2F;.lha (LZSS).<p>Both were quite widespread in the BBS era.
评论 #8090215 未加载
hcrispalmost 11 years ago
How does Snappy compression fit into the picture?
评论 #8089019 未加载
评论 #8091955 未加载
评论 #8089292 未加载
评论 #8089042 未加载
leftrightupdownalmost 11 years ago
so based on this, if someone wanted best compression program they would choose paq?
评论 #8089532 未加载
评论 #8090128 未加载
评论 #8089269 未加载
drydotalmost 11 years ago
I miss ARJ in the article
danieldrehmeralmost 11 years ago
insert weissman score joke here
SurfScorealmost 11 years ago
But what about Pied Piper?
评论 #8088986 未加载
thisjepisjealmost 11 years ago
...GIF is lossless?
评论 #8089301 未加载