History of Lossless Data Compression Algorithms

146 points by _nullandnull_, almost 11 years ago

17 comments

derf_, almost 11 years ago
Sadly, the description of arithmetic coding bears almost no resemblance to the actual algorithm (it roughly describes the equal-probability case, but that misses most of the point). The description of Shannon-Fano as "bottom up" and Huffman as "top-down" is also exactly backwards (the actual descriptions of those algorithms are accurate, but the labeling is confused).

The article contains a lot of terms you can search for if you are interested in these things, but sadly is not very informative on its own.
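
To make the labeling point concrete, here is a minimal sketch of why Huffman's construction is usually called bottom-up: it starts from the individual symbols and repeatedly merges the two lightest subtrees until one tree remains. The function name `huffman_code_lengths` and the sample frequencies are illustrative assumptions, not taken from the article.

```python
import heapq

def huffman_code_lengths(freqs):
    """Return {symbol: code length} built bottom-up by merging the two
    lightest subtrees at each step (hypothetical helper for illustration)."""
    # Heap entries: (weight, unique tiebreak, {symbol: depth in this subtree}).
    heap = [(w, i, {s: 0}) for i, (s, w) in enumerate(sorted(freqs.items()))]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every symbol in them one level deeper.
        merged = {s: d + 1 for s, d in {**d1, **d2}.items()}
        heapq.heappush(heap, (w1 + w2, counter, merged))
        counter += 1
    return heap[0][2]

lengths = huffman_code_lengths({"a": 5, "b": 2, "c": 1, "d": 1})
print(lengths)
```

Shannon-Fano goes the other way: it starts from the full symbol set and recursively splits it into two halves of roughly equal total weight, which is why it is the top-down one.
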
jmspring, almost 11 years ago
An interesting read; I will need to go through it with more care when I get home. It is of personal interest because I studied with both David Huffman (though not for compression) and Glen Langdon (one of the arithmetic coding pioneers), so I need to see how this article compares with my notes.

I also need to see about getting some of my old (15+ years) course notes online.

With just those two, much history and memory has been lost, and it is a hit for UC Santa Cruz's computer engineering and science departments (Huffman passed a number of years back, Glen this year after a period of retirement).

Edit: there is a serious disconnect in breadth between the scope of the article and its title. But, better than nothing.
adbge, almost 11 years ago
If anyone is interested in the PAQ code, I have put one of the stronger-but-open-source variants on GitHub, here: https://github.com/robertseaton/paq8pxd
webreac, almost 11 years ago
This article is very good, but it says very little about gzip. Even if it brought no technical innovation, I think gzip was really important for the development of compression software that is not patent-encumbered.
khitchdee, almost 11 years ago
Seems like even though people went after this, there hasn't been much innovation in the last few decades. Most of the new stuff looks quite incremental. It almost seems like we are losing our edge in our ability to build from the ground up. This is because we are part of a system that maintains an eagle's eye on new ideas. We need to take some of that pressure off us so that we can think a bit outside the box.
heyalexej, almost 11 years ago
Site seems to be down. Cached version: http://webcache.googleusercontent.com/search?q=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms&oq=cache%3Aieeeghn.org%2Fwiki%2Findex.php%2FHistory_of_Lossless_Data_Compression_Algorithms
_delirium, almost 11 years ago
One interesting domain-specific class of compression algorithms not mentioned here is lossless audio compression, which tends to use a different (though also pretty simple) technique, somewhat related to PPM. A common approach is to predict the waveform using linear predictive coding (LPC), and then entropy-code the residuals. FLAC does that, for example.
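
A minimal sketch of the predict-then-code-residuals idea, assuming a fixed second-order predictor (each sample is predicted by extrapolating a straight line through the previous two). This is an illustration only, not FLAC's actual encoder; the function names `residuals` and `reconstruct` and the sample values are made up for the example, and the entropy-coding step is left out.

```python
def residuals(samples):
    """Replace each sample with its prediction error.
    Prediction (an assumption for this sketch): s[n] ~ 2*s[n-1] - s[n-2]."""
    out = []
    for n, s in enumerate(samples):
        if n < 2:
            out.append(s)  # warm-up samples are stored verbatim
        else:
            out.append(s - (2 * samples[n - 1] - samples[n - 2]))
    return out

def reconstruct(res):
    """Invert residuals(): rebuild the exact original samples."""
    samples = []
    for n, r in enumerate(res):
        if n < 2:
            samples.append(r)
        else:
            samples.append(r + 2 * samples[n - 1] - samples[n - 2])
    return samples

wave = [0, 10, 20, 30, 41, 52]
res = residuals(wave)
print(res)                       # [0, 10, 0, 0, 1, 0]
print(reconstruct(res) == wave)  # True
```

On a smooth waveform the residuals cluster near zero, so an entropy coder can spend very few bits on each one, and because the decoder runs the same predictor the round trip is exact.
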
mistercow, almost 11 years ago
> Arithmetic coding is arguably the most optimal entropy coding technique

Provably optimal, even (assuming infinite precision).
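
A rough sketch of where the optimality comes from, using exact fractions to stand in for "infinite precision". Each symbol narrows the current interval in proportion to its probability, so the final width is the product of the symbol probabilities and `-log2(width)` is exactly the message's information content. The two-symbol model (probabilities 3/4 and 1/4) and the name `encode_interval` are assumptions for illustration; a real coder would use finite-precision integers and renormalization.

```python
from fractions import Fraction
import math

def encode_interval(message, probs):
    """Return (low, width) of the final interval [low, low + width)."""
    # Assign each symbol a sub-range of [0, 1) in sorted symbol order.
    cum, total = {}, Fraction(0)
    for sym in sorted(probs):
        cum[sym] = total
        total += probs[sym]
    low, width = Fraction(0), Fraction(1)
    for sym in message:
        low += width * cum[sym]   # move to the symbol's sub-range
        width *= probs[sym]       # shrink by the symbol's probability
    return low, width

probs = {"a": Fraction(3, 4), "b": Fraction(1, 4)}
low, width = encode_interval("aaab", probs)
print(low, width)          # 81/256 27/256
print(-math.log2(width))   # information content in bits, about 3.245
```

Any number inside the final interval identifies the message, and writing one down takes about `-log2(width)` bits plus a small constant, which is why the overhead per symbol vanishes as messages get longer.
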
elliptic, almost 11 years ago
Are there compression techniques that are only "mostly" lossless? I was thinking of something along the lines of: for delta, N > 0, the probability that compressing and decompressing a random stream of N bytes results in loss is less than delta?
oakwhiz, almost 11 years ago
I'm surprised that LZ4 is not in the list. It's based on LZ77 like many of the others.
mariuolo, almost 11 years ago
They left out .zoo (LZW) and .lzh/.lha (LZSS).

Both were quite widespread in the BBS era.
hcrisp, almost 11 years ago
How does Snappy compression fit into the picture?
leftrightupdown, almost 11 years ago
So based on this, if someone wanted the best compression program, they would choose PAQ?
drydot, almost 11 years ago
I miss ARJ in the article
danieldrehmer, almost 11 years ago
Insert Weissman score joke here.
SurfScore, almost 11 years ago
But what about Pied Piper?
thisjepisje, almost 11 years ago
...GIF is lossless?