Ask HN: Looking for compression algorithm that took 6GB to 800MB. Anyone know?

3 points by drvc33 over 9 years ago
Looking for a good media compression & archiving algorithm for an app I'm building. More specifically, I'm looking for an algorithm I used 10 years ago but whose name I have completely forgotten. Description: it was a self-extracting archive (about 800MB) that unpacked into a folder of over 6GB (mainly media: movies & audio). The media files themselves were already in compressed formats (MP3, MP4, etc.), so I was really impressed. The only drawback (as expected) was that it took over an hour to decompress on an Intel dual core. Does this kind of performance ring any bells for anyone? I just need a name.

7 comments

insoluble over 9 years ago
Call me a skeptic, but there is no way there was an algorithm available to humans 10 years ago that *reversibly* compressed 6GB of unique, already-compressed media files down to 800MB. The *only* way this could have happened is if there were shared files or shared segments between the files. For example, if a DVD had a bunch of audio tracks but some of the tracks were basically just direct copies of the others, then the compressor could recognise the similarity and capitalise thereon. For lossless compression of general data, 7-zip set on Ultra is probably the best available right now. On the other hand, algorithms such as FLAC or PNG work well for losslessly compressing uncompressed media.
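A minimal Python sketch of the point about shared segments, under stated assumptions: random bytes from `os.urandom` stand in for already-compressed media, and the standard-library `lzma` module stands in for whatever compressor the archive used. Neither choice comes from the thread, and the chunk size is arbitrary.

```python
import lzma
import os

CHUNK = 256 * 1024  # 256 KiB per stand-in "file" (arbitrary size for the demo)


def ratio(data: bytes) -> float:
    """Original size divided by LZMA-compressed size."""
    return len(data) / len(lzma.compress(data))


# Four different high-entropy "files": nothing for the compressor to exploit.
distinct = b"".join(os.urandom(CHUNK) for _ in range(4))

# One high-entropy "file" stored four times: the copies are pure redundancy.
duplicated = os.urandom(CHUNK) * 4

distinct_ratio = ratio(distinct)
duplicated_ratio = ratio(duplicated)

print(f"distinct files:   {distinct_ratio:.2f}:1")
print(f"duplicated files: {duplicated_ratio:.2f}:1")
```

The distinct case should come out at roughly 1:1 and the duplicated case at roughly 4:1, because the repeated copies fall inside the compressor's dictionary window. A 7.5:1 result on media would therefore suggest a lot of duplicated content rather than a better algorithm.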
ta808945 over 9 years ago
I did a quick internet search, and the only app people mention as being able to achieve such a compression ratio is KGB Archiver[1]. According to its Wikipedia page, it uses PAQ[2].

[1] https://en.wikipedia.org/wiki/KGB_Archiver
[2] https://en.wikipedia.org/wiki/PAQ6
ChuckMcM over 9 years ago
We used to joke at NetApp that the oil and gas industry had the best compression algorithm: it could compress 100TB of seismic imaging data into a single bit {oil / no-oil} :-)

I created a theoretical compressor, which I haven't yet been able to implement, that uses the fact that every sequence of bits appears in pi somewhere, so my compressor would just return the digit offset and the length of the data. I keep looking for a source for all the digits of pi, though, and have yet to find it.
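For anyone who wants to play with the pi idea, here is a toy Python sketch. It is only an illustration under assumptions of my own: it works on decimal digits rather than bits, searches just the first 2000 decimals, and computes them with Machin's formula (the helper names `arctan_inverse`, `pi_digit_string`, `encode` and `decode` are made up for this example). It also relies on the sequence actually being present in the digits searched.

```python
def arctan_inverse(x: int, unity: int) -> int:
    """arctan(1/x) scaled by `unity`, via the Taylor series in integer arithmetic."""
    term = unity // x
    total = term
    x_squared = x * x
    divisor = 3
    sign = -1
    while term:
        term //= x_squared
        total += sign * (term // divisor)
        sign = -sign
        divisor += 2
    return total


def pi_digit_string(decimals: int) -> str:
    """'3' followed by `decimals` decimals of pi, using Machin's formula."""
    guard = 10  # extra digits to absorb rounding error in the series
    unity = 10 ** (decimals + guard)
    pi_scaled = 4 * (4 * arctan_inverse(5, unity) - arctan_inverse(239, unity))
    return str(pi_scaled // 10 ** guard)


def encode(pattern: str, digits: str):
    """Return (offset, length) of `pattern` in `digits`, or None if it is absent."""
    offset = digits.find(pattern)
    return None if offset < 0 else (offset, len(pattern))


def decode(pointer, digits: str) -> str:
    """Recover the original digits from an (offset, length) pointer."""
    offset, length = pointer
    return digits[offset:offset + length]


DIGITS = pi_digit_string(2000)
pointer = encode("26535", DIGITS)
print(pointer, decode(pointer, DIGITS))

# How many of the 1000 possible three-digit strings show up in this prefix?
covered = sum(encode(f"{i:03d}", DIGITS) is not None for i in range(1000))
print(f"{covered} of 1000 three-digit strings found in {len(DIGITS)} digits")
```

The catch is a counting one: there are 10^k different k-digit strings, so offsets able to address all of them need about k digits on average, and the pointer ends up no shorter than the data.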
rahimnathwani over 9 years ago
I don't believe there is any known lossless algorithm which can achieve 7.5:1 compression on already-lossy-compressed media files.

If you want further lossy compression for existing media files, look at the newest algorithms supported by ffmpeg.

If you want generic lossless compression, you can do a bit better than the usual suspects (gzip et al), but only if you're willing to put up with very slow compression times.

If you have some other type of specific data (e.g. sparse files) then you could do something custom, but I guess this is unlikely to be your situation.
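A rough way to see the size-versus-time trade-off for generic lossless compression is to run a few Python standard-library codecs over the same input. This is a sketch under assumptions: the word-soup `sample` is synthetic text invented for the demo, not media, and the chosen levels are just one reasonable setting each.

```python
import bz2
import lzma
import random
import time
import zlib

# Synthetic, compressible sample (an assumption for the demo, not real media).
random.seed(0)
WORDS = [b"archive", b"media", b"codec", b"stream", b"frame", b"audio", b"video", b"bitrate"]
sample = b" ".join(random.choice(WORDS) for _ in range(100_000))

# name -> (compress function, decompress function)
codecs = {
    "zlib level 9 (DEFLATE, as in gzip)": (lambda d: zlib.compress(d, 9), zlib.decompress),
    "bz2 level 9": (lambda d: bz2.compress(d, 9), bz2.decompress),
    "lzma preset 6 extreme": (
        lambda d: lzma.compress(d, preset=6 | lzma.PRESET_EXTREME),
        lzma.decompress,
    ),
}

results = {}
for name, (compress, decompress) in codecs.items():
    start = time.perf_counter()
    packed = compress(sample)
    elapsed = time.perf_counter() - start
    assert decompress(packed) == sample  # lossless round trip
    results[name] = (len(packed), elapsed)
    print(f"{name:36s} {len(packed):8d} bytes  {elapsed:.3f} s")
```

Exact sizes and timings depend on the data and the machine. Swapping in bytes from an MP3 or MP4 file should show all three codecs landing close to the original size, which is the situation described above.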
DanBC over 9 years ago
You can look through the software here: http://www.maximumcompression.com/benchmarks/benchmarks.php

Or the software linked via here: http://prize.hutter1.net/

As other people say, what you're asking for probably isn't possible.
drvc33 over 9 years ago
I wanted to add: The self-extractor had no interface -- it was all in the Windows command prompt. A message in the prompt said something about "Media compressor" and below it was the percentage extracted.
toreriklinnerud over 9 years ago
Pied Piper?