Ask HN: Compression with common binary database

2 points by tarikozket, about 8 years ago

Today I realized something. Humans compress their communication by relying on things they both already know. For example, if you know a joke and your friend knows the same joke, you can remind him/her of it at any moment by saying just one word of it.

Today's compression algorithms work as if nobody knows the joke we are about to tell: they try to choose shorter words instead of just outputting one word of the joke.

What if we analyzed many compressed files, found the most common binary patterns, and created a new compression algorithm that uses these patterns to tell the joke with only a word? The person who wants to decompress the file would need to have the same database as well.
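For anyone who wants to try the idea, here is a minimal sketch using the preset-dictionary support in Python's standard `zlib` module. The `SHARED_DICT` contents and the helper names `compress_with_dict` and `decompress_with_dict` are made up for illustration; a real shared database would be built from a large corpus, which this sketch does not attempt.

```python
import zlib

# Hypothetical shared "database": bytes that both sender and receiver
# already hold before any message is exchanged (the joke both friends know).
SHARED_DICT = b"Why did the chicken cross the road? To get to the other side."


def compress_with_dict(data: bytes, shared: bytes = SHARED_DICT) -> bytes:
    """Compress data, letting DEFLATE reference bytes in the shared dictionary."""
    compressor = zlib.compressobj(level=9, zdict=shared)
    return compressor.compress(data) + compressor.flush()


def decompress_with_dict(blob: bytes, shared: bytes = SHARED_DICT) -> bytes:
    """Decompress a blob produced by compress_with_dict with the same dictionary."""
    decompressor = zlib.decompressobj(zdict=shared)
    return decompressor.decompress(blob) + decompressor.flush()


# The message happens to be the joke itself, so almost all of it can be
# encoded as a back-reference into the shared dictionary.
message = b"Why did the chicken cross the road? To get to the other side."

plain = zlib.compress(message, 9)      # no shared knowledge
primed = compress_with_dict(message)   # both sides know the joke

print(len(message), len(plain), len(primed))
```

The receiver can only recover the message with the same dictionary bytes, which matches the requirement that the person decompressing must have the database too. The gain depends entirely on how much the message overlaps with the dictionary.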

2 comments

PaulHoule, about 8 years ago
See https://en.wikipedia.org/wiki/Brotli
throwaway_374, about 8 years ago
Great in concept until you run into the pigeonhole principle. Such compression schemes are fundamentally flawed, and you can waste years on them until some corner case trips you up. Nonetheless, I'd encourage you to satisfy yourself by trying to devise such an elaborate scheme... Pied Piper may have a job opening for you.
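The counting argument behind the pigeonhole principle can be checked in a few lines. This is only an illustrative sketch, and the function names `count_strings` and `count_shorter` are made up here: there are 2^n bit strings of length n but only 2^n - 1 strings that are strictly shorter, so no lossless scheme can map every n-bit input to a shorter output.

```python
def count_strings(n_bits: int) -> int:
    """Number of distinct bit strings of exactly n_bits bits."""
    return 2 ** n_bits


def count_shorter(n_bits: int) -> int:
    """Number of distinct bit strings with length 0 .. n_bits - 1."""
    return sum(2 ** k for k in range(n_bits))


n = 8
inputs = count_strings(n)    # possible 8-bit inputs
outputs = count_shorter(n)   # possible strictly shorter outputs

# There is always one more input than available shorter outputs, so at
# least two inputs would have to share an output, or one must not shrink.
print(inputs, outputs, inputs - outputs)
```

What the argument shows is that some inputs cannot get smaller under any lossless scheme, with or without a shared database.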