TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Analyzing unknown binary files using information entropy

32 pointsby egorstabout 10 years ago

3 comments

woliveirajrabout 10 years ago
There is some technique called &quot;normalized compression distance&quot; that does sort of it. It uses compression to compare how similar some data is to some another.<p>For a similar problem, you can work like it was answered here: <a href="http:&#x2F;&#x2F;reverseengineering.stackexchange.com&#x2F;questions&#x2F;2897&#x2F;tool-or-data-for-analysis-of-binary-code-to-detect-cpu-architecture&#x2F;2900#2900" rel="nofollow">http:&#x2F;&#x2F;reverseengineering.stackexchange.com&#x2F;questions&#x2F;2897&#x2F;t...</a>
snarfyabout 10 years ago
I always thought this idea could be greatly expanded upon.<p>I&#x27;ve seen it used to guess the native language of a text file based on the compressed input. I always believed this could be used as a sort of universal translator. You could compress the audio sounds of birds, throw this algorithm at it, and extract meaningful content.
评论 #9546830 未加载
rasz_plabout 10 years ago
Cantor.Dust - the future was here, but turned out to be vaporware :(<p><a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=4bM3Gut1hIk" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=4bM3Gut1hIk</a>