TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Logswan – Fast Web log analyzer using probabilistic data structures

35 pointsby mulanderover 9 years ago

2 comments

ar7hurover 9 years ago
If you&#x27;re interested in the probabilistic approach, this is how it works: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;HyperLogLog" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;HyperLogLog</a><p>&quot;The basis of the HyperLogLog algorithm is the observation that the cardinality of a multiset of uniformly-distributed random numbers can be estimated by calculating the maximum number of leading zeros in the binary representation of each number in the set. If the maximum number of leading zeros observed is n, an estimate for the number of distinct elements in the set is 2^n.&quot;
cwilkesover 9 years ago
If anyone involved in the project is reading this the DNS entry for &quot;www.logswan.org&quot;, available as a link on the github page, does not exist.
评论 #10312711 未加载