TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Logswan – Fast Web log analyzer using probabilistic data structures

35 点作者 mulander超过 9 年前

2 条评论

ar7hur超过 9 年前
If you&#x27;re interested in the probabilistic approach, this is how it works: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;HyperLogLog" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;HyperLogLog</a><p>&quot;The basis of the HyperLogLog algorithm is the observation that the cardinality of a multiset of uniformly-distributed random numbers can be estimated by calculating the maximum number of leading zeros in the binary representation of each number in the set. If the maximum number of leading zeros observed is n, an estimate for the number of distinct elements in the set is 2^n.&quot;
cwilkes超过 9 年前
If anyone involved in the project is reading this the DNS entry for &quot;www.logswan.org&quot;, available as a link on the github page, does not exist.
评论 #10312711 未加载