TechEcho

hurinabout 10 years ago

// The InverseBloomFilter may report a false negative but can never report a false positive.This is actually not true and not possible. There is a good explanation here: <a href="http://cstheory.stackexchange.com/a/14455/43" rel="nofollow">http://cstheory.stackexchange.com/a/14455/43</a>There will be a low probability of false positives ~ 2^32*k. (Also it's not an inverse-bloom filter, it's a cache).

评论 #9360704 未加载

评论 #9360730 未加载

评论 #9361229 未加载

评论 #9360744 未加载

wcdolphinabout 10 years ago

That InverseBloomFilter sounds like an LRU cache of hashes. A true inverse bloom filter can be proven to be impossible, sadly.

Analog24about 10 years ago

With enough data, a traditional Bloom filter "fills up", after which it has a false-positive probability of 1.I don't think this correct. The false-positive probability wouldn't become 1, it's just that every query would return positive, regardless if it's a true or false positive. I think what you meant to say was that the positive probability is 1, not the false-positive probability.

natchabout 10 years ago

Why does the readme use Boom and Bloom interchangeably? Is it just a typo that for some reason is randomly scattered throughout the document (and in the repo name itself)? Or is there meant to be a difference? I don't see anywhere that the difference is explained, if it's intentional.

评论 #9360511 未加载

eternalbanabout 10 years ago

You should checkout Tyler's blog. Good solid content for the concurrency geeks.

Show HN: BoomFilters – Probabilistic data structures for processing streams

5 comments

Show HN: BoomFilters – Probabilistic data structures for processing streams

5 comments