TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

BookKeeper: High-availability scalable distributed logging

17 pointsby mad44over 10 years ago

2 comments

kylequestover 10 years ago
Kafka can be compared to HedWig, which uses BookKeeper internally to store data.<p>HedWig is closer to the traditional message broker model where the broker (Hub in case of HedWig) keeps track of the subscriptions and what&#x27;s been consumed so far. Kafka, on the other hand, uses a stateless broker model where the consumers maintain the subscription state about what has been consumed.<p>HedWig Hubs keep track of all subscriptions and once all consumers &quot;consume&quot; a given message Hubs delete the message. Kafka doesn&#x27;t do that. It allows its consumers to start all over again even if the messages have been consumed (as long as the message is not too old).<p>HedWig is also slower because of its focus on high durability. Earlier versions of Kafka didn&#x27;t care about durability as much, so Kafka was much faster.<p>HedWig is also design to work with a large number of topics and a few consumers for those topics. Kafka can do a better job supporting a large number of consumers (given its stateless broker design).
t1mover 10 years ago
The post mentions Tango, which has a novel consistency algorithm, but doesn&#x27;t mention how BookKeeper differs from Kafka (also an Apache project). Can anyone comment on the difference?