TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Replica Strategy in Hdfs Is Not Good Enough

2 点作者 garfee超过 11 年前

1 comment

brugidou超过 11 年前
Comparing to mongodb is a joke.<p>However some more advanced strategies should be applied for very large hdfs clusters. The rack aware strategy is actually better than what is described because the probability distribution is not perfectly uniform. It all depends on the hardware, the location... Etc. But with a very large number of blocks the probability of loosing data with 3 nodes failure is close to 1 unfortunately.<p>We could try to imagine a better strategy having replicas in cliques of nodes to mitigate the risk. Its a tradeoff of loosing more data with less probability or less data with high probability I guess? Haven&#x27;t done the math :)