Ask HN: What is the architectural legacy of Hadoop?

1 point by ibains, almost 4 years ago
I was thinking through what the computer systems / architectural legacy of Hadoop is.

In databases, the physical query plan streams data. Hadoop seems to have contributed the idea that, when running on cheap hardware or at scale, you can write intermediate results to files so that failures are handled better.

Is there anything more to it?
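
To make the "write intermediate results to files" idea concrete, here is a minimal Python sketch. It is not Hadoop code: the stage names, the JSON file layout, and the helpers `run_stage`, `run_pipeline`, and `demo` are all made up for illustration. Each stage materializes its output to a file, so a rerun after a failure picks up from the last completed stage instead of starting over, which a purely streaming plan cannot do.

```python
import json
import os
import tempfile


def run_stage(name, func, input_path, workdir):
    """Run one stage and write its output to a file; skip it if the file exists."""
    output_path = os.path.join(workdir, f"{name}.json")
    if os.path.exists(output_path):
        # An earlier attempt already finished this stage: reuse its output.
        return output_path
    with open(input_path) as f:
        records = json.load(f)
    result = func(records)
    # Write under a temporary name, then rename, so a crash never leaves
    # a half-written file that looks like a finished stage.
    tmp_path = output_path + ".tmp"
    with open(tmp_path, "w") as f:
        json.dump(result, f)
    os.replace(tmp_path, output_path)
    return output_path


def run_pipeline(stages, input_path, workdir):
    """Chain stages through files; each stage reads the previous stage's output."""
    path = input_path
    for name, func in stages:
        path = run_stage(name, func, path, workdir)
    with open(path) as f:
        return json.load(f)


def demo():
    """Simulate one failure in the second stage, then rerun the pipeline."""
    calls = {"double": 0, "total": 0}

    def double(records):
        calls["double"] += 1
        return [r * 2 for r in records]

    def flaky_total(records):
        calls["total"] += 1
        if calls["total"] == 1:
            raise RuntimeError("simulated worker failure")
        return [sum(records)]

    workdir = tempfile.mkdtemp()
    input_path = os.path.join(workdir, "input.json")
    with open(input_path, "w") as f:
        json.dump([1, 2, 3], f)

    stages = [("double", double), ("total", flaky_total)]
    try:
        run_pipeline(stages, input_path, workdir)  # first attempt fails in "total"
    except RuntimeError:
        pass
    result = run_pipeline(stages, input_path, workdir)  # rerun resumes from file
    return result, calls
```

In `demo()`, the first stage runs only once even though the pipeline is launched twice; the second launch reads `double.json` from disk and only retries the stage that failed. The trade-off is the extra disk I/O at every stage boundary, which is the cost streaming plans avoid.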

2 comments

PaulHoule, almost 4 years ago
There is HDFS, which has aged better, and the old MapReduce 'query' processing system, which has aged worse. (Replaced by Spark and about 10 other things.)

There is a large supply of firms that would like to dethrone HDFS, because they think customers think that paying to 3x replicate the data is too much. (The winner is Amazon S3 where you pay even more!)

Maybe the scene has changed, Ceph has made some inroads, but HDFS has the amazing property of being almost as fast running in degraded mode as it is normally, thus being fast enough that it can regrade faster than it degrades.

A big cluster is going to be partially degraded a lot so it matters.
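
As a rough illustration of the degraded-mode point in the comment above, here is a toy Python model of 3x block replication. It is only a sketch under simplifying assumptions: random placement stands in for HDFS's real rack-aware policy, and `place_blocks`, `fail_node`, `readable`, and `re_replicate` are hypothetical helpers, not HDFS APIs. It shows that losing one node leaves every block readable from a surviving replica, and that the missing copies can be rebuilt on the remaining nodes.

```python
import random

REPLICATION = 3  # matches the "3x replicate" figure mentioned in the comment


def place_blocks(num_blocks, nodes, replication=REPLICATION, seed=0):
    """Assign each block to `replication` distinct nodes (random toy placement)."""
    rng = random.Random(seed)
    return {b: set(rng.sample(nodes, replication)) for b in range(num_blocks)}


def fail_node(placement, node):
    """Drop every replica stored on the failed node."""
    for replicas in placement.values():
        replicas.discard(node)


def readable(placement):
    """True if every block still has at least one live replica."""
    return all(len(replicas) >= 1 for replicas in placement.values())


def re_replicate(placement, live_nodes, replication=REPLICATION):
    """Copy under-replicated blocks to other live nodes; return copies made."""
    copied = 0
    for replicas in placement.values():
        candidates = [n for n in live_nodes if n not in replicas]
        while len(replicas) < replication and candidates:
            replicas.add(candidates.pop())
            copied += 1
    return copied


def demo():
    nodes = [f"n{i}" for i in range(6)]
    placement = place_blocks(100, nodes)
    lost = sum(1 for replicas in placement.values() if "n0" in replicas)

    fail_node(placement, "n0")
    still_readable = readable(placement)  # degraded, but every block is served

    live_nodes = [n for n in nodes if n != "n0"]
    copied = re_replicate(placement, live_nodes)
    return placement, lost, still_readable, copied
```

The storage cost is the flip side: with a replication factor of 3, the cluster holds three raw bytes for every logical byte, which is the overhead the would-be HDFS replacements complain about.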
CarbonCycles, almost 4 years ago
Distributed processing at massive scale... used for data transformation, stateless algorithm deployment, and data mining/discovery at scale. If memory serves me correctly, it is still used at eBay, Google and other companies for very low-level work.