Ask HN: Reference or Reading materials for Learning Hadoop

5 points by vbv over 12 years ago
I am trying to learn Hadoop and was wondering whether there are any references, tutorials, or papers that HNers use that would make learning Hadoop simpler, more efficient, and more productive.

1 comment

eshvk over 12 years ago
So learning Hadoop can be split up into several pieces:

1. Learning the idea of MapReduce. This is fairly easy, and you could browse through the original research paper and figure that out.

2. Learning the weird, wild animal called Hadoop (with its multiple API clusterfuck). This is going to be much harder. Presuming you know Java, the first thing you want to do is get a Cloudera VM (because you don't really want to spend time learning how to install Hadoop at first) and start figuring out how to build Word Count inside the VM. This should give you some insight (not much, though) into how the API works.

3. Figure out more complicated stuff you want to do with Hadoop and start working on it. Get a copy of Tom White's Hadoop book (from what I remember six months back, the API was hopelessly outdated, but the ideas are awesome) and Jimmy Lin's book on text processing with MapReduce ( http://lintool.github.com/MapReduceAlgorithms/ ). Personally, I loved Jimmy's book not because of the machine learning content but because of the design patterns for Hadoop that he had embedded in there.
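
To make the MapReduce idea in point 1 and the Word Count exercise in point 2 a bit more concrete, here is a rough plain-Java sketch of the map, shuffle, and reduce steps. It deliberately does not use the Hadoop API; the class name `WordCountSketch` and its method names are made up for illustration, and a real Hadoop job would express the same steps through Hadoop's own mapper and reducer classes and run them across a cluster.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java illustration of the MapReduce word-count idea.
// Hypothetical names throughout; this is NOT the Hadoop API.
class WordCountSketch {

    // Map phase: turn one input line into a list of (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                pairs.add(Map.entry(token, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group every emitted value under its key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: collapse all values for one key into a single count.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    // Driver: run map over every line, shuffle, then reduce each group.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(emitted).entrySet()) {
            result.put(group.getKey(), reduce(group.getKey(), group.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // Prints {brown=1, dog=1, fox=2, lazy=1, quick=1, the=3}
        System.out.println(wordCount(List.of("the quick brown fox", "the lazy dog", "The fox")));
    }
}
```

Everything here runs in a single process, so it only shows the shape of the computation: in Hadoop, the map and reduce functions stay this small while the framework handles splitting the input, the shuffle, and distribution across machines.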