TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Sorting Petabytes with MapReduce

33 pointsby abrahamover 13 years ago

3 comments

abtinfover 13 years ago
Google keeps bragging about all their internal capabilities because they want to hire people. But so what? If I had a real reason to, I could fire up a couple thousand machines on amazon and analyze data akimbo.<p>In a sense, google is worse than microsoft - they really don't share any of their hardcore cs innovations. At least MS is in the business of selling technology. Google is in the business of hoarding it in order to derive competitive advantage in advertising.<p>I just wrote up an entry about this at <a href="http://news.ycombinator.com/item?id=2972368" rel="nofollow">http://news.ycombinator.com/item?id=2972368</a>
评论 #2972604 未加载
moultanoover 13 years ago
One of my favorite parts of my job is the ability to grab thousands of machines at low priority for the hell of it.
评论 #2972456 未加载
评论 #2972505 未加载
brandonbover 13 years ago
Their benchmarks seem a little odd. They claim an 11x speed improvement, compared to using half as many computers three years ago. But you'd expect roughly an order of magnitude improvement anyway, just due to Moore's law.<p>I'd be really curious to see what part of the speedup is due to software optimizations, i.e., compare the 2011 software with the 2008 software on identical hardware.
评论 #2972889 未加载
评论 #2972901 未加载