TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Eric Schmidt’s "5 Exabytes" Quote is a Load of Crap

112 pointsby robertjmooreover 14 years ago

12 comments

ajaysover 14 years ago
The figure I've heard is that the <i>data</i> generated doubles every year (here, "data" can mean web pages, logs, transactions, etc.) . Therefore, it follows that every year we create as much data as in all the previous years combined ( sum_i 2^i = 2^(i+1) ).<p>If we created X amount of data in 2003, then, 7 years later, we're creating 128X as much data; which roughly works out to X every 3 days.
评论 #2190488 未加载
评论 #2189593 未加载
评论 #2191539 未加载
评论 #2191695 未加载
burgerbrainover 14 years ago
<i>Based on the primary sources I’ve been able to piece together, the more accurate (but far less sensational) quote would be:<p>"23 Exabytes of information was recorded and replicated in 2002. We now record and transfer that much information every 7 days."</i><p>Call me crazy, but that sounds every bit just as sensational to me. Seems like all this article is doing is getting overly picking with some throwaway oft-repeated trivia stat. Who cares what the exact numbers are? The purpose of the statement remains the same.
评论 #2190086 未加载
评论 #2191083 未加载
评论 #2189666 未加载
joubertover 14 years ago
Interesting statistic: <i>It has been said that 78% of all statistics are made up.</i>
评论 #2189595 未加载
corin_over 14 years ago
I wonder if anyone would be able to calculate the amount of data created in the last two millennia... and if so, how.
评论 #2189723 未加载
评论 #2189618 未加载
fxjover 14 years ago
information is not all equal. recording from /dev/random is not valuable information even though it fills up disk space. the value of information depends very much on the context.
Tichyover 14 years ago
A lot might have happened since 2002. People with digital cameras take a lot of pictures, for example. YouTube is booming. Lot's of devices generate automatic data feeds, for example location tracking from mobile phones, clickstreams on the internet.<p>The number might still have been made up, but let's not forget that Schmidt might have some sources of information no available to the public, for example the server stats from Google and YouTube.
dvdtover 14 years ago
How timely! I was actually at a Google recruiting event/tech talk today at my university, where a Google engineer repeated this quote to us. Fittingly, he also misquoted it and said that 5 exabytes of data are created every day, instead of every two days as in the original quote. I looked at him askance for a moment due to the absurdity of the number--thanks for clearing it up!
HyprMusicover 14 years ago
Perhaps the figures he was given were based entirely on computer data - and he quoted them to sound like all data?
JacobAldridgeover 14 years ago
"We now create and replicate as much data in one week, as we did in one year, just a decade ago."<p>True, not as catchy as the dawn of time, but still mighty impressive. And in fairness to their outgoing CEO, Google didn't cache much data at the dawn of time (or even in the '80s), so it can't have been <i>that</i> important.
patrickgzillover 14 years ago
My tummy rumbled and I burped at 9:22AM EST this morning. Now that I have posted this: is that a piece of information?<p>My point is that a lot of this "information" is ephemeral and not really all that important in the long run.
评论 #2189679 未加载
maeon3over 14 years ago
Yes we are creating more data now than in the history of mankind. However the ratio of (quality stored data / total data ) has gone down with the ease of storage. Most of the "data" is for entertainment.
JonnieCacheover 14 years ago
Lies, damn lies, and clichés.