TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

How Search Works

324 点作者 vijaydev大约 12 年前

28 条评论

philsnow大约 12 年前
I missed most of the content on this ... page ? Exhibit ? Installation ? whatever it's called, because it told me to scroll, I did, and I scrolled through a bunch of what looks like empty space and arrived at the end ("and that's how search works"). The user is apparently supposed to stop and watch some animation at certain places, but it's not clear where to stop scrolling.<p>Perfect example, near the top there's some text about "It's made up of over[........] 30 TRILLION[.........] INDIVIDUAL PAGES[........] and it's constantly growing." But there's nothing to indicate that I should stop somewhere and wait for some more text to show up.<p>Maybe they should limit how far down you can scroll by setting the height of some element, and only increase it when the animation is finished.<p>Edit: the key problem here isn't the "scrolling makes things happen" gimmick that's popular lately. the problem is that it starts certain animations or fade-ins some time after I've already skipped past an apparently blank space.
评论 #5308558 未加载
评论 #5307989 未加载
评论 #5311163 未加载
dangrossman大约 12 年前
The most interesting thing there is the live view of the most recently deleted webspam. I wonder what blackhat SEO firms can learn from that to better avoid the filters.
评论 #5306808 未加载
评论 #5306837 未加载
area51mafia大约 12 年前
It's nice overall, but the timing for making items appear is a little slow. I was past most headers by the time they appeared, and I don't think I scroll too incredibly fast.
评论 #5305788 未加载
franze大约 12 年前
thx matt and the google search team for doing this. it's nothing new for technically inclined people, but every little bit helps. helps for what? teaching people to worry about the right aspects of search and the impact on their business, instead of worrying about bullshitphrases that were planted in their head by a SEO agency key account or a blogpost from 2008. so well yes, thx for doing this. i will send it to my clients (and tell them to click on the bubbles, even though they don't look clickable)<p>now an anecdote (because i feel like telling one): this week started for me with an interview that finally got published <a href="http://werbeplanung.at/news/marketing/2013/02/interview-mit-franz-enzenhofer" rel="nofollow">http://werbeplanung.at/news/marketing/2013/02/interview-mit-...</a> (it's german) in that interview i claimed that<p>* 80% of everything written about SEO and Google is bullshit<p>* that all the rumors, tipps and trends are actually hurting business<p>* that we should treat SEO as a numbers based craft of constant optimizations<p>* instead of the esoteric bullshit art it is currently<p>* and, if search traffic is important for the success of a business, they must rid themselves of external (agency) dependencies and develop internal structures<p>nothing to far fetched i think. everybody knows the SEO vertical is full of bullshit, i just took some time to estimate a number (based on a random sample of collected blogposts (that at least one person tweeted about))<p>yeah, i got a lot of angry emails, skype messages, linkedin messages, xing messages after the interview was published.<p>most of them mentioned at least one of these words<p><pre><code> * pagerank * whitehat * blackhat * grayhat * linkjuice * panda * pinguin ... </code></pre> so yeah, thx google for educating people about search. keep up the good work.
评论 #5309073 未加载
评论 #5309667 未加载
tmoertel大约 12 年前
Has anyone deciphered the fat-mustache diagram in the "Query Understanding" circle? It's in the Algorithms section.<p>At first I thought it was supposed to represent a Gaussian-like probability distribution. But when I clicked on it, the resulting animation showed a series of such distributions getting flattened by some kind of distribution-flattening hydraulic press. The accompanying caption: "Gets to the deeper meaning of the words you type."<p>If I was confused before, now I was completely lost.<p>How is deeper meaning represented by distribution flattening? I'd think it would be just the opposite, raising probability mass around the likely meanings, not spreading it out into a uniform distribution over all meanings.<p>Baffling.<p>If anyone has figured it out, please do share.<p>(Maybe I'm taking the diagrams too seriously.)<p>EDITED TO ADD: New option: If you don't have any clue what it means either, come up with an entertaining <i>yet plausible</i> story that fits the hydraulic-press-vs-mustaches animation and share that story instead.<p>EDITED TO ADD: Example: At Google’s new eco-friendly data centers, NLP computations are performed by genetically enhanced inchworms. Difficult queries, however, can cause the inchworms to get cricks in their backs. In such cases, Google’s innovative back-massager descends and restores the inchworms to their preferred position (prone), from which they can return to their computations with renewed vigor.
评论 #5306930 未加载
评论 #5306877 未加载
评论 #5308018 未加载
评论 #5306956 未加载
评论 #5306892 未加载
dylangs1030大约 12 年前
I don't know what to take from this.<p>That search is very complex (I knew that, but not with this technical detail).<p><i>Or</i>...that Google is trying very hard to maintain user interest with gimmicky shows of why it's cool and cutting edge and necessary.<p>Not that Google isn't those things...this just seems like an unnecessary expenditure of time. We know it's complex Google. Improve some other features and stop shutting others down instead of making these web 2.0 animations.
评论 #5309268 未加载
jojopotato大约 12 年前
Interesting that they show the approximate number of searches / second at the bottom. Is that an otherwise publicly available number?
eykanal大约 12 年前
I was halfway through before I realized that some of the content was clickable.<p>Very nice page, though.
JDDunn9大约 12 年前
Their characterization of their spam procedures is grossly misleading. They do not send emails to most people that have been penalized, nor do they give clear instructions on how people can fix their sites.<p>Thousands of small sites were killed by Panda for no good reason, and have little hope of getting their traffic/incomes back. Google's spam policy is skewed heavily in favor of large sites and their own properties.
评论 #5308034 未加载
_mvuc大约 12 年前
I keep checking every so often, but searching for "this phrase" or +absolute +requirement is still broken. Even "Verbatim", isn't. If they can't even get simple search right, who would trust them with anything more?
评论 #5308888 未加载
评论 #5309514 未加载
评论 #5308738 未加载
aviswanathan大约 12 年前
Scrolling is really becoming the new thing in UX design. It's an interesting contrast to the 'movie-like' flash animations of a few years ago that required no interaction on behalf of the user.
评论 #5306749 未加载
评论 #5307800 未加载
评论 #5307137 未加载
prezjordan大约 12 年前
They left out the part where they index your emails and choose items you agree with over items you don't :)
评论 #5305933 未加载
评论 #5306195 未加载
Xorlev大约 12 年前
38,800 requests/second according to their estimation.
评论 #5309213 未加载
johnmurch大约 12 年前
Is this just PR for Google? Would rather see a more technical approach - although great for forwarding to clients when asked :)
评论 #5305959 未加载
评论 #5306571 未加载
cryowaffle大约 12 年前
Whoa... really, 100 MILLION gigabytes to store "The Index"? Wow. That's big.
评论 #5306217 未加载
sytelus大约 12 年前
There are some good facts and numbers hidden in rather toy explanation:<p>1. Spam detection is automatic<p>2. There 6 types of spam<p>-Unnatural outbound links (link selling)<p>-Content copy/manufactering<p>-Keyword stuffing<p>-Forums/user generated spam<p>-Parked domains<p>-Sites hosted on spammy DNS<p>-Different content humans and bots<p>-Hacked sites<p>3. Google is removing as many as 50K spam sites per month, they get 8K reconsideration requests<p>4. Google's machine learned relevance model may be using about 200 features
manojlds大约 12 年前
&#62; By the way, in the 47 seconds you've been on this page, approximately 1,813,260 searches were performed.<p>Aren't these just some random numbers that they pull out of the air?
评论 #5306945 未加载
评论 #5306538 未加载
评论 #5306629 未加载
评论 #5306657 未加载
aeon10大约 12 年前
A beautifully designed page more than anything else
lysium大约 12 年前
Nice scroll-UI! Took some time to see the clickable items. Interesting bits about spam pages.
moeedm大约 12 年前
An awful way to learn anything.
state大约 12 年前
The better people understand their tools, the more effectively they can use them.
wfunction大约 12 年前
"We write programs &#38; formulas to deliver the best results possible."<p>No kidding.
denysonique大约 12 年前
Some of the live listed 'spam' pages appear to be genuine to me.
joshhart大约 12 年前
Answer: It uses a bunch of skip lists.<p>Source: I do hacking on top of lucene.
yarou大约 12 年前
vijay: very interesting link. thought it was interesting, despite the obvious slant.
moha24大约 12 年前
This is not how search works!!
asawant大约 12 年前
This is brilliant !!!
OGinparadise大约 12 年前
"We write programs &#38; formulas to deliver the best results possible"<p>There's a slight oversight, it should be: "We write programs &#38; formulas to deliver the most profitable results possible for this quarter"
评论 #5308199 未加载
评论 #5306679 未加载
评论 #5306689 未加载