Tell HN: Google doesn't work anymore for exact matches

331 points by bratao, over 3 years ago
For a while now I have felt that Google's results have deteriorated. It takes a lot of tricks to find what I am looking for. Today an interesting case occurred that frustrated me a lot and is worth telling HN.

First, I was looking for a song and searched for: "here were the dreams are born" (I know I mistyped). One of the first results I found was this interesting story (Google results https://imgur.com/a/gUq4XVZ):

https://mechahuggermr.tripod.com/id66.html

I took the following sentence from this story and used it in the readme of an internal project:

"David, we have been expecting you - this is what you have been searching for - this place, David, is where dreams are born"

Some people wanted to know where this quote came from and could not find it on Google.

I also tested and cannot enter any combination of parameters into Google to find this page. I tried quotation marks, literal search and no hyphen. Nothing, it is impossible to find it.

Does anyone know what is going on here? Can someone do a magic call and find this page on Google?

Has Google's AI/BERT Enhanced Search reached a point where indexed pages cannot be found?

All results were tested with a Brazilian connection and replicated in a Private Session on a US VPN.

46 comments

capableweb, over 3 years ago
I agree with your general point that the search quality has gone down; quotes don't even always work anymore to get exact results.

Looking into your suggested example: that turned out to be interesting and unexpected.

So, the exact string you put here was "David, we have been expecting you - this is what you have been searching for - this place, David, is where dreams are born", which is what you get when you copy the text from the website. It's correct that searching Google for it verbatim doesn't work.

The actual DOM of the snippet looks like this:

    “David, we have been <br>expecting you - this is what you have been searching for - this place, <br>David, is where dreams are born.”

If you take any snippet of text that doesn't cross a line-break, it seems exact searches do work, like "expecting you - this is what you have been searching for - this place" or "deep and melodious when it spoke".

If you take a snippet that does cross a line-break, then it cannot find anything, like "David, we have been expecting you" or "this place, David, is where ".

It seems that Google has unlearned how to treat different types of whitespace, especially when the author/software has introduced manual line-breaks via the <br/> HTML tag.

I'm sure they have at one point introduced some "quality filter" that gives a higher score based on how well the websites' markup is made, for one reason or another, and eventually it got so "improved" or established that even if a page is the only relevant hit for a human, the computer simply ignores the result for scoring low, since the markup is not 100% correct.
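The line-break behaviour described above can be illustrated with a small sketch. It assumes, purely for illustration, an indexer that treats every `<br>` as a hard phrase boundary; nothing here is known about how Google actually works, and `SegmentExtractor`, `segments_of`, and `phrase_found` are hypothetical names.

```python
import re
from html.parser import HTMLParser

# The snippet as quoted in the comment above.
SNIPPET = ("“David, we have been <br>expecting you - this is what you have "
           "been searching for - this place, <br>David, is where dreams are born.”")


class SegmentExtractor(HTMLParser):
    """Collect text, starting a new segment at every <br> tag (assumption)."""

    def __init__(self):
        super().__init__()
        self.segments = [[]]

    def handle_starttag(self, tag, attrs):
        if tag == "br":
            self.segments.append([])

    def handle_data(self, data):
        self.segments[-1].append(data)


def normalize(text):
    # Lowercase and keep only word tokens, joined by single spaces.
    return " ".join(re.findall(r"\w+", text.lower()))


def segments_of(html):
    parser = SegmentExtractor()
    parser.feed(html)
    parser.close()
    return [normalize("".join(parts)) for parts in parser.segments]


def phrase_found(html, phrase, cross_breaks):
    """Exact phrase match; if cross_breaks is False, a phrase may not span a <br>."""
    segments = segments_of(html)
    needle = normalize(phrase)
    if cross_breaks:
        haystacks = [" ".join(s for s in segments if s)]
    else:
        haystacks = segments
    # Pad with spaces so that only whole-word matches count.
    return any(f" {needle} " in f" {h} " for h in haystacks)
```

Under this assumption, `phrase_found(SNIPPET, "David, we have been expecting you", cross_breaks=False)` fails while the same call with `cross_breaks=True` succeeds, which mirrors the pattern reported in the comment.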
michaelt, over 3 years ago
I have an Intel Realsense camera, which sometimes reports the error "Failed to recconect" (there being a typo in the drivers) [1] - that's a pretty unique error, so in combination with the product name that should be a very easy keyword search, right? Especially if I throw in some quotation marks, to make it clear I want a literal match?

Yet when I search for *realsense "failed to recconect"* Google, in its infinite wisdom, returns pages that contain neither realsense nor recconect [2]. They offer me a supreme court opinion, a review of a car dealership, and a Facebook church service.

Correcting the spelling of a query is one thing - but also completely ignoring other keywords? Even when there are pages available that match the query? [1]

[1] https://github.com/IntelRealSense/librealsense/blob/5ff27fca5aaeec4736d6bb3bfb958fee82ee198b/src/pipeline/pipeline.cpp#L263
[2] https://imgur.com/a/okYV5V2
monkeybutton, over 3 years ago
I was looking for a specific person recently and searched: <name of person> Canada

I guess they were pretty obscure, so Google in all its wisdom displayed the results for Canada, with the entire name struck through. Fantastic. Defaulting to the most generic term in a query to the point of absolute uselessness.
westcort, over 3 years ago
I created a Google custom search engine for the text of about 40,000 out-of-copyright books: https://www.locserendipity.com/Google.html and noticed that exact text matches don’t always show up. A RegEx search of the same repository yields many more matches for specific phrases than the Google custom search does. About a year and a half ago, the quality of results went down a lot. I agree something has changed for the worse.
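A regex-based phrase search of the kind mentioned above might look roughly like the following. This is only a sketch: the comment does not say which pattern or tooling was used, so `phrase_pattern`, `find_phrase`, and the in-memory `texts` mapping are assumptions made for illustration.

```python
import re


def phrase_pattern(phrase):
    # Match the words in order, allowing any run of whitespace
    # (including line breaks) between them. Case-insensitive.
    words = phrase.split()
    return re.compile(r"\s+".join(re.escape(w) for w in words), re.IGNORECASE)


def find_phrase(texts, phrase):
    """Return the names of all documents whose body contains the phrase."""
    pattern = phrase_pattern(phrase)
    return [name for name, body in texts.items() if pattern.search(body)]


# Hypothetical two-document corpus; the first one wraps the phrase across lines.
texts = {
    "book_a": "It was the best\nof times, it was the worst of times",
    "book_b": "The best times are yet to come",
}
matches = find_phrase(texts, "best of times")
```

Because whitespace between words is matched loosely, a phrase that is wrapped across lines in the source text is still found, which is one plausible reason a plain regex scan can return more hits than an index-based search.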
ColinWright, over 3 years ago
Clickables:

https://imgur.com/a/gUq4XVZ

https://mechahuggermr.tripod.com/id66.html

I tested this, putting in the exact phrase.

DDG finds the source. Google doesn't, and instead finds this submission.
andrewclunn, over 3 years ago
I recall when YouTube used to refine the "best answer" to a question based on distributed popularity. Ask it, "How do I screw in a lightbulb?" and the video at the top would be one which had received numerous views and likes, crowdsourcing its usefulness in answering the question. Then the algorithm was changed to prioritize more recent videos. This gave rise to the churn of the novel and new over the time-tested and approved. This was better for ads, but worse for users. One cannot expect the ad-driven model NOT to have an impact on usability.
smt88, over 3 years ago
This (shitty NLP) has been bad for a while, but I did notice it get worse recently in a way that feels crippling to me. I don't have a functional search engine anymore.
lelandfe, over 3 years ago
Curiously, searching directly on the site with that quote produces "No results found," and then shows an inexact match with just that quote underneath. This is clearly a real bug on Google's side.

https://imgur.com/a/2XFogU5
weare138, over 3 years ago
It's not just you. I have been having similar problems and it just seems to keep getting worse. The other issue I've been having is the precedence of product pages over everything else. Now it seems no matter what I search for I'm bombarded with links to products and low-quality product-focused blogspam. Also, I've noticed search results will change based on what I have recently searched. The same search can yield different results at different times. Google screwed up the one thing it was actually good at. Now Google Search just feels like every other subpar Google knockoff product.
rurp, over 3 years ago
I totally agree with the consensus here that Google search result quality has declined dramatically, although I think it's been happening for at least 5 years rather than 6-12 months. Two alternatives have helped me replace Google.

DuckDuckGo has been mentioned by many others but I'll add my voice as well. It does a much better job at respecting quotes than Google (though not always!). It's very rare these days that I strike out on a search with DDG and have any luck with Google, though it does happen on rare occasions.

The other site is SymbolHound. This is great for programming queries since it actually searches symbols verbatim! I found out about it when trying to debug a complicated makefile at a new job, while not being very good at makefiles. Ever tried searching Google for some unfamiliar syntax from a makefile? Hahaha, it doesn't even try to give matching results; it's totally useless.

I miss the old days but these two alternatives, plus falling back to Google, or at least StartPage with `!sp`, still work pretty well.
braddeicide, over 3 years ago
Google results are low quality unless you enable verbatim. Tools, all results, verbatim.

It blows my mind this isn't the default. I can only assume they've adopted the opinion of search engines before them that they could benefit from showing lower quality results to keep the users on their site longer.
noobermin, over 3 years ago
As of now, searching the quote brings up this thread. I feel like Google now prioritizes certain websites (like HN) and essentially skips things like Tripod websites.
thedays, over 3 years ago
DuckDuckGo finds the quote on a Tripod site as its top and only result for this search.

https://duckduckgo.com/?q=%22David%2C+we+have+been+expecting+you+-+this+is+what+you+have+been+searching+for+-+this+place%2C+David%2C+is+where+dreams+are+born%22&ia=web
iamleppert, over 3 years ago
I really want a search engine that works more like a database. I want to see why it returned the results it did, like an EXPLAIN syntax does for SQL queries. I want fine-grained control of the actual query, and the ability to sort the results by different facets.

I think there’s 100% a market for a new search engine that is geared toward the power user, the researcher.

Google hasn’t innovated their actual search engine in ways that actually benefit the user in years. The only changes they have made have been driven by a profit motive.
ei8htyfi5e, over 3 years ago
I recently came across a video of a police officer in a high speed pursuit shooting out of his car window one-handed into oncoming traffic. I remembered the title of the video I found. I remembered it happened in New Mexico and that the police officer was moved to detective since this was his 3rd gun fight in 2 years out of the department's total of 4. I was trying to pull it up to show my fiancé how nuts this cop's behavior is.

No amount of searching on Google could find this video. 20 minutes. I was infuriated. My brain started going to dark places. Is this censorship?

No. Google sucks now. Somehow. They killed the golden goose.

I fired up Bing. Typed the title of the video and found 100 copies instantly.

I now use Bing. And it’s not bad. Generally I can find what I’m looking for. It’s like a slightly worse version of what Google used to be. But it’s an infinitely better version of what Google is now.

I also use the Brave search engine, but that’s still worse than Bing.

If you’re looking for interesting party conversation, try opening with “I recently switched my search engine to Bing.” The conversation goes to fun places and most people will agree that Google has gotten much worse.
apitman, over 3 years ago
Am I crazy or is there room for a good early 2000s-style link aggregator in today's content discovery landscape? It seems like the primary reason the likes of delicious failed is that they sold out for money. I wonder if an open source federated solution could work.
michaelcampbell, over 3 years ago
DDG does much better with quotes for required/exact matches.
OneLeggedCat, over 3 years ago
I have full faith in Google's ability to return the most profitable results.
sinuhe69, over 3 years ago
In this particular case, where the result is negative, we humans can easily recognize that something is wrong. But in other cases, where the exact match is also the best match, it’s not easy to dig through all the pages to know something is amiss. Of course, I understand that a search engine like Google was not designed to produce exact matches for long texts like this one. But no one can deny that exact matching is extremely important for finding rare/obscure stuff. Can a search engine built especially for rare, obscure stuff (in contrast to popular search) have a niche market?
Lamad123, over 3 years ago
"you.com" finds a lot of things the Googopoly doesn't!! It's kinda limited, but it sometimes helps when neither DDG nor Bing does. Not shilling for them, but I tried it and it's not that bad! In the last couple of years, Googopoly scrambled YouTube search results. The only way for me to find these is you.com.
sdflhasjd, over 3 years ago
It used to be possible to search for obscure model numbers and product codes and get only 5 results, all of which were what you were after.

*Now* it returns a bunch of spam and rubbish, with the model number nowhere to be found.

Sometimes you do find matches... and the results are just computer-generated spam for God knows what purpose.
effingwewt, over 3 years ago
A friend was telling me about a model named Marie Kim.

Google has broken *any* search with the name Kim by treating it as Kim Kardashian, no matter how you wildcard the search.

I feel truly sorry for anyone trying to get any traction with Kim as a name.

As others have noted, DDG magically gets this right.

Guess that's what happens when a search company becomes an ad company.
bbarnett, over 3 years ago
Because Google is... annoying, and silly, try verbatim search (Tools > Verbatim) after you get search results.
erichocean, over 3 years ago
Since Google has source code going back to their founding, it would be nice if they literally offered older versions of their search, just with updated crawl data.

Then you could use "Google 2008" and get decent results. (Google still only needs to crawl the web once for all versions.)
ergonaught, over 3 years ago
Google finds it for: "David, is where dreams are born.”"

And: "The voice was deep and melodious when it spoke."

And most other things. Examine the raw HTML for that area and you might give them a pass when searching for an exact phrase that doesn't actually exist in the document itself.
albertopv, over 3 years ago
A few days ago I looked for an issue about the Maven daemon (not reusing the daemon when using Java 8); all the first-page results were about Gradle, all of them. This time DuckDuckGo did a lot better. I'm now doubting the reliability of Google results in general.
656565656565, over 3 years ago
I use allintext: for almost every search on Google
broknbottle, over 3 years ago
I was just thinking about this earlier today when I was searching for something specific and getting very irrelevant results. It used to be that the first page of results was very high quality, turning only somewhat relevant at the bottom, and page two and beyond was pretty much "don’t even bother". Now, if you are lucky, the first 3-4 results might be relevant; the middle is spam with similar keywords that appear to match your search, but it has a funky domain (e.g. .gg) and is either nonsense or doesn’t work because it is malicious crap and gets blocked. Anything after the middle is pretty much garbage.
dnissley, over 3 years ago
Fwiw, the original page is formatted oddly. The line breaks seem like they're part of the content? As opposed to them just being one big paragraph that is wrapped by a single tag?

E.g. try doing this search, with each individual line quoted separately: https://www.google.com/search?q=%22David%2C+we+have+been%22+%22expecting+you+-+this+is+what+you+have+been+searching+for+-+this+place%22

My question at this point is -- did this literal search ever work on Google?
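The "quote each line separately" query above can be built programmatically. The following is a minimal sketch using only Python's standard library; `quoted_query_url` is a hypothetical helper name, and the only thing taken from the thread is the search URL format seen in the link.

```python
from urllib.parse import urlencode


def quoted_query_url(segments, base="https://www.google.com/search"):
    """Build a search URL in which every segment is a separate quoted phrase."""
    query = " ".join(f'"{segment}"' for segment in segments)
    # urlencode percent-encodes quotes and commas and turns spaces into '+'.
    return f"{base}?{urlencode({'q': query})}"


url = quoted_query_url([
    "David, we have been",
    "expecting you - this is what you have been searching for - this place",
])
```

Splitting the phrase at the points where the page has `<br>` tags means no single quoted phrase has to span a line break.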
josefcullhed, over 3 years ago
Interesting, in Sweden I only got this story when I made the same searches: https://imgur.com/a/k1Avbtm
hoppyhoppy2, over 3 years ago
I was trying to search for a specific phrase related to antenna gain (technical term) and kept getting results for "antenna gets". Even when putting the phrase in quotation marks.
ttybird2, over 3 years ago
For a few months now Google search has been basically useless. The only thing that you get nowadays is SEO spam and mainstream sites. At this point even Bing/DDG is better.
narrator, over 3 years ago
To see how badly Google results have deteriorated, especially for politicized topics, search for "what countries are using ivermectin" with Google and then do it again with http://Yandex.com . All the Yandex results answer the question. All the Google results are either broken or are pages talking about why ivermectin should not be used.
MildlySerious, over 3 years ago
I've had several frustrating experiences recently with search engines, Google included. The most notable: I have tried to find a video clip of Merkel joking about the German city of Bielefeld not existing. There are plenty of articles, but the clip, which I know exists, doesn't come up. Not on YT, not on Google, not on Bing. Maybe I've lost my Google-fu somewhere along the way, I don't know anymore.
_ink_, over 3 years ago
I agree, but I think there are three reasons for the bad results:
- Google tweaks. They increasingly target mainstream users and thus neglect the power users with their knowledge of Google hacks.
- Content drain. More and more content is created in walled gardens, like Discord, Facebook, etc.
- SEO. These optimized pages pollute the results.
guerrilla, over 3 years ago
I stopped using Google entirely. I honestly feel violated every time it strips out words that I asked it to search for on the very first page. NO, I said search for this, do not do something ELSE you piece of shit.

I actually use DuckDuckGo exclusively now, not because it got better (it did a tiny bit), but because Google got so absolutely horrible that DDG is now actually better! I have the habit of trying Google if I can't find something with DuckDuckGo, but honestly I don't even know why I bother because not once has it helped since this degradation started.

I do wonder why though. I got the feeling that maybe they just gave up. Maybe they don't have to care anymore, being a *de facto* monopoly and having so many other projects. It's hard not to think that spammers run the internet now... Ad networks run everything and then content is just generated shit spammed into results and feeds.

</rant>
emptyparadise, over 3 years ago
I've been looking up exact variable names from various API documentation and getting no relevant results despite these variable names appearing in multiple Stack Overflow posts, the API documentation where I copied the text from and of course GitHub.
FrozenVoid, over 3 years ago
https://www.google.com/search?hl=en&q=%22Where%20Dreams%20Are%20Born%20%2D%2D%20An%20A.I.%20Epilogue%22
Jensson, over 3 years ago
> Can someone do a magic call and find this page on Google?

I just pasted this unquoted into Google and found the site:

> David, we have been expecting you - this is what you have been searching for - this place, David, is where dreams are born

Why would you quote this?
jeffbee, over 3 years ago
Web indexing and search is a constant battle between space and time, so it does not really surprise me that results for any given input may not be stable over time. Generalizing from single examples, however, is illogical.
MattGaiser, over 3 years ago
Replicated the problem here in Canada too. Bing does not find it either though.
EVa5I7bHFq9mnYK, over 3 years ago
Bing returns all relevant results for that realsense query.
indigodaddy, over 3 years ago
Reminds me of Googlewhack days... those were fun.
sinuhe69, over 3 years ago
Besides DDG, Yandex (the Russian search engine) also found the verbatim result. It reminds me of the era of meta-search engines, which compared and combined the results of different search engines to produce a better one. Perhaps it’s time for such ideas again.
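One simple way a meta-search engine could combine several ranked result lists is reciprocal rank fusion. The sketch below is only an illustration of that general technique, not a description of how any historical meta-search engine worked; the function name, the constant `k=60`, and the placeholder URLs are assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge ranked lists: each URL scores 1 / (k + rank) per list it appears in."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, url in enumerate(ranking, start=1):
            scores[url] += 1.0 / (k + rank)
    # Highest combined score first; ties are broken alphabetically.
    return sorted(scores, key=lambda url: (-scores[url], url))


# Hypothetical result lists from two engines.
engine_a = ["https://a.example/", "https://b.example/"]
engine_b = ["https://b.example/", "https://c.example/"]
merged = reciprocal_rank_fusion([engine_a, engine_b])
```

A page that appears in several lists accumulates score from each, so a result found by more than one engine tends to rise above results found by only one.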
hamiltonians, over 3 years ago
Google hardly works for anything.
tetek, over 3 years ago
yandex.com