TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Why doesn't anyone create a search engine comparable to 2005 Google?

398 pointsby syedkarimover 3 years ago
I seem to recall that Google consistently produced relevant results and strictly respected search operators in 2005 (?), unlike the modern Google. And back then, I think search results were the same for everyone, rather than being customized for each user.

74 comments

gbmattover 3 years ago
Ha, yes, I&#x27;ve done that at <a href="https:&#x2F;&#x2F;gigablast.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;gigablast.com&#x2F;</a> . The biggest problems now are the following: 1) Too hard to spider the web. Gatekeeper companies like Cloudflare (owned in part by Google) and Cloudfront make it really difficult for upstart search engines to download web pages. 2) Hardware costs are too high. It&#x27;s much more expensive now to build a large index (50B+ pages) to be competitive.<p>I believe my algorithms are decent, but the biggest problem for Gigablast is now the index size. You do a search on Gigablast and say, well, why didn&#x27;t it get this result that Google got. And that&#x27;s because the index isn&#x27;t big enough because I don&#x27;t have the cash for the hardware. btw, I&#x27;ve been working on this engine for over 20 years and have coded probably 1-2M lines of code on it.
评论 #29421323 未加载
评论 #29422432 未加载
评论 #29417493 未加载
评论 #29424735 未加载
评论 #29418602 未加载
评论 #29417470 未加载
评论 #29418561 未加载
评论 #29422680 未加载
评论 #29417756 未加载
评论 #29418044 未加载
评论 #29423063 未加载
评论 #29418419 未加载
评论 #29425522 未加载
评论 #29419692 未加载
评论 #29417549 未加载
评论 #29439858 未加载
评论 #29417915 未加载
评论 #29418208 未加载
评论 #29426518 未加载
评论 #29421818 未加载
评论 #29425543 未加载
评论 #29425705 未加载
评论 #29446811 未加载
评论 #29426514 未加载
评论 #29421902 未加载
评论 #29426012 未加载
评论 #29425247 未加载
评论 #29421579 未加载
评论 #29425837 未加载
评论 #29417370 未加载
评论 #29417583 未加载
评论 #29423675 未加载
评论 #29426452 未加载
评论 #29417956 未加载
评论 #29422964 未加载
评论 #29424814 未加载
评论 #29423061 未加载
评论 #29428586 未加载
评论 #29436432 未加载
评论 #29425577 未加载
评论 #29425436 未加载
评论 #29423228 未加载
评论 #29423759 未加载
评论 #29437344 未加载
评论 #29418315 未加载
评论 #29417668 未加载
评论 #29422318 未加载
评论 #29417594 未加载
评论 #29417826 未加载
评论 #29417456 未加载
评论 #29425314 未加载
lolinderover 3 years ago
The consistent theme every time this comes up is that dealing with the sheer weight of the internet is almost impossible today. SEO spam is hard to fight and the index gets too heavy. However, I wonder if this is a sign that we&#x27;re looking at the problem wrong.<p>What if instead of even <i>trying</i> to index the entire web, we moved one step back towards the curated directories of the early web? Give users a search engine and indexer that they control and host. Allow them to &quot;follow&quot; domains (or any partial URLs, like subreddits) that they trust.<p>Make it so that you can configure how many hops it is allowed to take from those trusted sources, similar to LinkedIn&#x27;s levels of connections. If I&#x27;m hosting on my laptop, I might set it at 1 step removed, but if I&#x27;ve got an S3 bucket for my index I might go as far as 3 or 4 steps removed.<p>There are further optimizations that you could do, such as having your instance <i>not</i> index Wikipedia or Stack Overflow or whatever (instead using the built-in search and aggregating results).<p>I&#x27;m sure there are technical challenges I&#x27;m not thinking of, and this would absolutely be a tool that would best serve power users and programmers rather than average internet users. Such an engine wouldn&#x27;t ever replace Google, but I&#x27;d think it would go a long way to making a better search engine for a single user&#x27;s (or a certain subset of users&#x27;) everyday web experience.
评论 #29423918 未加载
评论 #29423586 未加载
评论 #29423715 未加载
评论 #29424117 未加载
评论 #29428657 未加载
评论 #29423760 未加载
评论 #29424195 未加载
评论 #29423570 未加载
评论 #29423896 未加载
评论 #29423627 未加载
BitwiseFoolover 3 years ago
Natural Language Processing is a pox on modern search engines. I suspect that Google et. al. wanted to transform their product into an answer engine that powers voice assistants like Siri and just assumed everyone would naturally like the new way better. I can&#x27;t stand how Google is always trying to guess what I want, rather than simply returning non-personalized results solely based on exactly what I typed in the textbox.<p>While that may be good for most people, there is still a lot of power and utility in simple keyword-driven searches. Sadly, it seems like every major search engine <i>has</i> to follow Google&#x27;s lead.
评论 #29417347 未加载
评论 #29418237 未加载
评论 #29417505 未加载
vincent_sover 3 years ago
Some people try:<p><a href="https:&#x2F;&#x2F;www.mojeek.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.mojeek.com&#x2F;</a><p><a href="https:&#x2F;&#x2F;fireball.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;fireball.com&#x2F;</a><p><a href="https:&#x2F;&#x2F;search.brave.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;search.brave.com&#x2F;</a>
评论 #29417671 未加载
评论 #29425872 未加载
评论 #29417653 未加载
abhaynayarover 3 years ago
I&#x27;m probably the only person who doesn&#x27;t think Google search has deteriorated. I play security CTFs, so a lot of times I have to search for peculiar technical details on various software. Also, like any other human being, I also make generic queries. In both cases, I feel like I almost always get to the desired webpage within the top few results.
评论 #29419033 未加载
评论 #29418301 未加载
评论 #29425271 未加载
pydryover 3 years ago
Early 2000s google index ran in a garage. The current google index has dedicated power stations.<p>It&#x27;s a bit like the car industry - you could run a startup from your garage in the early days but you need titanic amounts of capital to compete now thanks to vertical integration.<p>Major governments and billionaires can compete but everybody else is locked out of the market (most &quot;startups&quot; use bings index).
评论 #29427859 未加载
评论 #29422941 未加载
评论 #29425209 未加载
rovingEngineover 3 years ago
I think Google was “better” from a users point of view in 2005 because it wasn’t that good at selling ads yet. I still remember the epiphany of the first time I used Google in 1999. It was amazing.<p>I’ve thought the same about pre-ad Twitter and Facebook.<p>Early on, startups with free services look a lot like non-profits and just maximize user benefit to grow. The problem is they’re not non-profits, and have to make money at some point. That has tended to mean ads.<p>I’d easily pay, say, $9&#x2F;mo to have access to an ad-free search engine that made me feel the way 1999 Google did.
评论 #29417903 未加载
wodenokotoover 3 years ago
The web has changed drastically. I’d imagine 2005-google engine today would be nothing but abandoned Wordpress blogs with comment spam.
评论 #29417783 未加载
评论 #29417324 未加载
nfriedlyover 3 years ago
I think DuckDuckGo is closer to what you want. Same results for everyone, better privacy, and they&#x27;re proactive about improving their results.<p><a href="https:&#x2F;&#x2F;duckduckgo.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;duckduckgo.com&#x2F;</a><p>Part of the problem is that there&#x27;s a lot more low-quality content to wade through now than there was in 2005. I think the Google of 2005 would have trouble delivering quality results today also.
评论 #29417844 未加载
评论 #29417688 未加载
评论 #29417708 未加载
评论 #29422570 未加载
评论 #29417836 未加载
评论 #29417813 未加载
评论 #29417835 未加载
评论 #29417791 未加载
mrkramerover 3 years ago
They do[0] but nobody cares anymore. Google controls web distribution through Google Chrome. I think we are at the point of no return. There won&#x27;t be any competition anytime soon no matter what US government does. Only innovation can displace Google.<p>[0] <a href="https:&#x2F;&#x2F;search.marginalia.nu&#x2F;" rel="nofollow">https:&#x2F;&#x2F;search.marginalia.nu&#x2F;</a>
评论 #29417434 未加载
pkambover 3 years ago
I would use a search engine that only indexed Reddit, Stack Exchange, Wikipedia, and a small number of other sites.<p>And that specifically blocked Pinterest, Quora, most non-personal “blogs”, etc.<p>People suggest DDG ! operators, but I don’t want to use a site’s (bad, single-site) search box. I want a multi-site SERP that only displays results from known good sites, which are customizable.
评论 #29424338 未加载
评论 #29418687 未加载
评论 #29423671 未加载
评论 #29419180 未加载
moralestapiaover 3 years ago
Please do it! Google is now complete trash.<p>Also gmail, used to have the best spam filters out there, now it&#x27;s utter crap. Emails from my google analytics account, for whatever reason and disregarding how many times I have clicked on &quot;Not Spam&quot;, go to spam, and it&#x27;s their own service; while messages who are textbook spam (&quot;Hi, I just got some inheritance ...&quot;) go to my inbox.<p>AI (in its current state) is crap, when is the industry going to accept these are the emperor&#x27;s new clothes.
nickppover 3 years ago
Because we&#x27;re not having a 2005-Web anymore. More to the point, SEO &amp; Google have evolved together. To have barely relevant results today you need to be <i>good</i>. That takes stellar talent which costs huge amounts of money.<p>Thus, the Google of today, which is optimized to extract that money from us.
评论 #29421362 未加载
评论 #29417837 未加载
评论 #29417304 未加载
indymikeover 3 years ago
Brave&#x27;s new search engine seems to work pretty well. Have been using it as my primary for about 10 days, and so far, I&#x27;ve only had to revert to Google once, and when I did the results were chock full of spam.
评论 #29418151 未加载
评论 #29417662 未加载
freediverover 3 years ago
We are building one [1] as well as a few other people that I am aware of with different approaches and business models.<p>We also need to be aware that when we remember past times it usually carries a romantic, nostalgic note. Web is very different than it was 15 years ago and the problem of search has evolved.<p>What you are looking for is basically &#x27;grep for the web&#x27; but it is just one facet of search that we use today. 15 years ago you would not get an instant answer to a question like you do today and many users would not be able to live without that today. There are also maps and location based answers, all sorts of widgets like translation etc. Also world became more polarized so an objective best search result became more difficult to produce, specially for events covered in news, which means bias inevitably starts to creep in.<p>This is not to say that Google is good or bad today, it is what it is and they are doing best they can. Startups like ours see an opportunity on the market, in large part to help savvy users find what they want.<p>[1] <a href="https:&#x2F;&#x2F;kagi.com" rel="nofollow">https:&#x2F;&#x2F;kagi.com</a>
ColinHayhurstover 3 years ago
You might call this a search engine based on the principle of Information Neutrality.<p>“Information Neutrality is the principle to treat all information provided (by a service) equally. The information provided, after being processed by an information-neutral service, is the same for every user requesting it, independent of the user’s attributes, including, e.g., origin, history or personal preferences and independent of the financial or influential interest of the service provider, as well as independent of the timeliness of information.&quot;<p>I wrote about this in relation to search [0]. We need to be allowed more freedom to choose search engines and services. One (default or selected) choice for search is unhealthy. We shouldn&#x27;t have to choose between Google or Bing; DuckDuckGo or Startpage; Brave or Ecosia; Mojeek or Gigablast ..... Personally I use all 8 of these and more, as also explained [0].<p>[0] <a href="https:&#x2F;&#x2F;blog.mojeek.com&#x2F;2021&#x2F;09&#x2F;multiple-choice-in-search.html" rel="nofollow">https:&#x2F;&#x2F;blog.mojeek.com&#x2F;2021&#x2F;09&#x2F;multiple-choice-in-search.ht...</a>
评论 #29425231 未加载
boyterover 3 years ago
I had a brief stab at this with <a href="https:&#x2F;&#x2F;bonzamate.com.au" rel="nofollow">https:&#x2F;&#x2F;bonzamate.com.au</a> although its Australia specific to reduce the crawling and indexing requirements. It&#x27;s main twist is that it runs entirely in AWS Lambda&#x27;s meaning it costs nothing when it&#x27;s not being used.
ravenstineover 3 years ago
I think what [some] people actually want isn&#x27;t the Google of 2005 but to have a search engine where they don&#x27;t feel like they&#x27;re being manipulated.
erpellanover 3 years ago
Even if Google dusted off their 2005 codebase and ran it on today&#x27;s web it wouldn&#x27;t come close to the results quality of Google in 2005. The SEO industry has been in an arms race with the search engines for 16 years. 2005 Google would be like a goldfish in a piranha tank.
jakub_gover 3 years ago
Cliqz wanted to build new search engine but failed. It&#x27;s just too difficult to operate at that scale and break the existing monopoly of big G.<p><a href="https:&#x2F;&#x2F;www.burda.com&#x2F;en&#x2F;news&#x2F;cliqz-closes-areas-browser-and-search-technologies&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.burda.com&#x2F;en&#x2F;news&#x2F;cliqz-closes-areas-browser-and...</a><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=23031520" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=23031520</a><p><a href="https:&#x2F;&#x2F;0x65.dev&#x2F;blog&#x2F;2019-12-06&#x2F;building-a-search-engine-from-scratch.html" rel="nofollow">https:&#x2F;&#x2F;0x65.dev&#x2F;blog&#x2F;2019-12-06&#x2F;building-a-search-engine-fr...</a>
评论 #29417896 未加载
评论 #29417911 未加载
ab_testingover 3 years ago
I think a lot of people are ignoring the issue that the web has changed considerably since 2005. It is approximately 10 times larger in terms of number of websites and web pages. And a lot of it is SEO junk that is just designed for search engines to be easier to parse and show ads in your face.<p>Also user preferences have changed in the last decade or so. I know millenaials and users in their late 30&#x27;s or early 40&#x27;s still yearn for the old web where they would type a search term and correct results would astonish them. However, younger users tend to gravitate to videos and that is why a large portion of the google results are now video results.
greymanover 3 years ago
1) Google is better at AI, for example let&#x27;s take this sloppy search: &quot;some joke where you can&#x27;t tell if it is serious or joke&quot;<p>It is called Poe&#x27;s law, and Google returned it at #4. Bing or Duckduckgo don&#x27;t have a clue...<p>2) They have a years of user&#x27;s data, like for specific term, they see what users clicked most, so they see which results were perceived as most relevant. It is hard to catch up if you dont have such data.<p>3) They developed anti-spamming tools during the years of fighting against SEO-spammers.
评论 #29422178 未加载
评论 #29418661 未加载
gorgoilerover 3 years ago
Random thought, based especially on using DuckDuckGo for two years:<p>Search engine isn’t singular, it’s plural.<p>(1) Search engine for something I know exists.<p>(2) Search engine for finding something new.<p>There’s a market for both, but you don’t have to solve both problems with the same product.<p>Sometimes I switch to Google for the former, but the latter works well enough for me that I don’t care what else Google would’ve shown me.<p>More often than not, my feeling is Google would only have shown me more ads in addition to whatever I could already find elsewhere.
keddadover 3 years ago
While I feel that Google has become worse in last couple of years, I&#x27;m pretty sure it is still better now when 15 years ago. Maybe it is just some kind of nostalgia?
评论 #29417491 未加载
ineedasernameover 3 years ago
SEO wars are at least part of it. Google&#x27;s algorithm has evolved over time not just top optimize advertising views&#x2F;clicks and take over more screen space, but also to battle the constant gamification of their algorithm by SEO that, once you eventually get to the real results, will surface less relevant&#x2F;spammy&#x2F;scammy etc results if Google doesn&#x27;t constantly push back against the worst SEO abusers.
phendrenad2over 3 years ago
The 2005 Google model only made sense in the 2005 internet. Google had the luck to become a search monopoly, and they quickly created Chrome to ensure that no one would ever switch away from Google search, so they could maintain the monopoly.<p>Now that Google exists, you can&#x27;t create another one. There&#x27;s only room for one.<p>Another thing is the rise of &quot;content sites&quot;, like this one (Hacker News). I&#x27;m sure YCombinator doesn&#x27;t like getting hit by dozens of crawlers. The impulse to ban everything that crawls except (Google|Bing|Baidu|VK) is too great.<p>A lot of alternative suggestions are being thrown into this discussion. Let me throw in mine: Reverse the concept of the &quot;crawler&quot;. Instead of following links around the internet randomly, require sites to register with you and request to be crawled and&#x2F;or submit a sitemap. It would be hard to get started, but once something like this gained momentum, I believe that there&#x27;s room for several of these reverse-search-engines to compete.
chillingover 3 years ago
Yesterday there was a discussion[1] about it and someone suggested yandex.com. I&#x27;m using it since than and really love it. It&#x27;s like going back to 2003 where everything was just plain and simple.<p>[1]: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29393467" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29393467</a>
jerhewetover 3 years ago
Do not force me into autocomplete mode when I&#x27;m typing in my search terms. I don&#x27;t care what anyone&#x27;s &quot;reasons&quot; are for forcing me to put up with flashing, irrelevant bullshit when I&#x27;m searching for something. I don&#x27;t care how &quot;fast&quot; it is.<p>Just let me type stuff into the search box -- including typo corrections and modifications to what I&#x27;m searching for -- and hit ENTER to start the actual search.<p>When I&#x27;m ready to start my search I&#x27;ll hit the fucking ENTER key. Stop annoying me with your stupid assumptions about what I&#x27;m looking for.<p>This ONE THING is why I switched to Webcrawler.com two years ago. I type in five or ten words with ZERO craptastic guesses flashing around on my screen, hit ENTER, and THEN it returns what I&#x27;m looking for.
causiover 3 years ago
Lately I&#x27;ve noticed Google has just started ignoring search operators. Search results are missing terms in quotes and include terms with a leading - sign on them. It&#x27;s like they&#x27;ve decided we&#x27;re too stupid to know what we&#x27;re looking for.
8bitsruleover 3 years ago
Looks like millionshort.com (which I learned-of on HN) died recently. For me, its results were more useful than most others (even without the &#x27;leave out the top nnn sources&#x27; feature). Hoping it was an experiment that will bear fruit.
drcongoover 3 years ago
I&#x27;ve been using kagi.com for a month or so now, and it consistently beats DDG and Ecosia for result quality. I&#x27;d guess it beats Google too, since last time I used Google it was nothing but ads and spam which is why I stopped.
评论 #29418783 未加载
BbzzbBover 3 years ago
No mention of DDG in the comments? Is there a reason I&#x27;m not seeing or it&#x27;s just not the preferred alt-search on HN? Seems to have been working fine for me when I struggle to get past the funnels and content mills on Google.
评论 #29417714 未加载
评论 #29417548 未加载
评论 #29417489 未加载
评论 #29417608 未加载
flipdotover 3 years ago
Not sure if this is any close to what you’re trying to find, but there’s <a href="https:&#x2F;&#x2F;github.com&#x2F;benbusby&#x2F;whoogle-search" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;benbusby&#x2F;whoogle-search</a>
llaollehover 3 years ago
Everyone runs in the other direction anytime a search engine is mentioned. The thought of competing with Google turns people off.<p>Even in 2021, despite how bad it&#x27;s become, it&#x27;s still miles ahead of other competitors.
评论 #29418468 未加载
not2bover 3 years ago
I think you&#x27;re being nostalgic for something you don&#x27;t remember very well.<p>In that era, Google would return a match based on words that appear in the links to a URL but not in the article itself, meaning that it was easy to produce &quot;Googlebombs&quot;. For example, from 2005-2007 the top hit for &quot;miserable failure&quot; was the Wikipedia article for George W. Bush.<p>See <a href="https:&#x2F;&#x2F;www.screamingfrog.co.uk&#x2F;google-bombs&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.screamingfrog.co.uk&#x2F;google-bombs&#x2F;</a> for some of the &quot;better&quot; ones.
karmasimidaover 3 years ago
Google does its job.<p>I heard HN constantly crying over its deteriorating quality, but I am not noticing it that much, not better not worse, it just does its job.<p>To create 05 Google, it is easily billions of dollars and years of investment, before people will treat you seriously.<p>The reason we didn&#x27;t get 05 Google could only because it is not profitable. Some nation state attempt to demonopolize the search engine business might work, but I didn&#x27;t expect any for profit organization to easily attempt doing this, let alone individual hobbyists
simonebrunozziover 3 years ago
These guys [0] have built something really close to 2005-Google, and possibly slightly better.<p>The parent company, Tiscali, was a huge hit in the 1990s, as it provided internet access to millions of Italians. It went through some struggle for several years, but lately the original founder, Renato Soru, came back to run the company.<p>The company is based in Cagliari, the capital of Sardinia, Italy.<p>[0]: <a href="https:&#x2F;&#x2F;www.istella.it&#x2F;en&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.istella.it&#x2F;en&#x2F;</a>
dave333over 3 years ago
Does Gigablast ignore or downrate stuff on .info domains? Seems to like <a href="https:&#x2F;&#x2F;www.fiendishsudoku.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.fiendishsudoku.com&#x2F;</a> for &quot;fiendish sudoku&quot; search but doesn&#x27;t know about <a href="https:&#x2F;&#x2F;www.extremesudoku.info&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.extremesudoku.info&#x2F;</a> for &quot;extreme sudoku&quot; search.
jacquesmover 3 years ago
I&#x27;d love a much simpler version of search engines: an engine that I can give a long list of websites to crawl, and to completely ignore the rest.
criddellover 3 years ago
Why don&#x27;t you want personalized results? If I search for &quot;subaru service&quot; I want to find Austin Subaru, not Thorp Subaru in Cape Town.
评论 #29417683 未加载
评论 #29418060 未加载
willciprianoover 3 years ago
I&#x27;d like to see a &quot;just search&quot; engine, all it does it search for a specific string, case insensitively, across the entire web. No curation or anything, just sorted in lexicological order closest match first maybe falling back to page age if it has more then one exact match. Perhaps give me some regular expressions as well.
评论 #29418374 未加载
评论 #29418429 未加载
swframe2over 3 years ago
Have a look at gpt-3 if you want to see what the future dominant search engine will be. It will not find relevant results, it will write it on the fly customized for exactly what you want to read. (Maybe products will just ship to your door and be auto paid because the future ad targeting AIs will know you so well.)
评论 #29417634 未加载
emodendroketover 3 years ago
Well, Cuil had a lot of money and couldn&#x27;t do it. I don&#x27;t know how you quantify your assertions but I suspect that if you brought back 2005 Google it would be easily gamed and struggle to deal with social media sites where a lot of content people are looking for is now found.
kumarswover 3 years ago
I feel like we are at the low point or even losing the battle between search engines and SEO spam. Maybe it is time for the Yahoo-style curated directory to return? We seem to be getting a microcosm of this with the awsome-* GitHub lists and Gemini with its near-nonexistent search.
andrewclunnover 3 years ago
What about a search engine that only indexed information and technology &quot;alternative&quot; sites, specifically to give you the results most likely to be purged or demoted from Google&#x27;s results? Would be simple enough in scope and have a built in market and use.
motohagiographyover 3 years ago
I do like the idea that instead of crawling and indexing, the next generation search will likely be more like a federated community search app that indexes the stuff members actually read. Google search isn&#x27;t so much a repository as a consensus about what&#x27;s important, hence why it&#x27;s so politicized to the point of becoming unreliable, but also why it too is vulnerable to disruption.<p>Imo, 2005 google got initial traction because of its tech forum post indexing, as I remember my switch to it was because it became an extension and then replacement for manpages. In that sense, what made it good was it reflected the consensus of what its incredibly influential userbase thought was important and just managed that really well. The demographic impact of the U.S. Gen X all using it at once didn&#x27;t hurt either.<p>The equivalent today, as a lot of us say, is that blockchains are in the 1997 internet phase, and the service that makes the content of those as navigable as the 90&#x27;s internet, will likely grow in a similar way.<p>Search that provides young people with privacy and freedom to pursue their true interests will be the dominant strategy. Its success will be because it&#x27;s a product that rides growth, and not because it &quot;solved a problem.&quot; Imo, we all index too much on the privacy pattern because the freedom pattern is too risky.<p>What&#x27;s changed since that time are the maturity of things like Bloom and other probabilistic filters, Apple&#x27;s private set intersection, differential privacy, zksnarks, and everybody you&#x27;d ask an opinion from now gets their content through mobile devices. Apple&#x27;s ecosystem is equipped to do this kind of search, but they&#x27;re too exposed politically to get into it. Meta will likely go there, but nobody&#x27;s going to trust them willingly.<p>A protocol that generated a cryptograpically strong anonymous index from your browsing - and instead of putting it on google&#x27;s servers, it was on a chain, or the content index information and its evolving consensus score was included in something like a DNS record - may still unseat these ensconced interests. IPFS and other P2P or torrents might do something like that as well. Blockchains maybe good for that consensus&#x2F;desire score.<p>It&#x27;s not something you architect and design top down that has to solve all cases, it will be just another useful product that grows while riding a demographic change. It would be on the level of inventing HTML&#x2F;HTTP again, which, when you think about it, was just another dude making a thing he needed.
baggachipzover 3 years ago
<a href="https:&#x2F;&#x2F;kagi.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;kagi.com&#x2F;</a> is a new engine (and Orion Browser) which seems like what you&#x27;re talking about. I&#x27;ve been using it some and like it so far. The browser is fantastic.
hvasilevover 3 years ago
All big tech businesses at their core are monopolies. Once a significant field has been figured out, it is very difficult to compete with the market standard, unless they screw up so hard that that THE AVERAGE user starts searching for an alternative.
WalterBrightover 3 years ago
I&#x27;d like to see categories like travel, science, history, art, etc. The web pages could pick which categories their page falls into using meta tags. The user has the option of selecting which category they are interested in searching within.
lgrialnover 3 years ago
What I miss most of all from the Good Old Days was getting as many hits back as I could read.<p>Rather than being told &quot;No, there are only eight pages of results on anything in the goddamned world. Really. Would I lie to you?&quot;
dangover 3 years ago
Ongoing related thread:<p><i>Gigablast Search Engine</i> - <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29421898" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29421898</a> - Dec 2021 (10 comments)
marksbrownover 3 years ago
I&#x27;d like a way of automatically filtering for websites that :<p>* Don&#x27;t use JS * Don&#x27;t use Google analytics * Don&#x27;t weigh more than a few kB per page * Don&#x27;t show any sites with ads<p>That would be a place to begin.
评论 #29428171 未加载
评论 #29425122 未加载
dragonwriterover 3 years ago
&gt; Why doesn&#x27;t anyone create a search engine comparable to 2005-Google?<p>Because the universe being searched isn&#x27;t the internet of 2005 and earlier, and because user expectations have moved on, too.<p>Plus the index expense.
mrfusionover 3 years ago
I’ve always wondered why you can’t use SEO optimizations for GOOGLE as a negative weight and penalize those pages.<p>For example if my search term appears in the URL I can almost guarantee I don’t want that page.
tigerlilyover 3 years ago
Surely there must be some way to have distributed search compute a la folding&#x2F;seti@home or those mersenne prime guys.<p>I&#x27;d gladly pool in some of my CPU time if it helps build a better search.
评论 #29418069 未加载
gompertzover 3 years ago
I&#x27;ve been having a lot of good luck with Lycos (yeah, that Lycos, from 1995!)... It never returns pay wall or &quot;opinion&quot; based results (I.e. Medium).
estover 3 years ago
Because today&#x27;s web are full of walled gardens, and most content are going mobile , in streaming, and SPA rendered, which is no longer plain text based.
s1k3sover 3 years ago
I don’t know how Google was in 2005, but in ~2010 I was able to pull a website on #1 with 0 cash spent, just by manipulating PR. That doesn’t seem great to me.
vangelisover 3 years ago
They have, sort of: <a href="https:&#x2F;&#x2F;search.marginalia.nu&#x2F;" rel="nofollow">https:&#x2F;&#x2F;search.marginalia.nu&#x2F;</a>
beefmanover 3 years ago
Can you also create a web comparable to the 2005 web?<p>Well, it&#x27;s wikipedia. So just create a search engine for that, since their search sucks rocks.
ameliusover 3 years ago
Also, where are the books about writing a search engine?<p>Knuth&#x27;s &quot;Searching and Sorting&quot; volume desperately needs an update.
评论 #29418445 未加载
anotheraccount9over 3 years ago
Check out the dead internet theory. If most people browse 1% of the web, what&#x27;s up with popular search engines?
hermann123over 3 years ago
I use <a href="https:&#x2F;&#x2F;swisscows.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;swisscows.com&#x2F;</a>
axegon_over 3 years ago
Two major reasons: costs to build and maintain and manpower needed. Both are practically impossible to come by.
ChrisArchitectover 3 years ago
related 2 days ago:<p><i>Ask HN: Has Google search become quantitatively worse?</i><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29392702" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=29392702</a><p>Inviting all the paranoid&#x2F;speculative&#x2F;hearsay&#x2F;personal experience responses. Lame Ask HNs!!!!!
mkbknover 3 years ago
I am a non-dev and Ecosia and DuckDuckGo are perfect for me. Not used Google since more than 3 years now.
ofouover 3 years ago
An end-to-end machine learning system should compress all the web into a good search engine soon.
hereforphoneover 3 years ago
Because the money lies in modulating your product according to the whims of the highest bidders.
chrisgomanover 3 years ago
too many crappy websites, probably needs a &quot;committee&quot; to whitelist domains (only good quality ones) but probably too much work for not enough money or needs some monetization strategy
rasenganover 3 years ago
This is how Private Search [1] works since it decouples the search from the user. This means nobody knows both who searched and what they searched for. This is a huge leap for privacy in search.<p>[1] <a href="https:&#x2F;&#x2F;private.sh" rel="nofollow">https:&#x2F;&#x2F;private.sh</a>
评论 #29417221 未加载
评论 #29417191 未加载
评论 #29417889 未加载
评论 #29417421 未加载
评论 #29418927 未加载
peanut_wormover 3 years ago
Isn’t that what DuckduckGo is?<p>DDG is pretty useless though unfortunately.
richardsocherover 3 years ago
you.com supports many of the standard operators and has specific reddit, stackoverflow, MDN apps for developers.
fnord77over 3 years ago
information-dense pages of yore have been replaced by really wordy, probably generated SEO optimized blog junk.
Hakashiroover 3 years ago
What is your gripe with DuckDuckGo?