YaCy, a distributed Web Search Engine, based on a peer-to-peer network

288 points by Timothee about 1 year ago

21 comments

boyter about 1 year ago
I actually half wrote an RFC of a spec and two implementations of a federated search last year, rather than doing the distributed hash table that YaCy does.
I wanted results to be re-rankable by the peers by sharing the scores that went into them. The idea being that with a common protocol based on the ideas of ActivityPub you could get peers of searches working together to hopefully surface interesting things.
Something I should probably finish and publish at some point. It worked up to the hundreds of peers I tested.
The reason I mention this is because I also wanted to add a front end onto YaCy, which turned out to be harder than I expected. It's a wonderful project and you can find great stuff through it, but because of the way the peers return results it's sometimes hard to find the same thing again. It's also not quite as hackable as I would have hoped at the time, probably due to the project's age.
I still think there is value in it though, and I'd love to see YaCy have its protocol explained as a spec so people could build implementations in other languages more easily.
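The re-ranking idea described above can be sketched roughly as follows. This is only an illustration of "peers share the scores that went into a result, and the receiver re-ranks with its own weights"; it is not boyter's spec and not YaCy's protocol. The `PeerResult` type, the score names (`text`, `freshness`) and the `rerank` function are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class PeerResult:
    """One search hit as returned by a remote peer (hypothetical shape)."""
    url: str
    peer: str
    # Component scores the remote peer used for its own ranking.
    scores: Dict[str, float] = field(default_factory=dict)


def rerank(results: List[PeerResult], weights: Dict[str, float]) -> List[Tuple[str, float]]:
    """Merge hits from several peers and re-rank them with local weights.

    Each hit's local score is the weighted sum of its shared component
    scores. Duplicate URLs are collapsed, keeping the best-scoring copy.
    """
    best: Dict[str, float] = {}
    for r in results:
        total = sum(weights.get(name, 0.0) * value for name, value in r.scores.items())
        if r.url not in best or total > best[r.url]:
            best[r.url] = total
    ordered = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return [(url, round(score, 3)) for url, score in ordered]


if __name__ == "__main__":
    hits = [
        PeerResult("https://a.example", "peer1", {"text": 0.9, "freshness": 0.1}),
        PeerResult("https://b.example", "peer2", {"text": 0.5, "freshness": 0.9}),
    ]
    # A peer that only cares about text relevance, and one that prefers fresh pages.
    print(rerank(hits, {"text": 1.0}))
    print(rerank(hits, {"text": 0.2, "freshness": 1.0}))
```

Because the component scores travel with each result, two peers can rank the same merged result set differently without asking the remote peers again.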
ssijak about 1 year ago
A long time ago I worked for a startup called Wowd, which built a distributed search engine. It was acquihired by Facebook.
One of the biggest issues was how to entice people to download and run the client/node.
I half wondered afterwards if slapping some crypto on top of it, which would be mined by running the node and providing resources, would help. My gut says easy yes, but my mind grimaces at the abomination.
dredmorbius about 1 year ago
Previously:
YaCy – your own search engine | https://news.ycombinator.com/item?id=32597309 | 2 years ago | 93 comments
YaCy: Decentralized Web Search | https://news.ycombinator.com/item?id=22246732 | 4 years ago | 41 comments
YaCy – The Peer to Peer Search Engine | https://news.ycombinator.com/item?id=17089240 | 6 years ago | 3 comments
YaCy: a free distributed search engine | https://news.ycombinator.com/item?id=12433010 | 8 years ago | 24 comments
YaCy: Decentralized Web Search | https://news.ycombinator.com/item?id=8746883 | 9 years ago | 29 comments
YaCy takes on Google with open source search engine | https://news.ycombinator.com/item?id=3288586 | 12 years ago | 17 comments
renegat0x0 about 1 year ago
There are already many projects about search:
- https://www.marginalia.nu/
- https://searchmysite.net/
- https://lucene.apache.org/
- Elasticsearch
- https://presearch.com/
- https://stract.com/
- https://wiby.me/
I think all of these projects are fun. I would like to see one succeed in reaching a mainstream level of attention.
I have also been gathering link metadata for some time. Maybe I will use it to feed an eventual self-hosted search engine, or language model, if I decide to experiment with that.
- domains for seed: https://github.com/rumca-js/Internet-Places-Database
- bookmarks seed: https://github.com/rumca-js/RSS-Link-Database
- links for year: https://github.com/rumca-js/RSS-Link-Database-2024
WarOnPrivacy about 1 year ago
Yacy's still around. Nice.
After a year or two of hosting a Yacy instance (2014?) I started winding up on some general (probes, etc.) blacklists.
I also host a small mail server and I was getting mail returned. I'd force an IP swap and a few weeks later it'd be the same. I had to let Yacy go.
vGPU about 1 year ago
Has it gotten any better recently?
I run a node but I haven't actually used it as a search engine in a while, as I found the result quality to be exceedingly poor.
charcircuit about 1 year ago
Are the results still being gamed by sites using content keyword stuffing? The last time I used it the searching and ranking technology felt like they were 40 years behind state of the art.
DrDroop about 1 year ago
I once went to a workshop on a Sunday morning at the local makerspace to listen to someone talk about some kind of distributed search engine or something like that. One of the developers came from (I think) Germany to explain this to us, the centralized sheeple. He just gave a demonstration of the thing, like here is the box you type stuff in and here are the results. When I started to ask questions about how it worked and all, he sort of acted annoyed, saying it was all too difficult to explain. This was more than ten years ago, and yes, I am still angry about it.
rasulkireev about 1 year ago
Love it. Super easy to self host and use. Now I have a personal Google!
jrussbowman about 1 year ago
Nice to see search projects are still popping up. After a move, family life taking over, and me getting more interested in Unreal Engine, my poor search engine is now more of an experiment in seeing how well it runs while basically on life support, with only the maintenance updates I do. I'm starting to think I honestly should just take it down and save the $50 a month I spend maintaining it.
But I'll post it in a Hacker News comment and maybe you all will give it enough traffic that I can get excited about it again, lol
https://www.unscatter.com
fho about 1 year ago
I've used it several times over the last decades and never got good results. I think one instance is still running on my old computer at uni :-)
treprinum about 1 year ago
Is it worth dedicating 1-2 low-power NUCs (4-8 cores) to this on a 250 Mbit/s connection? Or does it need beefier CPUs/network?
b2bsaas00 about 1 year ago
Could this be used for a Torrent search engine?
buffalobuffalo about 1 year ago
I ran YaCy for a while, but not as a node on their distributed search index. I just ran it as a search engine for all my own bookmarks. Unfortunately I never found a particularly good way of getting bookmarks into the system. So eventually I shut it down. Cool idea in theory though.
fortran77 about 1 year ago
Related to this: I'd love to see individuals making web pages again, and federated search engines indexing them. People don't make their own hobby or fan or art websites anymore, and I think that's partly because nobody will ever find them with the big search engines.
maxloh about 1 year ago
See also: Presearch, another decentralized search engine, claimed that it will be open source. No source code available at the moment though.
https://presearch.com/
gonesilent about 1 year ago
Infrasearch / Gonesilent was sold to Sun, turned into Project JXTA, and died.
nairboon about 1 year ago
If you run YaCy with Docker and it is still a junior peer, does the search return results from the global index or just the one that appears to be 'preinstalled'?
anthk about 1 year ago
Ugh, Java. I'll wait for something like what i2pd is for I2P: something called yacyd, written in C, C++ or Go.
RGBCube about 1 year ago
    curl failed to verify the legitimacy of the server and therefore could not
    establish a secure connection to it. To learn more about this situation and
    how to fix it, please visit the web page mentioned above.
Can't seem to access the page.
arboles about 1 year ago
Sort of hijacking the thread to ask: can YaCy, or something similar, be an alternative to Google's Programmable Search Engine? All I use it for is to limit a search to a medium-sized list of domains. The aspect that makes running a search engine difficult on your own is lack of resources for crawling, I expect. But since I only care about a small list of domains, could I ditch Google's and run my own crawler like YaCy?
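Restricting a self-run crawler to a fixed list of domains comes down to a scope check on every URL before it is fetched or indexed. A minimal sketch of that check follows. It is not YaCy's configuration or API; the `ALLOWED_DOMAINS` set and the `in_scope` and `filter_frontier` helpers are hypothetical names used only for illustration.

```python
from typing import Iterable, List, Set
from urllib.parse import urlparse

# Hypothetical allowlist: the "medium-sized list of domains" to search.
ALLOWED_DOMAINS: Set[str] = {"example.org", "docs.example.com"}


def in_scope(url: str, allowed: Set[str] = ALLOWED_DOMAINS) -> bool:
    """Return True if the URL's host is an allowed domain or a subdomain of one."""
    host = (urlparse(url).hostname or "").lower()
    # Match on the parsed hostname, not the raw string, so that
    # "https://evil.example/?q=example.org" is not accepted.
    return any(host == d or host.endswith("." + d) for d in allowed)


def filter_frontier(urls: Iterable[str], allowed: Set[str] = ALLOWED_DOMAINS) -> List[str]:
    """Keep only the discovered links that the crawler is allowed to follow."""
    return [u for u in urls if in_scope(u, allowed)]


if __name__ == "__main__":
    discovered = [
        "https://example.org/page",
        "https://blog.example.org/post",
        "https://notexample.org/",
        "https://other.example/?ref=example.org",
    ]
    print(filter_frontier(discovered))
```

With a small allowlist the crawl frontier stays bounded, which is why the crawling cost mentioned above is much less of a problem than for a general web index.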