Ask HN: Where do backlink checking services get their data from?

8 points by maurtinshkreli almost 8 years ago
I'm talking about Moz, ahrefs, megaindex and the like...

2 comments

sebst almost 8 years ago
There is a comment that was voted dead but actually answered the question: these services have their own crawlers. If you ever spot, for example, MajesticBot in your access logs, you have found one of the biggest.
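
To make the access-log check concrete, here is a minimal Python sketch that counts hits from a few backlink crawlers. It rests on some assumptions: the log is in the common "combined" format with the User-Agent as the last quoted field, and the crawlers identify themselves with the substrings `MJ12bot` (Majestic), `AhrefsBot` (Ahrefs), and `rogerbot` / `DotBot` (Moz). The function name and the sample log lines are made up for illustration.

```python
import re
from collections import Counter

# Substrings assumed to identify backlink-index crawlers in the User-Agent.
CRAWLER_TOKENS = ("MJ12bot", "AhrefsBot", "rogerbot", "DotBot")

# In the combined log format the User-Agent is the last double-quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def count_crawler_hits(lines):
    """Return a Counter mapping crawler token -> number of matching log lines."""
    hits = Counter()
    for line in lines:
        match = UA_PATTERN.search(line)
        if not match:
            continue  # line does not end with a quoted field; skip it
        user_agent = match.group(1).lower()
        for token in CRAWLER_TOKENS:
            if token.lower() in user_agent:
                hits[token] += 1
    return hits


# Hypothetical sample lines; real logs would be read from a file instead.
SAMPLE_LOG = [
    '203.0.113.5 - - [10/Oct/2017:13:55:36 +0000] "GET / HTTP/1.1" 200 512 '
    '"-" "Mozilla/5.0 (compatible; MJ12bot/v1.4.8; http://mj12bot.com/)"',
    '198.51.100.7 - - [10/Oct/2017:13:56:01 +0000] "GET /blog HTTP/1.1" 200 2048 '
    '"-" "Mozilla/5.0 (compatible; AhrefsBot/5.2; +http://ahrefs.com/robot/)"',
    '192.0.2.44 - - [10/Oct/2017:13:57:12 +0000] "GET /about HTTP/1.1" 200 1024 '
    '"https://example.com/" "Mozilla/5.0 (X11; Linux x86_64) Firefox/56.0"',
]

if __name__ == "__main__":
    for token, count in count_crawler_hits(SAMPLE_LOG).items():
        print(token, count)
```

User-Agent strings can be spoofed, so a match is only a hint that one of these crawlers visited.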
tconaugh almost 8 years ago
They use distributed web crawlers to crawl hundreds of billions of web pages. Probably one of the following options:

1) Build their own crawlers.

2) Run an Apache Nutch/Heritrix cluster in a colo facility.

3) Use third-party services like mixnode.
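
Whichever option is used, the crawl output still has to be turned into backlink data. The Python sketch below shows one way that step could look, assuming the crawler has already fetched pages into a mapping of URL to HTML: extract every `<a href>` and invert the links into a "target URL -> linking pages" index. The names `LinkExtractor` and `build_backlink_index` and the sample pages are hypothetical, and this is not how any of the named services is known to work internally.

```python
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects absolute URLs from <a href="..."> tags on one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page's own URL.
                self.links.append(urljoin(self.base_url, value))


def build_backlink_index(pages):
    """pages: dict of page URL -> HTML. Returns target URL -> set of source URLs.

    Only http(s) links pointing to a different host are kept, since
    backlink reports are about links coming from other sites.
    """
    index = defaultdict(set)
    for source_url, html in pages.items():
        parser = LinkExtractor(source_url)
        parser.feed(html)
        source_host = urlparse(source_url).netloc
        for target in parser.links:
            parsed = urlparse(target)
            if parsed.scheme in ("http", "https") and parsed.netloc != source_host:
                index[target].add(source_url)
    return index


# Hypothetical crawl results standing in for fetched pages.
SAMPLE_PAGES = {
    "https://a.example/post": '<a href="https://b.example/">B</a> <a href="/about">About</a>',
    "https://c.example/list": '<a href="https://b.example/">B again</a> <a href="mailto:x@y.example">Mail</a>',
}

if __name__ == "__main__":
    for target, sources in build_backlink_index(SAMPLE_PAGES).items():
        print(target, "<-", sorted(sources))
```

At the scale mentioned above, the same inversion would have to run as a distributed job over the crawl data rather than in a single in-memory dictionary.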