TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

The Science of Crawl, Part 2: Content Freshness

27 pointsby jisaacsoover 10 years ago

1 comment

dennybritzover 10 years ago
Nice post.<p>I only skimmed the post, but I believe you are assuming the utility of all pages to be equal. By &quot;utility&quot; I mean the value of information contained on the page relative to what your business is trying to achieve (not the organization&#x27;s utility as you define it in the blog post). However, in practice, aren&#x27;t information certain pages of much greater value to the business than others? For example, finding a new front page article on the NYT website could be more valuable than detecting 50 new Hacker News submissions. However, the NYT page would exhibit less divergence than the HN page.