Ask HN: How to Build a News Aggregator?

4 points by psikomanjak about 3 years ago

1 comment

PaulHoule about 3 years ago
The front end that polls RSS feeds, sitemaps, and otherwise imports content into a database isn't too hard.

What's devilishly difficult is the nature of "news".

That is, when a "news" story happens (say Will Smith slaps Chris Rock at the Oscars) there will be hundreds of articles about it from mainstream publications right away.

For the news feed to be manageable you have to cluster these, otherwise you are going to be furious that you can't find any news in the middle of all that "spam".

Defining the cluster boundaries is tricky. For instance, 'Will Smith v. Chris Rock' is an ongoing story. There is news about the initial event, but there could also be news about possible lawsuits, apologies, hard feelings, revenge. Also, people are going to write opinion pieces blowing it out their ass forever. So it's not so simple as "say something once why say it again..."; rather, you have to be able to initially identify an event and then identify a string of events which are connected to that original event.
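To make the clustering idea concrete, here is a minimal sketch, not something the comment itself proposes: a greedy single-pass grouping of headlines by Jaccard overlap of their words. The names `tokens`, `jaccard`, `Cluster`, and `cluster_titles`, the tiny stopword list, and the 0.3 threshold are all assumptions made up for illustration; a real aggregator would likely use article bodies, timestamps, and a stronger similarity measure.

```python
import re
from dataclasses import dataclass, field

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"a", "an", "the", "at", "in", "on", "of", "to", "for", "and"}


def tokens(title: str) -> frozenset:
    """Lowercase a headline and return its non-stopword words as a set."""
    words = re.findall(r"[a-z0-9]+", title.lower())
    return frozenset(w for w in words if w not in STOPWORDS)


def jaccard(a, b) -> float:
    """Jaccard similarity |a & b| / |a | b|; 0.0 if either set is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


@dataclass
class Cluster:
    titles: list = field(default_factory=list)  # headlines in this story
    vocab: set = field(default_factory=set)     # union of their tokens


def cluster_titles(titles, threshold=0.3):
    """Assign each headline to the most similar existing cluster,
    or start a new cluster if nothing reaches the threshold."""
    clusters = []
    for title in titles:
        toks = tokens(title)
        best, best_score = None, 0.0
        for cluster in clusters:
            score = jaccard(toks, cluster.vocab)
            if score > best_score:
                best, best_score = cluster, score
        if best is None or best_score < threshold:
            best = Cluster()
            clusters.append(best)
        best.titles.append(title)
        best.vocab |= toks  # the cluster's vocabulary grows with each story
    return clusters


if __name__ == "__main__":
    headlines = [
        "Will Smith slaps Chris Rock at the Oscars",
        "Will Smith apologizes to Chris Rock",
        "Central bank raises interest rates",
    ]
    for i, cluster in enumerate(cluster_titles(headlines)):
        print(i, cluster.titles)
```

The threshold is where the boundary problem from the comment shows up: set it too high and follow-ups such as apologies or opinion pieces split off into their own clusters, set it too low and unrelated stories that share a name get merged.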