Ask HN: How to Build a News Aggregator?

4 points by psikomanjak, about 3 years ago

1 comment

PaulHoule, about 3 years ago

The front end that polls RSS feeds, sitemaps, and otherwise imports content into a database isn't too hard.

What's devilishly difficult is the nature of "news".

That is, when a "news" story happens (say Will Smith slaps Chris Rock at the Oscars) there will be hundreds of articles about it from mainstream publications right away.

For the news feed to be manageable you have to cluster these, otherwise you are going to be furious that you can't find any news in the middle of all that "spam".

Defining the cluster boundaries is tricky. For instance, "Will Smith v. Chris Rock" is an ongoing story. There is news about the initial event, but there could also be news about possible lawsuits, apologies, hard feelings, revenge. Also, people are going to write opinion pieces blowing it out their ass forever. So it's not so simple as "say something once why say it again..."; rather, you have to be able to initially identify an event and then identify a string of events which are connected to that original event.
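To make the clustering step concrete, here is a minimal sketch, not anything the comment itself prescribes. It assumes headlines are already in hand as plain strings and groups them with a greedy pass over Jaccard similarity of word sets. The names `tokens`, `jaccard`, `cluster_headlines`, the stopword list, and the `threshold` of 0.3 are all hypothetical choices made for illustration.

```python
import re

# Small illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"a", "an", "the", "at", "in", "on", "of", "to", "for", "and"}


def tokens(title):
    """Return the set of lowercase words in a headline, minus stopwords."""
    words = re.findall(r"[a-z0-9]+", title.lower())
    return {w for w in words if w not in STOPWORDS}


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b|."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_headlines(titles, threshold=0.3):
    """Greedy single-pass clustering.

    Each headline joins the first cluster whose seed headline is at least
    `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # list of (seed_token_set, member_titles)
    for title in titles:
        t = tokens(title)
        for seed, members in clusters:
            if jaccard(t, seed) >= threshold:
                members.append(title)
                break
        else:
            clusters.append((t, [title]))
    return [members for _, members in clusters]


# Made-up sample headlines for illustration only.
SAMPLE_TITLES = [
    "Will Smith slaps Chris Rock at the Oscars",
    "Chris Rock slapped by Will Smith at Oscars ceremony",
    "Will Smith apologizes to Chris Rock",
    "Fed raises interest rates by a quarter point",
]

for group in cluster_headlines(SAMPLE_TITLES):
    print(group)
```

With this threshold the apology headline lands in the same cluster as the original slap stories, which is exactly the boundary question raised above: one threshold cannot tell "another report of the same event" from "a new event in the same ongoing story". A real system would likely need timestamps, named entities, or article bodies to separate the two.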