Show HN: Privacy-focused news feed that groups stories, using NLP

5 points by fortytwo79, over 3 years ago

2 comments

fortytwo79, over 3 years ago

I built Drewes.NEWS as a way to learn NLP and serverless architecture, and to find bias in news by reading the same story from multiple outlets. Now it's evolved into a useful, privacy-focused tool.

It's privacy-focused in that there are no cookies and no Google or Facebook components (like Google Analytics or Ads). No data tracking on users whatsoever.

There are bugs I'm aware of, but I'm looking for feedback on whether this format and function are useful.

For those interested in the NLP side of this or the serverless side, I'd be happy to answer questions about how it was put together. The short version is: I pull down RSS feeds from 33 news sites (approximately 1M stories in the database so far), store them, create a term frequency model, cluster the most recent 10k stories based on TF similarity vectors, then store the story similarities.

In the future I'd like to add paging, searching, and more filtering. I'm also thinking about having a URL for each story that would show all the similar articles. That way, if you don't want to link directly to a particular news source, you can link to the drewesnews aggregate URL, and the reader can pick whichever source he/she wants to read.

Any feedback would be much appreciated!
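
The pipeline described above (term-frequency vectors, then grouping by similarity) can be illustrated with a minimal Python sketch. This is not the author's code: the tokenizer, the cosine-similarity measure, the greedy grouping strategy, the 0.5 threshold, and all function names here are assumptions chosen for illustration, and a real system working over 10k stories would likely use a proper clustering library.

```python
import math
import re
from collections import Counter


def term_frequencies(text):
    """Return a normalized term-frequency vector (token -> share of all tokens)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {term: n / total for term, n in counts.items()}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def group_stories(headlines, threshold=0.5):
    """Greedy grouping (an assumed strategy, not the author's): each story joins
    the first group whose first member is at least `threshold` similar to it,
    otherwise it starts a new group. Returns lists of headline indices."""
    vectors = [term_frequencies(h) for h in headlines]
    groups = []
    for i, vec in enumerate(vectors):
        for group in groups:
            if cosine_similarity(vec, vectors[group[0]]) >= threshold:
                group.append(i)
                break
        else:
            groups.append([i])
    return groups


if __name__ == "__main__":
    # Made-up headlines, for illustration only.
    sample = [
        "Senate passes infrastructure bill",
        "Infrastructure bill passes Senate vote",
        "New telescope images reveal distant galaxy",
    ]
    print(group_stories(sample))  # → [[0, 1], [2]]
```

Raw term frequency treats common words the same as distinctive ones, so a sketch like this would likely need stop-word removal or inverse-document-frequency weighting before it grouped full articles well.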
bredren, over 3 years ago

Looks interesting.

Quick note: the first time I went to the site it was blank other than the navbar. I did a refresh, went back and then forward again, and finally tapped the hamburger icon, at which point the content showed.

I suspect the content load is massive, since the page looks very long. Maybe a spinner or a shorter page would prevent this UX delay.

Subsequent page loads bring up the content right away.