TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Does anyone use Bayesian stats to filter interesting stories on the Net?

4 点作者 jcubic超过 2 年前
This idea was on my mind of a while.<p>There are a lot of stuff online and if you want to read something interesting you need to scroll endlessly on different platforms. Hacker News is one of them (even that it has top stories, you may be not interested in any of them but like something that was submitted but not get that much attention), other are:<p>* LinkedIN<p>* DEV.to<p>* Twitter Hashtags, Communities and timeline<p>* Facebook Groups and timeline<p>* Reddit<p>So my idea was why not use Bayesian SPAM filter to sort interesting stories. Feed them all in BogoFilter (That I use on my Linux box with Claws-Mail email client) or something similar. I think that most SPAM filter use statistics after Paul Graham famous essay.<p>You don&#x27;t even need AI for this. I&#x27;m almost certain that your and mine interesting stories have a lot of common that you can train SPAM filter to filter out noise.<p>From my experience and you probably agree only 1-5% of stuff that you see on above platforms are interesting to me and you.<p>So the question is does anyone tried something like this? I think that I&#x27;ve read a story about someone use something like this. But I would love to see something that will give me this. It can be Open Source I would welcome that with my open arms. But a paid service like this would be worth my money, even that I don&#x27;t like subscription based services.<p>I wanted to share my idea maybe it will inspire someone to create something like this, or maybe something like this already exists.<p>I have rough idea how something like may work. * You take Open Source RSS Reader and only need to add interface to like or dislike a story * Feed everything into the reader if something doesn&#x27;t have RSS try to came up with something that will convert that into RSS, e.g. Web Scrapping or Puppeteer (the last one would probably be required to Facebook).

1 comment

at-w超过 2 年前
Doesn&#x27;t the upvote system used by HN&#x2F;Reddit&#x2F;etc. accomplish the same thing? Outright spam of the kind detectable by a basic filter would presumably be detected by other users who vote on that content. It also has the added benefit of ranking content based on the kind of common appeal you mention, beyond just filtering outright spam, and avoiding the kind of false positives almost any spam filter would be susceptible to.
评论 #33630798 未加载