TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

RSS doesn't necessarily means firehose

10 点作者 surprisetalk14 天前

1 comment

PaulHoule14 天前
RSS readers need <i>algorithic feeds</i> [1] but unfortunately everyone interested in RSS thinks algorithm = bad.<p>My YOShInOn reader downloads somewhere between 3,000 to 30,000 items in a cycle [2] and chooses 300 top-scoring items out of 20 clusters. When I complete those, it runs another cycle. It has extra screens that show articles that it thinks would get &gt;10 votes or a comment&#x2F;vote ratio &gt; 0.5 on HN as well as screens to show top-scoring articles from particular sites and feeds (arXiv, lobsters, ...)<p>Articles in the primary feed are shown to me one at a time, I thumbs up or I thumbs down. The RoC for the classifier is about 0.78, I read TikTok gets 0.84 so I&#x27;m pretty happy.<p>The problems with it: (1) It depends on arangodb for which the license doesn&#x27;t allow me to commercialize it and I wouldn&#x27;t feel OK with open sourcing it. Right now I&#x27;m writing a python-arango replacement which will get it and my image sorter running on postgres out of a single code base. (2) the batch organization doesn&#x27;t work well for certain topics like sports where articles have a shelf life.<p>[1] doesn&#x27;t have to be &quot;creepy blond girls want to follow you&quot; or all outrage all the time, an algorithmic feed can apply any heuristic that <i>you</i> like.<p>[2] depending on how fast I am reading, quality gets better when I am reading slow. The system blends in a certain percentage of randomly chosen results to maintain calibration -- I&#x27;ve been thinking about making it run at a target quality level where it blends in more randoms if it thinks it is showing me too many good results.
评论 #43822505 未加载
评论 #43824062 未加载
评论 #43822720 未加载
评论 #43822897 未加载