TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Hacker News on BigQuery: Now with daily updates. Top domains and time to post?

52 点作者 fhoffa大约 8 年前

2 条评论

minimaxir大约 8 年前
A few comments about on Hacker News data (i.e why I haven&#x27;t played with the data in awhile):<p>1. The algorithm changed recently. This post uses &gt;40pts as a proxy for front pageness. That&#x27;s too conservative; even my 10pt threshold back then was conservative. With recent algorithm changes to Hacker News (&lt;1 yr), I&#x27;ve seen posts with <i>3pts</i> get into the Top 10 for whatever reason, which breaks predictive analysis.<p>2) The dataset&#x2F;this submission only includes submissions&#x2F; submission scores; comment scores were removed from the API which is disappointing.<p>3) Given that HN titles&#x2F;links can be edited by moderators (and they do a good job), it&#x27;s harder to judge initial submissions from the final result.<p>4) Slight edge case in the article, but link shorteners are auto-killed which is why youtu.be&#x2F;goo.gl links are not prominent.
评论 #13898412 未加载
评论 #13911481 未加载
koolba大约 8 年前
How does the data get to BigQuery? Anything special&#x2F;fun or just repeatedly polling the API endpoint?
评论 #13898380 未加载