TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: News Searching API

40 点作者 caballeto将近 5 年前

10 条评论

caballeto将近 5 年前
This is the side-project, that I have been building recently. The API allows searching over the data from many news sites and news aggregators, it also allows to easily extract whole news sites. I built it for two reasons. First, I wanted to have a source of recent news data to play with, essentially to run data analysis and train ml models on. Secondly, because alternative services had wildly high pricing, like 400-500$ per month, and I thought I could build and deploy such a system for a much cheaper price and improve my programming skills along the way. I would very much appreciate any comments/suggestions about the serive or how to make it better :)
评论 #23759556 未加载
评论 #23788004 未加载
评论 #23759146 未加载
评论 #23762004 未加载
asdkhadsj将近 5 年前
Do you host the content as well? What&#x27;s the legality of that?<p>I ask because I want to embed some archiving and &quot;reader mode&quot; logic into an app of mine that would be FOSS and self hosted. However that means each individual would be effectively scraping and archiving, and possibly p2p spreading, news content<i>(as data sources)</i>.<p>So I&#x27;m curious if there is some underlying &quot;fair use&quot;-like mechanism that allows Archive, Outline.com, and you to consume news content without it being considered piracy.
评论 #23761157 未加载
graupel将近 5 年前
I used to run a large US publisher with 50+ sites writing 50,000 articles a month; if we found someone scraping full articles and offering them for use via an API it would be a problem - even moreso if that person was charging for it. I&#x27;m all for hacking and innovation around this space, just be careful here and don&#x27;t break TOS.
评论 #23785502 未加载
artembugara将近 5 年前
Disclaimer: I am a co-founder of a similar News API service <a href="https:&#x2F;&#x2F;newscatcherapi.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;newscatcherapi.com&#x2F;</a><p>No news API solution returns the full body text of the article (including us). The reason is - copyright infringement in US and EU.<p><i></i>You can return only chunk of it<i></i><p>At least, that&#x27;s what all lawyers I spoke to told me
评论 #23768356 未加载
评论 #23764054 未加载
xiekomb将近 5 年前
Hi,<p>Simple and effective API. The documentation lacks details about the query parameter. I want to search for A AND B but q=?A%20B does not yield the expected result (seems a OR query) or for exact phrase &quot;A invests in B&quot; do not seem to work. Can you please post details for advanced syntax like this?
评论 #23768438 未加载
ibdf将近 5 年前
Tried it.. pretty straight forward. Free account rate is good for personal use. I think it comes down to 4 calls per day which is reasonable if you are using this as maybe your daily news fetcher.
评论 #23761601 未加载
todsacerdoti将近 5 年前
I added your API to Pipedream. Check out an example -- <a href="https:&#x2F;&#x2F;pipedream.com&#x2F;@tod&#x2F;datanews-io-example-p_ljC13p&#x2F;edit" rel="nofollow">https:&#x2F;&#x2F;pipedream.com&#x2F;@tod&#x2F;datanews-io-example-p_ljC13p&#x2F;edit</a><p>If you or your users need to build automated workflows with Datanews, please let us know any feedback.
moneywoes将近 5 年前
Can you talk about how you avoid getting blocked by captcha? Do you use RSS feeds? In that case do you not return images?
评论 #23760940 未加载
评论 #23760999 未加载
jslakro将近 5 年前
How do you get news from its sources?
评论 #23761290 未加载
sagunsh将近 5 年前
What tech stack are you using? And how many news source does it collect data from?
评论 #23789987 未加载