TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Distributed Scraper

27 点作者 Nimsical大约 8 年前

4 条评论

afandian大约 8 年前
Is it me or is 'stdlib' not the best name for something that's not the <stdlib.h>?
评论 #13863673 未加载
评论 #13863463 未加载
评论 #13863282 未加载
评论 #13865021 未加载
djyaz1200大约 8 年前
Does anyone know of a scraper with a simple UI that&#x27;s usable by less technical people? Similar to Kimono (bought by Palantir and shut down). <a href="https:&#x2F;&#x2F;techcrunch.com&#x2F;2016&#x2F;02&#x2F;15&#x2F;palantir-acquires-kimono-labs-for-its-web-scraping-service&#x2F;" rel="nofollow">https:&#x2F;&#x2F;techcrunch.com&#x2F;2016&#x2F;02&#x2F;15&#x2F;palantir-acquires-kimono-l...</a>
评论 #13863633 未加载
评论 #13863799 未加载
评论 #13863361 未加载
评论 #13863712 未加载
评论 #13863510 未加载
Nimsical大约 8 年前
Mainly built this as an experiment to pull a bunch of data for some ML work I&#x27;ve been doing. Wrote about it more extensively here: <a href="https:&#x2F;&#x2F;hackernoon.com&#x2F;microservice-series-scraper-ee970df3e81f#.ex6qh4aek" rel="nofollow">https:&#x2F;&#x2F;hackernoon.com&#x2F;microservice-series-scraper-ee970df3e...</a><p>AMA!
wenbert大约 8 年前
&quot;Distributed&quot; as in using proxies? Where do the requests come from when I scrape a page?
评论 #13863277 未加载