TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Scraping Reddit in python

1 点作者 xcoding超过 8 年前
How do yo scrape reddit?

3 条评论

fhoffa超过 8 年前
Why do you want to scrape reddit?<p>If you want comments and posts, they are already shared.<p>For example, in BigQuery:<p><a href="https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;bigquery&#x2F;comments&#x2F;3cej2b&#x2F;17_billion_reddit_comments_loaded_on_bigquery&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;bigquery&#x2F;comments&#x2F;3cej2b&#x2F;17_billion...</a>
habitual_coder超过 8 年前
read the api documentation for reddit, follow the rules, use python+praw to make requests. Writing a script with BeautifulSoup or similar seems like a headache.
savethefuture超过 8 年前
You write a scrapper, you could use python. Beautiful Soup might be useful.