Ask HN: Best languages or frameworks for high concurrency web scraping?

2 points by CoreSet almost 10 years ago
Hi HN. I'm doing research that calls for taking as tight a temporal snapshot of various websites as I possibly can (i.e., grabbing content from them all simultaneously). I've been playing around with PhantomJS and various Python solutions, but none of them is very performant.

Any suggestions on where to start looking for a more rigorous answer?

1 comment

philbritton almost 10 years ago
If you don't need to execute JS, then maybe try a simple HTTP GET to retrieve the contents, then process it separately. If you're looking to parse and extract from the page, I'd recommend Beautiful Soup. If you're interested in trying a Node alternative, check out Cheerio.
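
To make the "simple GET, then process separately" idea concrete, here is a rough sketch using only the Python standard library. It is one possible approach under some assumptions, not a tuned solution: the names `fetch`, `snapshot`, `extract_title`, and `TitleParser` are made up for this example, a thread pool stands in for whatever concurrency model you end up choosing, and the standard library's `html.parser` stands in for Beautiful Soup so the snippet has no third-party dependencies.

```python
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor
from html.parser import HTMLParser


def fetch(url, timeout=10.0):
    """Plain HTTP GET. Returns (url, request start time, body bytes or None)."""
    started = time.time()
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return url, started, resp.read()
    except OSError:
        # URLError, HTTPError, and socket timeouts are all OSError subclasses.
        return url, started, None


def snapshot(urls, fetcher=fetch, max_workers=32):
    """Fetch all URLs concurrently; results come back in input order."""
    if not urls:
        return []
    workers = min(max_workers, len(urls))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetcher, urls))


class TitleParser(HTMLParser):
    """Collects the text inside the <title> element."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def extract_title(body):
    """Processing step, run after all fetches are done."""
    parser = TitleParser()
    parser.feed(body.decode("utf-8", errors="replace"))
    return parser.title.strip()
```

Calling `snapshot(list_of_urls)` returns one tuple per URL with the time each request started, so you can check afterwards how tight the snapshot actually was. Keeping parsing out of `fetch` means slow extraction never delays the requests themselves.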