TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Web scraping in production?

4 pointsby arkmm26 days ago
Are any of you maintaining any web scrapers in production?<p>I&#x27;ve done some for side projects, automated testing, and personal scripts (checking personal bank balances, getting a Global Entry interview slot, etc.), but it always feels very brittle.<p>Curious what applications people have in industry and what sorts of techniques people use for reliability.

2 comments

sargstuff26 days ago
excel web scraping[0] (vs. using python[1] and&#x2F;or odbc&#x2F;delimited files)<p>A few 2025 use cases [2],[3]:<p><pre><code> Use publically available database information (construction, taxes, sales, traffic report, proposed building&#x2F;zone changes, etc) to find out what&#x27;s going on withing an area aka. zip code, housing area, &#x27;vacation spot&#x27;, etc </code></pre> ----<p><pre><code> creative take on topic: modern looming &#x2F; static &#x27;threaded&#x27; approach : https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43977384 Structurally reprogrammable magnetic maetamaterials hold promise for biomedicine, soft robotics. (&quot;web&quot; support formed via scraping material in relevant patterns) : https:&#x2F;&#x2F;techxplore.com&#x2F;news&#x2F;2025-05-reprogrammable-magnetic-metamaterials-biomedicine-soft.html 3d printed smart-fabrics : https:&#x2F;&#x2F;techxplore.com&#x2F;news&#x2F;2025-05-d-smart-fabrics-flexibility-ability.html </code></pre> ----<p>[0] : excel scraping : <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?app=desktop&amp;v=6coVzIt93vk" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?app=desktop&amp;v=6coVzIt93vk</a><p>[1] : python scraping : <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Oo8-nEuDBkk" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Oo8-nEuDBkk</a><p>[2] : <a href="https:&#x2F;&#x2F;dataforest.ai&#x2F;blog&#x2F;top-web-scraping-use-cases" rel="nofollow">https:&#x2F;&#x2F;dataforest.ai&#x2F;blog&#x2F;top-web-scraping-use-cases</a><p>[3] : <a href="https:&#x2F;&#x2F;www.parsehub.com&#x2F;blog&#x2F;web-scraping-examples&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.parsehub.com&#x2F;blog&#x2F;web-scraping-examples&#x2F;</a>
评论 #43977540 未加载
9d26 days ago
Scraping is inherently brittle, but it can be very useful for short-term scraping in very specific circumstances. I haven&#x27;t had any in maybe 10 years.
评论 #43977647 未加载