TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Web scraping in production?

4 点作者 arkmm1 天前
Are any of you maintaining any web scrapers in production?<p>I&#x27;ve done some for side projects, automated testing, and personal scripts (checking personal bank balances, getting a Global Entry interview slot, etc.), but it always feels very brittle.<p>Curious what applications people have in industry and what sorts of techniques people use for reliability.

2 条评论

9d1 天前
Scraping is inherently brittle, but it can be very useful for short-term scraping in very specific circumstances. I haven&#x27;t had any in maybe 10 years.
评论 #43977647 未加载
sargstuff1 天前
excel web scraping[0] (vs. using python[1] and&#x2F;or odbc&#x2F;delimited files)<p>A few 2025 use cases [2],[3]:<p><pre><code> Use publically available database information (construction, taxes, sales, traffic report, proposed building&#x2F;zone changes, etc) to find out what&#x27;s going on withing an area aka. zip code, housing area, &#x27;vacation spot&#x27;, etc </code></pre> ----<p><pre><code> creative take on topic: modern looming &#x2F; static &#x27;threaded&#x27; approach : https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43977384 Structurally reprogrammable magnetic maetamaterials hold promise for biomedicine, soft robotics. (&quot;web&quot; support formed via scraping material in relevant patterns) : https:&#x2F;&#x2F;techxplore.com&#x2F;news&#x2F;2025-05-reprogrammable-magnetic-metamaterials-biomedicine-soft.html 3d printed smart-fabrics : https:&#x2F;&#x2F;techxplore.com&#x2F;news&#x2F;2025-05-d-smart-fabrics-flexibility-ability.html </code></pre> ----<p>[0] : excel scraping : <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?app=desktop&amp;v=6coVzIt93vk" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?app=desktop&amp;v=6coVzIt93vk</a><p>[1] : python scraping : <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Oo8-nEuDBkk" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Oo8-nEuDBkk</a><p>[2] : <a href="https:&#x2F;&#x2F;dataforest.ai&#x2F;blog&#x2F;top-web-scraping-use-cases" rel="nofollow">https:&#x2F;&#x2F;dataforest.ai&#x2F;blog&#x2F;top-web-scraping-use-cases</a><p>[3] : <a href="https:&#x2F;&#x2F;www.parsehub.com&#x2F;blog&#x2F;web-scraping-examples&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.parsehub.com&#x2F;blog&#x2F;web-scraping-examples&#x2F;</a>
评论 #43977540 未加载