TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Abusive AI Web Crawlers: Get Off My Lawn

23 pointsby bluehatbritabout 1 month ago

4 comments

PeterStuerabout 1 month ago
You can monetize your app users by partnering with providers that offer SDKs for residential proxy networks. These services let users opt-in to share their internet connection, earning you revenue while they get benefits like ad-free experiences.<p>How It Works: Providers like Proxyrack, Live Proxies, Rayobyte, and Infatica allow you to integrate their SDKs into your app. Users who agree to join the proxy network contribute their device’s bandwidth, often used for web scraping, and you get paid based on their activity—typically per monthly or daily active user.<p>So it need not be &quot;compromised Android SetTop Boxes&quot;, but just millions of free apps running on user&#x27;s phones.
评论 #43558428 未加载
DarkPlayerabout 1 month ago
We observed the same behavior. Each request used a different IP address and a random user agent. In our case, most of the IP addresses belonged to Chinese ISPs. They went to great lengths to avoid being blocked, but at the same time used user agents such as Windows 95&#x2F;98 or IE 5. Fortunately, the combination of the odd user agents and the fact that they still use HTTP&#x2F;1.1 makes them somewhat easy to identify. So you can use a captcha on more expensive endpoints to block them.
intellectronicaabout 1 month ago
I don&#x27;t understand the current thing about &quot;AI Crawlers&quot;. Maybe someone can help educate me.<p>How is it related to AI? Do AI crawlers do something different from traditional search index crawlers? Or is it simply a proliferation of crawlers because of the growth of AI products?<p>What makes AI special in this context?
评论 #43556947 未加载
评论 #43556924 未加载
评论 #43576987 未加载
评论 #43556845 未加载
lostmsuabout 1 month ago
Why does the author of this post assume their increase in traffic has anything to do with &quot;AI&quot; specifically?