TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: How do you deal with rogue bots?

6 pointsby zaderabout 13 years ago
I run a niche dating and social networking website. After I built my own traffic analysis suite, instead of just relying on Google Analytics, I realized a ton of my bandwidth and server resources are devoted to serving up content to bots.<p>Many are valid bots whose spidering I welcome, but some of the most aggressive do not identify themselves as bots and are from places like North Korea and Russia. So I'm researching solutions, and the best I've come up with so far is using a bad_ips table in my Rails app to block addresses such as the ones listed in the blacklist at myip.ms.<p>How are other online entrepreneurs dealing with this phenomenon? Are there pre-existing solutions out there that are worth using, or should I proceed with my own custom model? Can any of you recommend best practices or seasoned advice in this area?

2 comments

paulsutterabout 13 years ago
You may want to use countermeasures that aren't so easily detected by the bot owners. You want to lengthen their feedback loop. They know instantly when you block them and can just move to a different server at amazon.<p>For example you could put them in a tarpit, and gradually serve successive pages more and more slowly. Exponential backoff or something.<p>You could also serve bad data back to them when they get so far into the tarpit.<p>It all depends on how much work it's worth doing.<p>Id love to hear more ideas, it's a fascinating question. So many sites have this problem, and its an endless cat and mouse game, perhaps there is a product opportunity in here somewhere.
TeHCrAzYabout 13 years ago
Just ban the most intrusive bots; Its just going to turn into whack a mole otherwise.