TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Anyone interested in contributing to Domo web crawler project?

2 pointsby hinogluabout 15 years ago
Hi Hn,<p>http://github.com/hinoglu/Domo/ is a python based remotely manageable web crawler based on Python, multiprocessing, pyro, lxml and pycurl.<p>It's main purpose is to be able provide a lightweight python based alternative to archive.org's heritrix. Main structure is similar to heritrix and Domo provides almost similar but somewhat lesser features.<p>Though Domo is a web crawler/archiver, it can be used for anything that requires remote multiinstance / multiprocess job management features, thanks to Pyro and multiprocessing libraries.<p>I'd be glad if anyone would like to contribute or join the project.

no comments

no comments