TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: The Java Web Scraping Handbook

7 pointsby ksahinabout 5 years ago

1 comment

ksahinabout 5 years ago
Hey Hacker News,<p>Today Pierre and I are releasing the Java Web Scraping Handbook for FREE!<p>And by free we mean you don&#x27;t even have to give us your email address!<p>Some backstory about the book: I originally wrote it in 2018 after working in different web scraping projects for startups (Mint.com like) and banks.<p>The first four chapters are language agnostic, and the last can be applied to any language, so don&#x27;t be scared if you don&#x27;t know Java!<p>By the end of the book, you will know:<p>- How to scrape any website<p>- Just enough XPath &#x2F; Regex &#x2F; DOM knowledge to be dangerous.<p>- How to deal with Javascript-heavy websites (Single Page application...)<p>- How to programmatically perform actions on a website behind a login form<p>- Parse information inside PDFs<p>- Bypass captchas<p>- Deploy your scrapers in the cloud<p>I&#x27;m happy to answer any questions about the book :)