TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Is there any tool to create pdf parsing heuristics?

1 pointsby jcmp3 months ago
There are a lot of PDF parsing tools, non of them really works for me. So i thought I can define heuristics myself (whitespace, headers, footer) since i only need to parse one type of pds. Is there any tool which can help me to configure those heuristics?

1 comment

cratermoon3 months ago
The xpdf tools <a href="https:&#x2F;&#x2F;www.xpdfreader.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.xpdfreader.com&#x2F;</a> have some pretty good options.