Ask HN: Is there any tool to create PDF parsing heuristics?

1 point by jcmp, 2 months ago
There are a lot of PDF parsing tools, but none of them really work for me. So I thought I could define the heuristics myself (whitespace, headers, footers), since I only need to parse one type of PDF. Is there any tool that can help me configure those heuristics?

1 comment

cratermoon, 2 months ago
The xpdf tools (https://www.xpdfreader.com/) have some pretty good options.
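
For a single known document type, the heuristics named in the question (whitespace, headers, footers) can be layered on top of plain `pdftotext -layout` output. What follows is a minimal sketch under stated assumptions, not a feature of xpdf itself: it assumes the `pdftotext` binary is on `PATH`, that pages in its output are separated by form feeds, and that columns in the layout output are separated by at least two spaces. The function names (`extract_pages`, `strip_repeated_edges`, `split_columns`) and the thresholds are hypothetical and would need tuning per document type.

```python
import re
import subprocess
from collections import Counter


def extract_pages(pdf_path):
    """Run `pdftotext -layout` and return one string per page.

    Assumption: pdftotext is installed and on PATH. Passing "-" as the
    output file sends the text to stdout; pages are split on form feeds.
    """
    result = subprocess.run(
        ["pdftotext", "-layout", "-enc", "UTF-8", pdf_path, "-"],
        capture_output=True,
        check=True,
    )
    text = result.stdout.decode("utf-8", errors="replace")
    return [page for page in text.split("\f") if page.strip()]


def normalize(line):
    """Collapse digit runs so 'Page 1' and 'Page 2' compare equal."""
    return re.sub(r"\d+", "#", line.strip())


def strip_repeated_edges(pages, edge_lines=2, min_share=0.6):
    """Header/footer heuristic: drop lines that recur at the top or
    bottom of most pages. Returns a list of line lists, one per page."""
    counts = Counter()
    split_pages = []
    for page in pages:
        lines = [ln.rstrip() for ln in page.splitlines() if ln.strip()]
        split_pages.append(lines)
        edges = lines[:edge_lines] + lines[-edge_lines:]
        # Count each normalized edge line at most once per page.
        counts.update({normalize(ln) for ln in edges})

    threshold = max(2, int(len(pages) * min_share))
    repeated = {key for key, n in counts.items() if n >= threshold}

    cleaned = []
    for lines in split_pages:
        kept = []
        for i, ln in enumerate(lines):
            at_edge = i < edge_lines or i >= len(lines) - edge_lines
            if at_edge and normalize(ln) in repeated:
                continue
            kept.append(ln)
        cleaned.append(kept)
    return cleaned


def split_columns(line):
    """Whitespace heuristic: treat runs of 2+ spaces as column gaps."""
    return re.split(r"\s{2,}", line.strip())


def parse(pdf_path):
    """Full pipeline: extract, strip headers/footers, split columns."""
    pages = strip_repeated_edges(extract_pages(pdf_path))
    return [[split_columns(ln) for ln in page] for page in pages]
```

Calling `parse("report.pdf")` (a placeholder file name) would return, per page, a list of rows split into cells. One known limitation of this sketch: a data row that happens to repeat at the top or bottom of most pages would be stripped along with the real headers and footers, so `edge_lines` and `min_share` should be set conservatively.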