Why extracting data from PDFs is still a nightmare for data experts

14 点作者 ilamont大约 2 个月前

1 comment

lxgr大约 2 个月前

At least personally, it all made a lot more sense to me once I realized that PDF is effectively a vector graphics format, and as such is much closer to e.g. SVG than to a text file or rich document format.