TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
Why extracting data from PDFs is still a nightmare for data experts
14 points
by
ilamont
about 2 months ago
1 comment
lxgr
about 2 months ago
At least personally, it all made a lot more sense to me once I realized that PDF is effectively a vector graphics format, and as such is much closer to e.g. SVG than to a text file or rich document format.