I probably could have used this recently when I had a project which required a close encounter with extracting data from PDFs. Fortunately the PDFs were generated as a report by a VB6 application (!) so they had a fairly regular format once I figured out the quirks of PDF, as the authors describe here.<p>I did learn a few neat tricks by doing it myself though. The library I used to extract the text was none other than Mozilla's own PDF.js, so in the final version my users could just drag and drop the PDF onto the browser window, and my little algorithm parsed the tables into arrays, with AngularJS rendering them as HTML tables.<p>Obviously computer-vision assisted, general purpose reconstruction of tabular data is the secret sauce in this project, but if you have the right use case you can do some cool things in the client. You do have to dig into the PDF.js internals a bit to figure out how to use it but I'm sure that it will improve in that respect.