科技回声

1 comment

blaydator大约 5 年前

Hi Hackers,<p>Often I get pdfs which I want to extract text from and paste it somewhere else. Not all PDFs are always well constructed and a lot of them are scanned ones. Unfortunately Mac's Preview or other classic PDF viewers can not extract text from those.<p>So I have built a minimalist website to extract text from any PDFs, scanned ones as well. It uses OCR to extract text and the user can highlight specific areas on the document to extract from. The extraction is made locally by the browser thanks to the awesome Tesseract.js library.<p>I would love to have your feedback before adding more features (zoom setting, improve areas selections, png/jpeg support, mobile support, offline support, ...).

评论 #22967475 未加载

评论 #22976132 未加载

评论 #22967441 未加载

Show HN: Extract text from any pdf in the browser

1 comment

Show HN: Extract text from any pdf in the browser

1 comment