科技回声

5 comments

sandreas, nearly 2 years ago

It depends on the amount, format and quality of your input.

There are free / open source tools (like Tesseract), but if you want to use them, some manual or (semi-)automatic preprocessing steps (thresholding / binarization, deskewing, noise removal [1]) are very important to get results nearly comparable to commercial tools.

Some Tesseract-based solutions are better integrated with automatic preprocessing; you could take a look at Papermerge or other self-hosted document management solutions [2].

There are also commercial SDKs built around Tesseract at a good price point, like Vintasoft OCR [5], which supports automatic preprocessing and delivers decent quality.

If you don't mind a (free) clicking adventure with small amounts of documents, you could also try the free version of PDF X-Change viewer [3], which has a small but pretty good option to embed the OCR result as a PDF layer, making PDFs "searchable". However, the embedded OCR data cannot be easily extracted.

The best "no cloud" / offline solution I found was Abbyy FineReader [4], which also has a command line tool. But if you really want a ready-to-use, easy, good-quality solution, I would go with Google Lens (if you don't mind Google).

[1] https://towardsdatascience.com/pre-processing-in-ocr-fc231c6035a7
[2] https://github.com/awesome-selfhosted/awesome-selfhosted#document-management
[3] https://www.tracker-software.com/product/pdf-xchange-editor
[4] https://www.pdf-xchange.de/pdf-xchange-viewer/
[5] https://www.vintasoft.com/vsocr-dotnet-index.html
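To make the threshold / binarization step from the comment above more concrete, here is a minimal sketch in plain Python. It uses Otsu's method, which is one common way to pick a global threshold; the comment itself does not name a specific algorithm, so that choice is an assumption. The function names `otsu_threshold` and `binarize` are made up for illustration, and the image is assumed to be a list of rows of 8-bit grayscale values. In a real project you would more likely rely on an image library than on hand-written loops.

```python
def otsu_threshold(pixels):
    """Return a threshold t (0-255) for 8-bit grayscale values using Otsu's method.

    Pixels with value <= t are treated as one class, pixels > t as the other.
    """
    # Histogram of gray levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_low = 0      # number of pixels with value <= t
    sum_low = 0    # sum of gray values of those pixels

    for t in range(256):
        w_low += hist[t]
        sum_low += t * hist[t]
        if w_low == 0:
            continue            # no pixels in the lower class yet
        w_high = total - w_low
        if w_high == 0:
            break               # all pixels are in the lower class

        mean_low = sum_low / w_low
        mean_high = (sum_all - sum_low) / w_high
        # Between-class variance (up to a constant factor).
        var_between = w_low * w_high * (mean_low - mean_high) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t

    return best_t


def binarize(image):
    """Map a grayscale image (list of rows) to pure black (0) and white (255)."""
    t = otsu_threshold([p for row in image for p in row])
    return [[255 if p > t else 0 for p in row] for row in image]


if __name__ == "__main__":
    # Hypothetical 3x2 "scan": dark ink on a bright page.
    page = [[30, 200], [210, 35], [40, 220]]
    for row in binarize(page):
        print(row)
```

A global threshold like this tends to struggle with unevenly lit scans; in that case an adaptive (local) threshold is usually the better fit, at the cost of more tuning.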

beardyw, nearly 2 years ago

A bit off topic but I've just started using Google Lens to extract whole pages from books with my phone. Near perfect conversion to text is great for taking notes.

smoldesu, nearly 2 years ago

I still use Tesseract. It's not the fastest or most accurate anymore, but it gets what I need out of PDF files.

is_true, nearly 2 years ago

We started using Tesseract for a project that needed to extract text from video frames. In the end we moved to EasyOCR, as it needed less preprocessing for our use case.

itake, nearly 2 years ago

What languages do you need to support? Off-the-shelf models don't work well on non-Latin scripts. You may need to train your own.

Ask HN: What OCR tool do you use in your project?
