Appears to be a nice wrapper around Tesseract:<p><a href="https://github.com/tesseract-ocr/tessdata" rel="nofollow">https://github.com/tesseract-ocr/tessdata</a><p><a href="https://en.wikipedia.org/wiki/Tesseract_(software)" rel="nofollow">https://en.wikipedia.org/wiki/Tesseract_(software)</a><p>The demo of course works perfectly on a Mac, since this is already built into Ventura.<p>If you haven't experienced it yet, ye olde ctrl-f now seamlessly sneaks a peek into images on the page, for example, which is surprisingly useful.<p><pre><code> In November 2020, Brewster Kahle from the Internet Archive praised Tesseract saying:
Tesseract has made a major step forward in the last few years. When we last evaluated the accuracy it was not as good as the proprietary OCR, but that has changed– we have done evaluations and it is just as good, and can get better for our application because of its new architecture.
</code></pre>
Does anybody have an up-to-date breakdown of available OCR solutions?