Frog: OCR Tool for Linux

172 points by Epitom3 over 2 years ago

13 comments

recuter over 2 years ago
Appears to be a nice wrapper around Tesseract:

https://github.com/tesseract-ocr/tessdata

https://en.wikipedia.org/wiki/Tesseract_(software)

The demo of course works perfectly on a Mac, as this is already built into Ventura.

If you haven't experienced it yet, ye olde ctrl-f now seamlessly sneaks a peek into images on the page, for example. Surprisingly useful.

    In November 2020, Brewster Kahle from the Internet Archive praised Tesseract saying: Tesseract has made a major step forward in the last few years. When we last evaluated the accuracy it was not as good as the proprietary OCR, but that has changed; we have done evaluations and it is just as good, and can get better for our application because of its new architecture.

Anybody have an up-to-date breakdown of available OCR solutions?
twobitshifter over 2 years ago
On Windows this is built into PowerToys. Win+Shift+T is the default shortcut.

https://learn.microsoft.com/en-us/windows/powertoys/text-extractor
mavu over 2 years ago
I probably shouldn't, but I dislike it when people use libraries for the main part of their project and then don't even mention that they are using them.

Seems dishonest to me, but maybe I'm just too strict.
rjzzleep over 2 years ago
This looks like a nice app. I was looking for something like this a while back until I noticed that there are "one" liners that you can set up for a hotkey:

    #!/usr/bin/env bash
    # Languages offered in the dmenu picker
    langs=(eng ara fas chi_sim chi_tra deu ell fin heb hun jpn kor nld rus tur)
    lang=$(printf '%s\n' "${langs[@]}" | dmenu "$@")
    # Select a screen region, OCR it, and copy the text to the clipboard
    maim -us | tesseract --dpi 145 -l "eng+${lang}" - - | xsel -bi
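A hedged sketch of the one-liner above as a reusable function. If the dmenu prompt is cancelled, `lang` is empty and the original passes `eng+` to Tesseract. The hypothetical helper `build_lang_spec` (not part of the original comment) drops an empty or duplicate second language. `ocr_selection` is also an illustrative name, and falling back to English only on cancel is an assumption about the desired behaviour.

```shell
#!/usr/bin/env bash

# Hypothetical helper (not from the original comment): join a base language
# with an optional extra one into a value for tesseract's -l flag.
build_lang_spec() {
  local base="$1" extra="${2:-}"
  if [ -z "$extra" ] || [ "$extra" = "$base" ]; then
    printf '%s\n' "$base"
  else
    printf '%s+%s\n' "$base" "$extra"
  fi
}

# Illustrative wrapper: pick a language with dmenu, then OCR a screen selection.
# Assumes maim, tesseract, dmenu and xsel are installed.
ocr_selection() {
  local lang
  # If dmenu is cancelled, fall back to English only (an assumption).
  lang=$(printf '%s\n' "$@" | dmenu) || lang=""
  maim -us | tesseract --dpi 145 -l "$(build_lang_spec eng "$lang")" - - | xsel -bi
}

# Example hotkey command (commented out so sourcing this file has no side effects):
# ocr_selection ara deu jpn
```

Keeping the language handling in its own function means it can be checked without a display or any OCR tooling installed.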
lervag over 2 years ago
Cool! I've seen similar ideas before and made my own inspired by these some years ago. It's a simple bash script based on Flameshot [0] for taking the screenshot and Tesseract:

    #!/usr/bin/env bash
    rm -f /tmp/screen.png
    # Interactive capture, saved to /tmp/screen.png
    flameshot gui -p /tmp/screen.png
    # tesseract appends .txt to the output base, so this writes /tmp/screen.txt
    tesseract \
      -c page_separator="" \
      -l "eng" \
      --dpi 145 \
      /tmp/screen.png /tmp/screen
    if [ "$(wc -l < /tmp/screen.txt)" -eq 0 ]; then
      notify-send "ocrmyscreen" "No text was detected!"
      exit 1
    fi
    xclip /tmp/screen.txt
    notify-send "ocrmyscreen" "$(cat /tmp/screen.txt)"

[0]: https://flameshot.org/
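One detail in the script above: `wc -l` counts newline characters, so it reports 0 for text with no trailing newline and a non-zero count for output that consists only of blank lines. A minimal sketch of an alternative check, assuming the intent is "the file contains at least one non-whitespace character". `has_text` is a hypothetical helper name, not part of the original script.

```shell
#!/usr/bin/env bash

# Hypothetical helper: succeed only if the file contains at least one
# non-whitespace character.
has_text() {
  grep -q '[^[:space:]]' "$1"
}

# Possible use in place of the wc -l test (paths as in the script above):
# if ! has_text /tmp/screen.txt; then
#   notify-send "ocrmyscreen" "No text was detected!"
#   exit 1
# fi
```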
ensocode over 2 years ago
This is a nice app, thanks. I am using a similar, slightly less UI-heavy tool that is also based on Tesseract. It's called Normcap: https://github.com/dynobo/normcap
xchip over 2 years ago
Nice, but it should give credit to Tesseract and mention how much disk space the UI dependencies require.
seltzered_ over 2 years ago
Oh nice. There hasn't been a good OCR screenshot tool with Wayland support yet, so I look forward to trying this. IIRC there's been:

Linux: dpScreenOCR (X11 only, last I checked) and now Frog

macOS: Screenotate, Prizmo

Windows: Screenotate

I don't get all the nitpick comments. OCR tools like this are extremely useful for excerpting text from certain websites (Slack) or taking class notes from video.
habibur over 2 years ago
Uses Tesseract OCR for the OCR part.
schappim over 2 years ago
FYI if you're on a Mac, I've made this similar tool: https://github.com/schappim/macOCR
noisediver over 2 years ago
A useful tool and great UI work. A handy extension would be the ability to extract text of a specific colour, e.g. the highlights in Kindle's Cloud Reader, to get around the 10% highlight export cap that Amazon puts on most books. I did this previously by running the screenshot through ImageMagick's colour filling and thresholding options before passing the output to Tesseract. A colour picker tool might be a nice addition.
throwawaaarrgh over 2 years ago
...why is it named Frog?
jalacang over 2 years ago
L