TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: Textra – extract text from images, PDFs, and audio (Mac CLI tool)

1 pointsby freedmandover 2 years ago

1 comment

freedmandover 2 years ago
Hi HN!<p>A few months ago, I was playing with Apple&#x27;s live text tool which allows the user to select text from images. I noticed the quality of the text extraction was unreasonably good, and it worked offline. As someone who has studied free, on-device OCR libraries extensively, I had not seen anything with such high quality that works locally.<p>I built out Textra over a few weeks as a response, which is a command-line tool for Mac OS 13+ (needed to access the latest high quality APIs) that gives the user easy access to Apple&#x27;s on-device text extraction APIs. The tool is in early stages but has hit a point where it feels like it&#x27;s approaching a good mix of stability and utility.<p>The text extraction (OCR) quality is among the best and fastest I&#x27;ve seen for on-device recognition (approaches the quality of cloud services like AWS&#x2F;Google). The speech extraction is not as high quality as recent alternatives like OpenAI&#x27;s Whisper, but it is very fast, efficient, and probably good enough for many use cases (and doesn&#x27;t require a large download).<p>It is my hope that this open source tool can help journalists and the public with text extraction needs.