TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Best way to digitize and translate printed articles?

1 pointsby kdom13over 6 years ago
I have a bunch of newspaper clippings and pages from old articles about my grandfather, all in Swedish.<p>I would like to digitize all articles and translate them to English in a semi-automated way. I know little Swedish so I can&#x27;t translate them myself, plus there&#x27;s over 100 article clippings.<p>Has anyone ever been through this process or something similar? I would appreciate any tips on what software to use.

1 comment

jppopeover 6 years ago
Theres a series of Optical Character Recognition repos that should help you with task #1. They are all based around Google&#x27;s Tesseract. If I remember correctly this is one of the top=&gt; <a href="https:&#x2F;&#x2F;github.com&#x2F;danielquinn&#x2F;paperless" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;danielquinn&#x2F;paperless</a> I&#x27;ve used project naptha in the past... and little known fact that google docs can do the OCR automatically too.<p>regarding the translation... never had to do it. sorry!
评论 #18251044 未加载