Ask HN: Best way to digitize and translate printed articles?

1 point by kdom13, over 6 years ago

I have a bunch of newspaper clippings and pages from old articles about my grandfather, all in Swedish.

I would like to digitize all the articles and translate them to English in a semi-automated way. I know little Swedish so I can't translate them myself, plus there are over 100 article clippings.

Has anyone ever been through this process or something similar? I would appreciate any tips on what software to use.

1 comment

jppope, over 6 years ago

There's a series of Optical Character Recognition repos that should help you with task #1. They are all based around Google's Tesseract. If I remember correctly, this is one of the top ones: https://github.com/danielquinn/paperless

I've used Project Naptha in the past, and a little-known fact is that Google Docs can do the OCR automatically too.

Regarding the translation: I've never had to do it. Sorry!
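
For the OCR step, here is a minimal sketch of batch-processing a folder of scans with the Tesseract command-line tool. It assumes `tesseract` is on the PATH and that its Swedish language data (`swe`) is installed. The helper names `build_tesseract_command` and `ocr_folder` and the folder layout are hypothetical, not something from the thread.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List

# File types treated as scans (an assumption; adjust to your scanner output).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def build_tesseract_command(image: Path, out_base: Path, lang: str = "swe") -> List[str]:
    """Return the argv for: tesseract <image> <outputbase> -l <lang>.

    Tesseract appends ".txt" to the output base on its own.
    """
    return ["tesseract", str(image), str(out_base), "-l", lang]


def ocr_folder(scan_dir: Path, text_dir: Path, lang: str = "swe") -> List[Path]:
    """OCR every image in scan_dir and return the text files expected in text_dir."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    text_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for image in sorted(scan_dir.iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        out_base = text_dir / image.stem
        subprocess.run(build_tesseract_command(image, out_base, lang), check=True)
        written.append(text_dir / (image.stem + ".txt"))
    return written
```

Calling `ocr_folder(Path("scans"), Path("text"))` would leave one Swedish text file per clipping, ready to be fed to whatever translation tool you choose. OCR quality on old newsprint varies, so expect to proofread the output.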