OCR on old documents is hard. OCR Workbench uses AI for OCR and provides an editing environment for the cleanup that is inevitably required.<p>Inspired by this Hacker News post: <a href="https://news.ycombinator.com/item?id=43048698">https://news.ycombinator.com/item?id=43048698</a><p>Backstory: I was having trouble producing transcriptions of Colonial American documents, which pose their own unique challenges for OCR, and tools like Tesseract fail miserably on them. So I built something. It uses Gemini and seems to work pretty well (disclaimer: you need your own API key). I didn't build in Claude support, but I expect it would work similarly well.<p>FWIW: largely vibe coded, with human review and intervention as required.
Interesting concept. I tried it on a text written in Church Slavonic, and it didn't work. I guess the documents can't be THAT old. It would also be nice if you could upload images individually instead of having to select everything from a folder. Either way, nice work.