Thanks for this. I tried using Tesseract over the weekend to extract text from a game screenshot and had no luck. The documentation for Tesseract is rather opaque; maybe I'll have better luck with Ocropus.
I wonder if it's possible to remove the need for post-processing of the LSTM's output by integrating transcription into the neural network model directly.
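To make concrete what I mean by post-processing, here is a minimal sketch. It assumes the LSTM emits one label per input frame plus a "blank" symbol, in the style of CTC, and that the transcript is recovered by merging repeats and dropping blanks. The `BLANK` symbol and the `collapse` helper are hypothetical names for illustration, not anything taken from Ocropus or Tesseract.

```python
from itertools import groupby

BLANK = "_"  # hypothetical blank symbol; real models use a reserved class index


def collapse(frames):
    """Merge runs of identical frame labels, then drop the blank symbol."""
    # groupby yields one key per run of consecutive equal labels
    merged = (label for label, _ in groupby(frames))
    return "".join(label for label in merged if label != BLANK)


# Per-frame labels for a short line of text (made-up input)
print(collapse("hh_e_ll_ll_oo"))
```

The blank between the two runs of `l` is what lets a doubled letter survive the merge. This step is the part I would like the model to absorb.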