I'm assuming that this was the intended link: https://aws.amazon.com/textract/
This one’s interesting because it seems to support more formats than Apache Tika and even includes speech recognition and OCR, all conveniently rolled into one package.