I built a CLI tool for experimenting with Mistral OCR here: <a href="https://simonwillison.net/2025/Mar/7/mistral-ocr/" rel="nofollow">https://simonwillison.net/2025/Mar/7/mistral-ocr/</a><p>Honestly, the vibes aren't great. Gemini is a lot more flexible for handling PDFs - you can prompt it to do a bunch of other things - and Mistral OCR appears to hallucinate if it can't correctly read handwriting, a common problem with vision LLM based OCR tools.<p>The way Mistral OCR handles images within the text is disappointing - it doesn't attempt to interpret them, just extracts them out as binary blobs. A vision LLM can usually do a great job of describing an image, but with Mistral OCR you have to manually run that as a separate step.