科技回声

18 comments

sbarre 2 months ago

I find it hard to accept something that talks about "OCR" when I upload a PDF with text in images and then, on querying the document after upload, get a message that says "I can't interpret images". So are you actually doing OCR, or are you just extracting embedded text?

setnone 2 months ago

Sweet branding! Grandma told me she's not happy with the lack of a privacy policy.

simonw 2 months ago

I built a CLI tool for experimenting with Mistral OCR here: https://simonwillison.net/2025/Mar/7/mistral-ocr/

Honestly, the vibes aren't great. Gemini is a lot more flexible for handling PDFs - you can prompt it to do a bunch of other things - and Mistral OCR appears to hallucinate if it can't correctly read handwriting, a common problem with vision LLM based OCR tools.

The way Mistral OCR handles images within the text is disappointing - it doesn't attempt to interpret them, just extracts them out as binary blobs. A vision LLM can usually do a great job of describing an image, but with Mistral OCR you have to manually run that as a separate step.

bilater 2 months ago

OK I've been critical of Mistral AI but credit where credit is due. Mistral OCR seems cool.

So cool in fact, I got distracted and ended up building an open source PDF parser and chat app!

Presenting Auntie PDF - your all-knowing guide that unpacks every PDF into clear, actionable insights.

You can upload a PDF or point to a public link, parse it, and then ask questions. All open source and free.

jbaudanza 2 months ago

I have a question about Mistral OCR. If I give the model a PDF that is 90% text, is it actually performing OCR on an image representation of the text? Or is it smart enough to extract the text directly and only use OCR on images?

foundzen 2 months ago

Love the creativity in the branding but it did not work in my case either. Gibberish raw content and error in answering any question.

t-3 2 months ago

What are people using these OCR programs for? Are there really that many PDFs being made without embedded text these days?

elanning 2 months ago

It looks great, nice work. I’m impressed at the quick development too.

JoelJacobson 2 months ago

Thanks for creating this, really useful! It would be nice to have a [Download Combined Rendered] button to download a self-contained .html web page of the rendered combined page.

shnpln 2 months ago

I would like it if my chat session did not clear when I go to Document Content and back to chat. Or I wish I could see my document while chatting.

daft_pink 2 months ago

Is there a way to use Mistral OCR on US servers so your data never leaves our borders?

mjyoon 2 months ago

Unfortunate that Mistral OCR can't tell me details presented in charts and graphs.

yannis 2 months ago

Pretty impressive, and it did a good job on an academic PDF I uploaded. Nice UI also.

ab_testing 2 months ago

This is amazing. Could you share the prompts that were used for this product?

triyambakam 2 months ago

The coolest thing about this is the short, easy-to-pronounce .com

n8m8 2 months ago

I'm on mobile and don't have a PDF to test it with, but I love your styling and text copy.

throwaway81348 2 months ago

What about privacy?

eastoeast 2 months ago

Awesome UI!

Auntie PDF – an open source app built using Mistral OCR
