Reverse OCR

745 点作者 mrtbld超过 10 年前

28 条评论

I was thinking about this: <a href="http://www.cs.toronto.edu/~graves/handwriting.html" rel="nofollow">http://www.cs.toronto.edu/~graves/handwriting.html</a>

评论 #8564642 未加载

评论 #8563091 未加载

评论 #8561891 未加载

评论 #8564220 未加载

评论 #8562041 未加载

praptak超过 10 年前

This is similar to the project where images of clouds were fed to face recognition software: <a href="http://ssbkyh.com/works/cloud_face/" rel="nofollow">http://ssbkyh.com/works/cloud_face/</a>

评论 #8562515 未加载

评论 #8562279 未加载

jparishy超过 10 年前

Not strictly related, but reminded me of the exercise in genetic programming by Roger Alsing: <a href="http://rogeralsing.com/2008/12/07/genetic-programming-evolution-of-mona-lisa/" rel="nofollow">http://rogeralsing.com/2008/12/07/genetic-programming-evolut...</a>It's a rather cool attempt to draw the Mona Lisa using random, semi-transparent polygons

评论 #8562646 未加载

bane超过 10 年前

This could be a cool way to visually "encrypt" messages. They're readable, but only by the correct tool. I wonder how these squiggles might be creatively arranged steganographicly in an image and still be "read" by the OCR tool.

评论 #8562219 未加载

评论 #8562885 未加载

评论 #8562062 未加载

kitd超过 10 年前

Could be used for automated printing of doctors' prescriptions ;)

mrtbld超过 10 年前

Perhaps this could lead to a new kind of captcha that only bots can solve. I doubt it would be efficient, though.

评论 #8562178 未加载

评论 #8562591 未加载

评论 #8562320 未加载

评论 #8562138 未加载

评论 #8562093 未加载

评论 #8562052 未加载

carsonreinke超过 10 年前

Looks like he has written tons of very creative bots. They are all very interesting ideas (e.g. <a href="http://randomshopper.tumblr.com" rel="nofollow">http://randomshopper.tumblr.com</a>)

评论 #8564629 未加载

sgentle超过 10 年前

It would be pretty interesting to see one degree of abstraction up from this - what sets of lines are close enough to match a certain word?If you averaged over all those sets, would the resulting blobby heatmap resemble the original word in a legible form? Or something else?

userbinator超过 10 年前

I can imagine generating a few pages or even an entire book of this, and some future generations attempting to figure out what sort of language it was written in... reminds me of this:<a href="http://en.wikipedia.org/wiki/Voynich_manuscript" rel="nofollow">http://en.wikipedia.org/wiki/Voynich_manuscript</a>

cosarara97超过 10 年前

I couldn't get that OCR to read my mouse-written E. It's a nice experiment nevertheless.

评论 #8561902 未加载

klausa超过 10 年前

I highly recommend watching talk Darius Kazemi (author of Reverse OCR) gave at this years XOXO: <a href="http://www.youtube.com/watch?v=l_F9jxsfGCw" rel="nofollow">http://www.youtube.com/watch?v=l_F9jxsfGCw</a>

emhart超过 10 年前

It has been fantastic watching Darius' myriad experiments over the past few years. His work always has a great mixture of whimsy and serious experimentation.

MrBra超过 10 年前

Nice. Finally computers approached the age of writing. :)

lucb1e超过 10 年前

I can already imagine the innovation:> Type over this text to prove that you are a computer.> Human detected. Shoo, shoo!

Aaronneyer超过 10 年前

Looks like my handwriting

z3t4超过 10 年前

I can't believe OCR has not been solved yet. The only one even close is OmniPage.

评论 #8566653 未加载

driverdan超过 10 年前

Here's the source code on github: <a href="https://github.com/dariusk/reverseocr" rel="nofollow">https://github.com/dariusk/reverseocr</a>

jostmey超过 10 年前

A generative model, although computationally expensive, would not suffer this problem. Essentially a generative model can run in reverse, which means that if you feed values into the output you get inputs that could explain the output. Check out "Boltzmann Machines" for an example. There are plenty of examples for the MNIST dataset of hand written digits.

k_sze超过 10 年前

I think one of the problems is that the OCR assumes the images to be (English) letters.To be really really useful, the OCR would need to consider at least all characters in the Unicode Basic Multilingual Plane. And then it needs to be able to reject an image as containing any word, and then it needs to solve the halting problem.

zwass超过 10 年前

This reminds me of an experiment I played with using random search to "teach" the browser how to draw characters: <a href="http://zwass.github.io/Learn2Write/" rel="nofollow">http://zwass.github.io/Learn2Write/</a>

bmh100超过 10 年前

This actually seems like a great program for automatically generating adversarial examples to improve OCR. A human could rate this text as being illegible or legible. Each example can then be added to the training data to improve its quality.

评论 #8565073 未加载

eurleif超过 10 年前

It would be neat to see the same thing, except using two OCR libraries instead of just one, and requiring both libraries to be able to read the message. I imagine the letters would start to look a bit less insane.

shangxiao超过 10 年前

This is pretty cool, although it makes me wonder what the real world applications could be. It does, at the very least, tantalise my curiosity and gets me thinking.

achr2超过 10 年前

Could this be used in a pseudo reverse CAPTCHA by showing a series of words, and asking the user to say which is not human readable?

methyl超过 10 年前

I wonder what would happen if you run this program letter-by-letter, possibly the readability could increase.

mslot超过 10 年前

I love algorithmic art.

Applico超过 10 年前

very cool idea.

jdimov超过 10 年前

What (if anything) is this saying about the quality of the OCR process? Especially since none of these seem human readable.

评论 #8562251 未加载

评论 #8562326 未加载

评论 #8563325 未加载

评论 #8563329 未加载