科技回声

4 条评论

titanix2超过 7 年前

If I understand correctly, the guy downloaded a data set, train it and get happy because it performs over 95% on the training set from the data set.This is not interesting at all because the hard part is to actually segment and classify characters in a real text, and in real life they don’t conveniently come as 28x28 images. As someone with a few hours of training in reading that kind of texts the difficulties are:- taking appart characters (hentaigana) that looks very close- segmenting characters because size and length vary a lot- differentiating between kana and kanji because the former are deritaves of the latter and they sometime look quite alikeSo the experimenter should have started his post telling he didn’t know any Japanese instead of closing it as such (even if one can notice when he just use modern kana as classes).

评论 #15700841 未加载

评论 #15701521 未加载

pornel超过 7 年前

The training set contains errors. The に row is especially messy: it's full of 小, ふ, and み.

评论 #15703214 未加载

peterburkimsher超过 7 年前

I want to do the same for Chinese.I have a collection of 1500 fonts, and finally exported all the PNGs for 75,000 characters. Now I need to pad, crop, and scale to make 28x28 (or 32x32, or 64x64, or another resolution).Then I want to do the Machine Learning (Classify) step.The article doesn't go into any detail about how to install Classify, how to import the training and testing data, and how to then actually run it. I watched the videos from CS231n because of my boss, but again, I'm still not really sure what to do practically.If I have lots of folders of images, what should I do to build an OCR program?

评论 #15702013 未加载

评论 #15701540 未加载

评论 #15701736 未加载

wodenokoto超过 7 年前

Anyone know the work he refers to as the inspiration for this article?

4 条评论

titanix2超过 7 年前

评论 #15700841 未加载

评论 #15701521 未加载

pornel超过 7 年前

The training set contains errors. The に row is especially messy: it's full of 小, ふ, and み.

Classifying Japanese characters from the Edo period

4 条评论

Classifying Japanese characters from the Edo period

4 条评论