科技回声

8 条评论

jph00超过 7 年前

I don't understand why this seems to be getting so much attention. There are plenty of small image datasets around, and wide recognition of the issues with MNIST.I see no evidence at all that this particular dataset is better than MNIST. None of the issues they themselves list with MNIST are discussed with relation to their proposed replacement.The benchmarks they provide are entirely useless - sklearn does not claim to be a platform for computer vision models. A quick WRN model gets 96% of this dataset (h/t @ajmooch on Twitter), suggesting that it doesn't deal with the "too easy" issue.The images clearly don't deal with the problem of lack of translation invariance.On the downside, they don't have the same ease of understanding of hand-drawn digits, which is extremely helpful for teaching, debugging, and visualizing.

评论 #15121949 未加载

评论 #15121987 未加载

评论 #15121528 未加载

评论 #15125113 未加载

nip超过 7 年前

How would you go about generating such dataset?1. Scrape images and store as png2. Downscale to 28px3. Convert each image to grayscale4. Convert to matrices and add label (additional row?)5. Normalize to have matrices of 1 and 0 for faster computation6. Vectorize said matrices7. Concatenate into one big vectorDid I miss something / Am I fooling myself?I plan on working on my first ML side project and I would love to gain some insights from HN.

评论 #15120867 未加载

评论 #15119747 未加载

评论 #15120583 未加载

eggie5超过 7 年前

Looks like this was sourced from in-house at some German online retailer: zalando.de. There is a similar data set from from amazon sourced by UCSD: <a href="http://jmcauley.ucsd.edu/data/amazon/" rel="nofollow">http://jmcauley.ucsd.edu/data/amazon/</a>And our research on recommenders using it: <a href="http://sharknado.eggie5.com" rel="nofollow">http://sharknado.eggie5.com</a>Particularly, the 2D scatter of the CNN features: <a href="http://sharknado.eggie5.com/tsne" rel="nofollow">http://sharknado.eggie5.com/tsne</a>

评论 #15119651 未加载

edshiro超过 7 年前

I'd love to play around with this dataset! It certainly seems richer than MNIST, and would most likely force the network to extract more features.But just like MNIST, it seems to lack variety in the positioning of the important elements, they are all centered which means that they don't train the network in being translation invariant. I presume this issue can be tackled with data augmentation techniques like applying affine transformations.

评论 #15119985 未加载

stared超过 7 年前

For a MNIST-like dataset, I often use not-MNIST (<a href="http://yaroslavvb.blogspot.com/2011/09/notmnist-dataset.html" rel="nofollow">http://yaroslavvb.blogspot.com/2011/09/notmnist-dataset.html</a>), which is more difficult than the original one (see examples of misclassified digits here: <a href="https://docs.neptune.ml/get-started/character-recognition/" rel="nofollow">https://docs.neptune.ml/get-started/character-recognition/</a>).However, I am not sure if we need more MNIST-like datasets. With small size many things make much less sense (data augmentation, even convnets as images are centered anyway) plus using many channels is a typical things (IRL I rarely work with grayscale images). So I am curious, in which way this dataset is better than CIFAR-10?See my note on datasets in Learning Deep Learning, <a href="http://p.migdal.pl/2017/04/30/teaching-deep-learning.html#datasets" rel="nofollow">http://p.migdal.pl/2017/04/30/teaching-deep-learning.html#da...</a>.

a3864超过 7 年前

If I am understanding the side-by-side comparison correctly, then the performance is highly correlated with MNIST (at least for high accuracy methods).<a href="https://i.imgur.com/viV7gFB.png" rel="nofollow">https://i.imgur.com/viV7gFB.png</a> (x-axis: Fashion, y-axis: MNIST)

评论 #15120658 未加载

ntenenz超过 7 年前

One of the reasons people have shifted away from MNIST is that it's simply too easy. Single channel, small image size, few classes, etc. Unfortunately, this does not address any of these concerns.

singularity2001超过 7 年前

How is this 'better' then cifar10 / cifar100?

评论 #15120622 未加载

8 条评论

jph00超过 7 年前

评论 #15121949 未加载

评论 #15121987 未加载

评论 #15121528 未加载

评论 #15125113 未加载

nip超过 7 年前

评论 #15120867 未加载

评论 #15119747 未加载

评论 #15120583 未加载

eggie5超过 7 年前

评论 #15119651 未加载

edshiro超过 7 年前

评论 #15119985 未加载

stared超过 7 年前

a3864超过 7 年前

评论 #15120658 未加载

ntenenz超过 7 年前

One of the reasons people have shifted away from MNIST is that it's simply too easy. Single channel, small image size, few classes, etc. Unfortunately, this does not address any of these concerns.

singularity2001超过 7 年前

How is this 'better' then cifar10 / cifar100?

评论 #15120622 未加载

An MNIST-like fashion product dataset

8 条评论

An MNIST-like fashion product dataset

8 条评论