It might be fun to exercise this method across an information-theoretically well-bounded set of shapes or object domains, to try to quantify its limits in generating useful, independent forms of novelty.

For example, you might use it to formulate a set of wavelets that, combined judiciously, would effectively span a well-defined distribution of shapes generated from a small grammar. In doing so, you could quantify the shape variance and identify which augmentation transformations added the most value for training (by minimally modeling that variance) and which added the least.

Maybe you could also combine this with t-SNE to build intuition about which 'wavelet' manifested where in the trained net, which resonated most strongly, and in concert with which other wavelets. You could explore this across different CNN sizes and designs, looking for evidence of wavelet ensembles or hierarchies.

With some careful engineering, you could try to force emergent autoencoders to reveal themselves and then explore their interactions.
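As a rough illustration of the "small grammar plus quantified variance" idea, here is a toy sketch (all names and the grammar itself are invented for illustration): generate simple parametric shapes, apply a few augmentation transforms, and use an SVD spectrum as a crude stand-in for "how much variance the augmented ensemble actually spans."

```python
import numpy as np

rng = np.random.default_rng(0)

def grammar_shape(n_sides, radius):
    """A tiny 'grammar': regular polygons parameterized by (n_sides, radius)."""
    theta = np.linspace(0, 2 * np.pi, n_sides, endpoint=False)
    return np.stack([radius * np.cos(theta), radius * np.sin(theta)], axis=1)

def augment(pts, angle, scale, jitter):
    """Augmentation transforms: rotation, scaling, and Gaussian jitter."""
    c, s = np.cos(angle), np.sin(angle)
    R = np.array([[c, -s], [s, c]])
    return scale * pts @ R.T + rng.normal(0.0, jitter, pts.shape)

def descriptor(pts, k=32):
    """Resample the boundary to a fixed-length vector so shapes are comparable."""
    idx = np.linspace(0, len(pts) - 1, k).round().astype(int)
    return pts[idx].ravel()

# Build an augmented dataset from the grammar.
shapes = [grammar_shape(n, r) for n in (3, 4, 5, 6) for r in (0.5, 1.0)]
X = np.array([descriptor(augment(s, a, sc, 0.01))
              for s in shapes
              for a in (0.0, 0.4, 0.8)
              for sc in (0.9, 1.1)])

# SVD spectrum of the centered dataset: how many principal axes are needed
# to capture the ensemble's variance. Comparing spectra with an augmentation
# family switched off would hint at how much variance that family contributes.
Xc = X - X.mean(axis=0)
sv = np.linalg.svd(Xc, compute_uv=False)
var_ratio = sv**2 / np.sum(sv**2)
print(np.round(np.cumsum(var_ratio)[:5], 3))
```

This only measures raw geometric variance, not training value; the comment's stronger version would compare a trained net's performance with each augmentation family ablated.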