An Intuitive Explanation of Convolutional Neural Networks (2016)

150 points by jlukecarlson over 7 years ago

7 comments

jatsign over 7 years ago
How do CNNs work when the output is multiple categories? For instance, in the same image is a cat and a dog and a car. What's the architecture look like - multiple CNNs, each that can predict one category? Or does one CNN have multiple outputs and if the score > threshold, add that category to the list shown to the user?

Also, how do CNNs draw a box around the target in the image?
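The second option in the question (one network, several outputs, keep every score above a threshold) can be sketched as below. This is only an illustration of that idea, not something taken from the article: the logits, the class names, the 0.5 threshold, and the helper name predict_labels are all made up, and it assumes each class gets its own independent sigmoid score rather than sharing a softmax.

```python
import math

def sigmoid(x: float) -> float:
    """Squash a raw score into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def predict_labels(logits, class_names, threshold=0.5):
    """Return every class whose independent sigmoid score exceeds the threshold."""
    scores = {name: sigmoid(z) for name, z in zip(class_names, logits)}
    return [name for name, score in scores.items() if score > threshold]

# Hypothetical raw outputs from the final layer of a single CNN, one per class.
logits = [2.0, 1.2, -3.0, 0.4]
class_names = ["cat", "dog", "bird", "car"]

labels = predict_labels(logits, class_names)
print(labels)  # ['cat', 'dog', 'car']
```

Because each score is judged on its own, any number of classes (including none) can pass the threshold for the same image.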
air7 over 7 years ago
> Parameters like number of filters, filter sizes, architecture of the network etc. have all been fixed before Step 1 and do not change during training process – only the values of the filter matrix and connection weights get updated.

Is this just the article's over-simplification or are these values really just randomly selected?
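One way to read the quoted passage, sketched in Python: settings such as the number and size of filters are picked by whoever designs the network, while the filter values start out random and are the only thing an update step changes. The names NUM_FILTERS, FILTER_SIZE, and sgd_step, the learning rate, and the gradient values are all invented for illustration; a real network would get its gradients from backpropagation.

```python
import random

# Hyperparameters: chosen by the designer before training, never updated by it.
NUM_FILTERS = 2
FILTER_SIZE = 3

random.seed(0)  # fixed seed so the sketch is repeatable

# Learnable weights: randomly initialized, then adjusted during training.
filters = [[random.uniform(-0.1, 0.1) for _ in range(FILTER_SIZE)]
           for _ in range(NUM_FILTERS)]

def sgd_step(filters, grads, lr=0.1):
    """Return new filter values after one gradient-descent update."""
    return [[w - lr * g for w, g in zip(f, gf)]
            for f, gf in zip(filters, grads)]

# Made-up gradients standing in for what backpropagation would compute.
grads = [[0.5, -0.2, 0.1], [0.0, 0.3, -0.4]]
updated = sgd_step(filters, grads)
```

After the step there are still NUM_FILTERS filters of FILTER_SIZE values each; only the numbers inside them have moved.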
biocomputation over 7 years ago
I actually don't think this is a good explanation at all. I'm not saying it's badly written, just that it's not a good explanation for the stated purpose (serving as an intuitive explanation).

To this point, the article is certainly NOT intuitive if you don't already understand image convolution. The explanation is also very long and rambling. While I understand the author has made an effort, I don't think the article really presents the subject matter in a new way: I can learn all of this elsewhere. This is a common problem when people write about complex subject matter without fully understanding the knowledge gap between teacher and audience.

If I were the author, I might try to read up on technical communication and spend some time figuring out how to correctly simplify something. As it stands, this article uses the typical strategy of information hiding to simplify the subject matter. The problem is that information hiding doesn't work very well unless it is expertly done. I do like the animation, but again, it only serves to show how image convolution works, and doesn't actually teach us anything about a CNN.

I would suggest the author break the document into three separate sections, the first being very simple (maybe start with the part that says 'images are just matrices') and then add more details in each section. The final section would have a lot of detail. That way you counteract the information blindness that occurs from simplification by providing the information later.

Otherwise, this article is really more of a data dump than an intuitive explanation, and since it doesn't really teach us anything we can't learn elsewhere, I don't see what it contributes.

A cleaner explanation, expertly prepared, could really elevate the effort that went into this.
junkcollector over 7 years ago
The article is all right, but newbies reading it should be a little careful. The author is sloppy with terminology in a way that can trip up someone who is just learning. For example, a kernel and a filter are not the same thing.
sigstoat over 7 years ago
anyone happen to be familiar with any uses of CNNs on 1D "images"? (like you'd get from linear image sensors https://toshiba.semicon-storage.com/ap-en/product/sensor/linear-sensor.html )

i hit up google scholar occasionally looking for references, but literally everything seems to be applying them to 2D images.
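For what the 1D case looks like mechanically, here is a minimal sketch of a single 1D convolution pass (strictly a cross-correlation, as most deep learning libraries compute it, with no padding). The scanline values, the edge-detecting kernel, and the function name conv1d are invented for illustration and do not come from any sensor or library.

```python
def conv1d(signal, kernel):
    """Slide the kernel along the signal and return one dot product per position."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]

# Hypothetical scanline from a linear sensor: a bright bar on a dark background.
scanline = [0, 0, 0, 10, 10, 10, 0, 0]

# A hand-written kernel that responds to changes in brightness.
edge_kernel = [-1, 0, 1]

response = conv1d(scanline, edge_kernel)
print(response)  # [0, 10, 10, 0, -10, -10]
```

The positive values mark the rising edge of the bar and the negative values the falling edge. In a trained 1D CNN the kernel values would be learned rather than written by hand.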
AlphaWeaver over 7 years ago
This article was very helpful. The animations did wonders to show how the networks iterate.
tehsauce over 7 years ago
Breezes right over back-propagation, arguably the most crucial part :/