Very well written, and I applaud the effort. But personally I don't care for the "magical" aura that writers tend to give ANNs; to me, they are simply (non-linear) function approximators with a nice fitting algorithm. They work well for some problems and poorly for others. Also, beware of overfitting: ANNs tend to be parameter-heavy, although there are approaches to prune the connections.
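To make that concrete, here's a minimal sketch of what I mean (plain numpy, toy data invented for illustration): a one-hidden-layer network fit to noisy samples of sin(x) by gradient descent. A nonlinear function approximator plus a fitting algorithm, nothing more; crank up the hidden-unit count relative to the data and you can watch it start to fit the noise.

    import numpy as np

    # Toy data: noisy samples of sin(x) on [-pi, pi].
    rng = np.random.default_rng(0)
    x = np.linspace(-np.pi, np.pi, 200).reshape(-1, 1)
    y = np.sin(x) + 0.1 * rng.standard_normal(x.shape)

    H = 20  # hidden units; more units = more parameters = easier to overfit
    W1, b1 = 0.5 * rng.standard_normal((1, H)), np.zeros(H)
    W2, b2 = 0.5 * rng.standard_normal((H, 1)), np.zeros(1)

    lr = 0.05
    for step in range(10000):
        h = np.tanh(x @ W1 + b1)        # nonlinear hidden layer
        pred = h @ W2 + b2              # linear output layer
        err = pred - y                  # grad of squared error w.r.t. pred (up to a factor of 2)
        # Backpropagation is just the chain rule through the two layers:
        dW2 = h.T @ err / len(x); db2 = err.mean(0)
        dh = (err @ W2.T) * (1 - h**2)  # tanh'(z) = 1 - tanh(z)**2
        dW1 = x.T @ dh / len(x); db1 = dh.mean(0)
        W2 -= lr * dW2; b2 -= lr * db2
        W1 -= lr * dW1; b1 -= lr * db1

    print("final MSE:", float(((pred - y) ** 2).mean()))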
If you liked his first chapter, consider supporting his Indiegogo campaign for the whole book: http://www.indiegogo.com/projects/neural-networks-and-deep-learning-book-project/
<i>"The adder example demonstrates how a network of perceptrons can be used to simulate a circuit containing many NAND gates. And because NAND gates are universal for computation, it follows that perceptrons are also universal for computation."</i><p>I think this comment from the article needs caveats. Of course, a neural network would not qualify as Turing Complete just because it's finite. Keep in mind also that neural network, lacking anything like counters, tape, or recursion, couldn't approximate a Turing in the way that a finite Von Neuman architecture machine does. (A NN can represent any given function over a domain if it get large enough, kind of the universality of a finite automaton).<p>I know this a reference to this generation of NN having overcome an earlier problem of <i>not</i> being able to represent a NAND gate but still, it's worthing keeping mind that an ordinary computer can simulate an NN with just a program but this doesn't work vice-versa, so that NN's in that sense are far from universal.
This is a cool exercise! After completing it, I wanted to find out exactly what each NN hidden node represented. I trained a tiny (10 hidden node) NN on an OCR dataset and created a visualization here: https://rawgithub.com/tashmore/nn-visualizer/master/nn_visualizer.html

Can anyone figure out what each hidden node represents?

You can also select a node and press "A" (gradient ascent). This will change the input in a way that increases the selected node's value. By selecting an output node and mashing "A", you can run the NN in reverse, causing it to "hallucinate" a digit.
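For anyone curious what the "A" key is doing under the hood, here's a rough sketch of gradient ascent on the input (random stand-in weights and a made-up 784-10-10 shape, since I don't know the visualizer's internals): hold the trained weights fixed and nudge the pixels to increase one unit's activation.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # Stand-in 784 -> 10 -> 10 net with random weights; in the real
    # visualizer these would be the trained weights.
    rng = np.random.default_rng(1)
    W1, b1 = 0.1 * rng.standard_normal((784, 10)), np.zeros(10)
    W2, b2 = 0.1 * rng.standard_normal((10, 10)), np.zeros(10)

    def forward(x):
        h = sigmoid(x @ W1 + b1)
        return h, sigmoid(h @ W2 + b2)

    unit = 3                        # the output node we want to excite
    x = 0.1 * rng.random(784)       # start from a faint random "image"
    for _ in range(200):
        h, out = forward(x)
        # Gradient of out[unit] w.r.t. the input pixels, by the chain
        # rule, with all weights held fixed:
        d_out = out[unit] * (1 - out[unit])       # sigmoid'
        d_h = d_out * W2[:, unit] * h * (1 - h)   # back through the hidden layer
        grad_x = W1 @ d_h
        x = np.clip(x + 0.5 * grad_x, 0.0, 1.0)   # ascend, keep pixels in [0, 1]

    print("activation of output unit", unit, ":", forward(x)[1][unit])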
Ahh, the MNIST database of handwritten digits. I never took an ML course, and six years ago I was only able to achieve an 87% recognition rate for a university software engineering project. I read about others achieving 99.9% recognition rates with their ANNs, so I wasn't happy with my result. I tried to self-study ANNs but found most material to be either too simple or too complicated. I finally found some articles about ANNs with code samples in C# (http://visualstudiomagazine.com/Articles/List/Neural-Network-Lab.aspx), so I'll finally be looking into rewriting my old code to get a better result.
That reminds me of a video I saw about something called restricted Boltzmann machines:

http://www.youtube.com/watch?v=AyzOUbkUf3M&t=24m0s
I'm not sure if I'm making a mistake, but I couldn't use the command listed to clone the repository. I'm on Windows with git installed, and I received the error "Permission denied (publickey)".

I was able to get everything by looking you up on GitHub and using the URL of the repository.

Edit: Also, you might mention the repository earlier, because it's rather large and I've had to break from the book while it downloads.
I did exactly this for a school project about a year ago:

https://github.com/bcuccioli/neural-ocr

There's a paper in there that explains the design of the system and my results, which weren't great, probably due to the small size of the training data.