
The truth about deep learning

297 points | by clmcleod | almost 9 years ago

24 comments

vonnik | almost 9 years ago
Anyone following DL news knows that DL alone will not lead to strong AI. The most impressive feats in the last year or so have come from combining deep artificial neural networks with other algorithms, as when DeepMind combined deep ConvNets with reinforcement learning and Monte Carlo Tree Search. There's not really an interesting conversation to be had about whether DL will get us to strong AI. It won't. It is just machine perception; that is, it classifies, clusters, and makes predictions about data very well in many situations, but it's not going to solve goal-oriented learning. But it solves perception problems very well, often better than human experts. So in the not-too-distant future, as people wake up to its potential, we will use those infinitely replicable NNs to extract actionable knowledge from the raw data of the world. That is, the world will become more transparent. It will offer fewer surprises. We may not solve cancer with DL, but we will spot it in X-rays more consistently with image recognition, and save more lives.

Disclosure: I work on the open-source DL project Deeplearning4j: http://deeplearning4j.org/
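The combination described here can be sketched in miniature: a stub "value network" scores positions inside a shallow search. This is a toy illustration only, not DeepMind's actual system; the game tree, features, and weights below are all invented for the example.

```python
# Hypothetical two-move game: each of our moves leads to a set of
# opponent replies, and each reply is described by a feature vector.
TREE = {
    "a": {"a1": [0.2, 0.1], "a2": [0.9, 0.4]},
    "b": {"b1": [0.5, 0.5], "b2": [0.1, 0.8]},
}

def value_net(features):
    """Stand-in for a trained ConvNet: a fixed linear score over features."""
    w = [0.7, 0.3]  # pretend these weights were learned
    return sum(wi * f for wi, f in zip(w, features))

def best_move(tree):
    # One-ply minimax with the network as leaf evaluator: score each of
    # our moves by the worst-case (minimum) value among opponent replies,
    # then pick the move with the best worst case.
    scores = {
        move: min(value_net(f) for f in replies.values())
        for move, replies in tree.items()
    }
    return max(scores, key=scores.get)
```

The point of the combination is the division of labor: the search supplies the game logic, while the learned evaluator replaces hand-written position heuristics.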
AndrewKemendo | almost 9 years ago
I understand and empathize with the skepticism, or rather the criticism of hand-wringing, with respect to the implications of current deep learning methods.

However, as someone who builds them for vision applications, I'm increasingly convinced that some form of ANN will underlie AGI: what he calls a universal algorithm.

If we assume that general intelligence comes from highly trained, highly connected single processors (neurons) with a massive and complex sensor system, then replicating that neuron is step one, which arguably is what we are building, albeit comparatively crudely, with ANNs.

If you compare, at a high level, how infants learn and how we train RNNs/CNNs, they are remarkably similar.

I think where the author, and the ML crowd in general, focuses too much is on unsupervised learning as being pivotal for AGI. In fact, if you look again at biological models, the bulk of animal learning is supervised training in the strict technical sense. Just look at studies of feral children as evidence of this.

Where the author detours too much is in assuming the academic world would have proven a broader scope for ANNs if it were there. In fact, research priorities are across the board not focused on general intelligence, and most machine learning programs explicitly forbid this research for graduate students, as it's not productive over the timeline of a program.

Bengio and others are, I think, on the right track in focusing on the question of ANNs toward AGI, and I think it will start producing results as our training methods improve.
aab0 | almost 9 years ago
"Here is my personal answer to the second question: deep neural networks are more useful than traditional neural networks for two reasons: The automatic encoding of features which previously had to be hand engineered. The exploitation of structurally/spatially associated features. At the risk of sounding bold, that's it; if you believe there is another benefit which is not somehow encompassed by these two traits, please let me know."

Let me ask a very simple question. What set of hand-engineered features gives <5% error on ImageNet?
mrdrozdov | almost 9 years ago
My top reasons why everyone getting into maths/stats/CS should go straight for deep learning:

a. Recent findings are documented incredibly well, in both research and code.

b. Because of its success, there are many areas for useful contribution with relatively little effort from the researcher.

c. Because of its success, it'll help you develop marketable skills.

d. It's fun.

Maybe it won't solve general AI, but it seems like a damn good foundation for the people who will eventually come up with the ideas that move us closer in that direction.
chris_va | almost 9 years ago
"The automatic encoding of features which previously had to be hand engineered." Yes, that is the main benefit.

The drawback is that we are still hand-tuning architectures, slowly inventing (or incorporating) things like LSTMs into the model.

One goal would be to achieve a universal building block that can be stacked/repeated without the need for architectural tuning. Maybe something that combines recurrence, one-shot learning, deep learning, and something borrowed from classical AI (like alpha-beta pruning, graph search, or something self-referential and stochastic with secondary neural networks) into a single "node". Then we won't have to worry about architecture so much.
morgante | almost 9 years ago
Why are we so preoccupied with the notion of "artificial intelligence" in the first place?

Artificial intelligence, if it can even be defined, does not seem like a particularly valuable goal. Why is emulating human cognition the metric by which we assess the utility of machine learning systems?

I'd take a bunch of systems with superhuman abilities in specialized fields (driving, Go, etc.) over so-called "artificial intelligence" any day.
return0 | almost 9 years ago
What are "traditional nets"? What are the "other learning algorithms"? What is a universal algorithm (and for what)? Neural nets are universal function approximators; there isn't a function they can't learn. When stacked, they seem to produce results that are eerily human-like.

I think the "universal algorithm" in the article refers to some kind of emergent intelligence. Well, nothing he mentions precludes it. Our brains aren't magical machines. Neural nets may not model real neurons, yet it is amazing how they can produce results that we identify as similar to the way we think. There is nothing in computational neuroscience that comes close to this. If anything, the success of deep nets bolsters my belief in connectionism rather than the opposite. I would expect it to be very difficult to formulate "intelligence" mathematically, and to prove that DNNs can or cannot produce it.
EGreg | almost 9 years ago
The truth about most automation in general: the logic is written by humans. The *main* mechanism by which computers/robots begin to outperform people at, e.g., playing chess or Go, or driving, is *copying what works*.

Humans outperformed animals because they were able to try stuff, recognize what worked, and transmit that abstract information using language.

The main advantage of computers is being able to *quickly and easily copy bits* and check for errors. You can have perfect copies now, preserving things that before could only be copied imperfectly.

And now you copy algorithms that work. The selection process might need work, but the actual logic is still written by some human somewhere. It's almost never written by a computer. Almost all code is either written by a human or at most generated by an algorithm written by a human, which takes as input code written by another human.

The "smarter" thing is the system of humans banging away at a platform, all making little contributions, plus the selection process for what goes into the next version. That's what's smarter than a single human. That, and the ability to collect and process tons of data.

All current AI does is throw a lot of machines at a problem and store the result in giant databases as precomputed input for later. That's what most *big data* is today. Whoever has the training sets and the results is now hoarding them for a competitive advantage.

But really, the thing that makes the whole system smart is that so many humans can make their own diff/patch/"pull request". Anyone can write a test and submit a bug report that something doesn't work. That openness is what made science and open source successful.

Open source has served the *long tail* better, too. Microsoft builds software that runs on some hardware. Linux has been forked to run on *toasters*. Open-source drug platforms would have helped solve malaria, Zika, and other diseases faster. If we had patentleft in drugs, we'd outpace bacterial resistance. Instead we have the profit motive, which has stagnated the development of new drugs.
argonaut | almost 9 years ago
Not sure why this is so highly upvoted. Nobody is questioning that deep networks work better than shallow ones, and there is a good understanding in academia of why (one that fits with most lay people's intuition). I hardly consider that the most interesting or relevant question.
arcanus | almost 9 years ago
"Since I am feeling especially bold, I will make another prediction: deep learning will not produce the universal algorithm. There is simply not enough there to create such a complex system."

While I (emotionally) agree, it will be interesting to see if the complexity (and non-linearity) of these algorithms permits 'emergent' behavior to appear.
nbvehrfr | almost 9 years ago
From my intuitive understanding (I'm not an expert), here is a very abstract description of how it works in general:

1. You have a real-world problem: a task you need to solve.

2. You build a model (algorithm, math method, etc.) which should solve the task.

3. You find the optimum of a complex function (the error function).

The third step is usually finding the optimum of the function. Deep neural networks help you move complexity from step 2 to step 3. In the example you mentioned, feature engineering moves from step 2 to step 3. So you can use simpler methods in step 2 to solve the same problems, or extend the range of problems you can solve with the same complexity in step 2.
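Step 3 above, finding the optimum of the error function, is in the simplest case just gradient descent. A minimal sketch with a linear model as the step-2 model; the data, learning rate, and iteration count are made up for illustration:

```python
import numpy as np

# Step 2: the model y = w*x + b. Step 3: minimize the mean squared error
# over (w, b) by gradient descent. Toy data drawn from the line y = 2x + 1.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2.0 * x + 1.0

w, b = 0.0, 0.0
lr = 0.05
for _ in range(2000):
    pred = w * x + b
    grad_w = 2.0 * np.mean((pred - y) * x)  # d/dw of mean squared error
    grad_b = 2.0 * np.mean(pred - y)        # d/db of mean squared error
    w -= lr * grad_w
    b -= lr * grad_b
```

A deep network changes only the model in step 2 (and hence the gradients); the step-3 loop, gradient descent on an error function, stays structurally the same.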
estefan | almost 9 years ago
Can anyone recommend a good resource, aimed at novices, that summarises what the different algorithms are best suited for?

I've been working my way through http://neuralnetworksanddeeplearning.com/ (with a big detour back into maths thanks to Khan Academy) and have done a few ML courses, but they mainly cover a couple of algorithms, not all the ones available in Spark's MLlib or TensorFlow, for example.
yason | almost 9 years ago
In my opinion, in the '80s and '90s neural networks and machine learning were 10% a solid concept in terms of academic research and 90% hype. Now neural networks and machine learning are 10% a solid concept in terms of being a practical, applicable tool and 90% hype. Things have changed a lot: I almost run out of fingers when trying to express the orders of magnitude by which raw processing power has increased. You can literally feed the network anything when training and get reasonable results later in recognition. That's one impressive yet humanly vague hash table. And no, you don't have to wait months or weeks anymore to train new things. Not even days, necessarily.

Why people pull in artificial intelligence is both naively optimistic and quite understandable. Modelling something like a neural system is so close to how the biological brain works that the parallel is blatantly obvious. On the other hand, the current deep networks do not translate to intelligence; not at all. Machine learning might be, in part, something we could describe as "intelligent", as it's able to connect dots that are very difficult to connect with traditional algorithms, but it absolutely is not intelligence. Then again, we do hang out in the same neighbourhood. If we ever create an artificial intelligence in software, I'm quite certain it will be very much based on some sort of massively deep and parallel network of dynamic connections.

I'm not that interested in artificial intelligence myself. I would be interested in artificial creativity and emotional senses, but to model those there are bigger metaphysical questions to be answered first.
pkghost | almost 9 years ago
I love the last sentence and want to expand on it. If ANNs are tools to help computers perceive, then they are analogous to components or layers in the nervous system. If we map the nervous system thoroughly enough and understand the inputs and outputs of each layer/region, then reproducing a human-like nervous system might not be all that complicated.
peter303 | almost 9 years ago
People have been working on neural nets for over 50 years now. The topic goes in and out of fashion. Nets are more powerful now, and computers vastly more powerful. https://en.m.wikipedia.org/wiki/Perceptrons_(book)
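For reference, the perceptron those 50 years trace back to fits in a few lines. A minimal sketch of Rosenblatt-style training on the logical AND function, which is linearly separable; the learning rate and epoch count are arbitrary choices:

```python
def train_perceptron(samples, epochs=20, lr=1.0):
    """Classic perceptron rule: nudge weights toward misclassified examples."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for x, target in samples:
            pred = 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0
            err = target - pred          # -1, 0, or +1
            w[0] += lr * err * x[0]
            w[1] += lr * err * x[1]
            b += lr * err
    return w, b

# AND is linearly separable, so the perceptron convergence theorem
# guarantees this terminates at a separating line.
AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
w, b = train_perceptron(AND)
```

The limitation Minsky and Papert's book made famous is exactly what this sketch cannot do: swap AND for XOR and no choice of w and b will ever classify all four points, which is why multi-layer nets (and the later fashion cycles) were needed.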
Cozumel | almost 9 years ago
When you have to train a network with a zillion images of a dumbbell (http://www.businessinsider.sg/googles-ai-can-teach-us-about-the-human-brain-2015-7/#.V1AWhT9VK1E) for it to recognise what a dumbbell is, and it still gets it wrong (adding arms!), then something's fundamentally broken, inasmuch as humans don't learn like that. DL is a huge step forward, but it's not ever going to be any kind of AGI.
DrNuke | almost 9 years ago
As usual with tools, even these, a clear understanding of the specific problem, the relevant metrics, and the expected goal is decisive. I am saying that experimental protocols are still devised by humans against a cost-versus-opportunity matrix. Brute computational force is not independent yet; artificial intelligence has not emerged yet.
EGreg | almost 9 years ago
This article shows how deep learning differs from true *human-like* understanding:

http://www.wired.com/2016/03/doug-lenat-artificial-intelligence-common-sense-engine/
Xcelerate | almost 9 years ago
> deep learning will not produce the universal algorithm

I'm curious what HN users think the "universal algorithm" will end up looking like.

My own guess (wild speculation) is that we'll start moving in the direction of concepts like tensor networks. While that term sounds like it has something to do with machine learning, it actually falls under the domain of theoretical physics. Tensor networks are a relatively recent development in quantum mechanics that show promise because of their ability to extract the "interesting" information from a quantum state. Generally speaking, it's very difficult to compute/describe/compress a quantum state because it "lives" in an exponentially large Hilbert space. Traditionally, the field of quantum chemistry has built this space up using Gaussian basis functions, and the field of solid-state physics has built it up using plane waves. The problem is that regardless of the basis set chosen, it appears as though exponentially more basis vectors are required to accurately describe a quantum state as the system becomes larger.

Tensor networks are an attempt to alleviate this problem. While it is true that the state space of an arbitrary quantum system is exponentially large in the number of particles, it turns out that for *realistic* quantum systems the relevant state space is actually much smaller; real systems seem to live in a tiny corner of Hilbert space. And this tiny subspace even includes all of the possible states that one could put a collection of qubits into within the lifetime of the universe.

The projection of a system's state vector into either the position or momentum basis is known as the system's "wavefunction" (some texts allow more than these two bases). Since the wavefunction exhibits the highly desirable property of being localized in position/momentum space, this allows one to build up a good approximation to the state using Gaussians or plane waves, unless the wavefunction exhibits strong electron correlation (quantum entanglement). Quantum entanglement is the exception to nature's tendency to localize state space about a point in spacetime, and thus it is frequently the case that the most commonly used basis sets are highly suboptimal for many real electronic systems (superconductors stand out as a notable and somewhat pathological example).

I'm not entirely familiar with all of the math behind it, but tensor networks essentially describe the small but relevant region of Hilbert space by exploiting properties of the renormalization group. In this sense, a compact way of describing "real world" quantum states is developed. I think this has applications to a "universal algorithm", because real-world data rarely consists of a random or uniform scattering of information across the data's state space. In my own research, I've found that a lot of the NP-hard problems I run into are efficiently solvable in practice (stuff involving low-rank PSD matrices) precisely because the data *isn't* random. If tensor networks are good at finding a basis set that is "local" in abstract Hilbert space with regard to some real-world set of quantum states, then it seems as though they would work equally well for a lot of the real-world data that lives on a low-dimensional manifold in a high-dimensional space: the kind of data that machine learning (and eventually artificial general intelligence) seeks to tackle.
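The "tiny corner of state space" intuition has a simple matrix analogue: structured data compresses under a truncated SVD, while random data does not. A minimal sketch, assuming NumPy; both matrices are toy constructions (one built to have rank 2, one pure noise):

```python
import numpy as np

def rank_k_rel_error(M, k):
    """Relative Frobenius error of the best rank-k approximation (Eckart-Young)."""
    U, s, Vt = np.linalg.svd(M)
    M_k = (U[:, :k] * s[:k]) @ Vt[:k]
    return np.linalg.norm(M - M_k) / np.linalg.norm(M)

n = 50
t = np.linspace(0.0, 1.0, n)
# "Structured" data: a sum of two outer products, hence exactly rank 2.
structured = np.outer(np.sin(2 * np.pi * t), t) + np.outer(t, np.cos(2 * np.pi * t))
# Unstructured data: i.i.d. Gaussian noise, with no low-rank corner to live in.
noise = np.random.default_rng(0).standard_normal((n, n))

err_structured = rank_k_rel_error(structured, 2)
err_noise = rank_k_rel_error(noise, 2)
```

Tensor networks generalize this idea from matrices to many-index tensors, but the principle is the same: the description stays small exactly when the data is non-random.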
isseu | almost 9 years ago
Talking about the benefits of DNNs, what about the levels of abstraction? Each layer adds a level of abstraction that you can't get in shallow networks.
stared | almost 9 years ago
> "deep learning will not produce the universal algorithm"

I doubt that a general algorithm exists (why should it?).

But if we are talking about human-level (or superhuman-level) AI, it is good to remember that WE are deep, recurrent neural networks (with a very different implementation, and spikes instead of floats, but still). If it works in vivo, why shouldn't its abstracted version work in silico?
armitron | almost 9 years ago
Entirely content-free post. Click-bait most likely.
radarsat1 | almost 9 years ago
> Nothing is more frustrating when discussing deep learning than someone explaining their views on why deep neural networks are "modeled after how the human brain works" (much less true than the name suggests) and thus are "the key to unlocking true artificial intelligence".

While I get what he is saying here, and more or less agree, I think it is not to be taken lightly that there *is* a significant difference in this discussion now as compared to 30 years ago. The difference is not *how* neural networks work, which clearly differs from, but is related in some ways to, the brain, but rather *what* neural networks see.

What is really significant, when you can handle lots and lots of data and throw it all at a giant neural network, is what we see happening in the network. The observation that the hidden-layer filters developed as optimal features for classifying images appear to be Gabor-like directional filters (I'm referring, of course, to this type of thing [1]) is not random, and not an insignificant result. It really does relate to perception, in the sense that 1) we know that the brain has directional filters in the visual cortex, and 2) more importantly, from signal processing theory we know that such filters are "optimal" from a certain mathematical point of view. If they develop naturally as the best way to interpret "natural" images (or other natural data, such as audio [2]), the development of such filters in the brain is perhaps also quite likely. There is quite some research in neuroscience at the moment looking for evidence of such optimal filters in early neural pathways.

So yes, neural networks are not models of "how the brain works", but the newly established ability to process huge amounts of data, and to examine what kind of learning happens in order to optimise this processing, can tell us a lot about the brain: not how it works, but what it must *do*. Complemented with work in neuroscience, the idea of modeling information processing is *not* unrelated, and can really lead to some significant contributions to our understanding of perception, and perhaps, eventually, cognition; but who knows.

The misunderstanding here is thinking that the be-all and end-all of neuroscience is studying how neurons fire and interact. Neuroscience is much more than that. Neuroscientists want to know how we experience and understand the world, and a big part of that is understanding what is required to process and interpret information: what the information is, what its statistics are, and what kind of neural processing would be required to extract it from our sensory inputs. Of course, this must be complemented by studies of how humans *do* react to stimuli, to try to verify that we *do* process information according to some model. But the model being verified comes from what we know about information processing, and computer science can contribute there in a significant way.

[1]: https://computervisionblog.files.wordpress.com/2013/05/gabor.png

[2]: http://www.nature.com/neuro/journal/v5/n4/abs/nn831.html
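For readers who haven't seen one, a Gabor filter like those in [1] is just a sinusoidal grating under a Gaussian envelope. A minimal sketch, assuming NumPy; the size, wavelength, and bandwidth values are arbitrary:

```python
import numpy as np

def gabor(size=15, wavelength=5.0, theta=0.0, sigma=3.0):
    """2-D Gabor filter: a plane-wave carrier windowed by a Gaussian envelope."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    x_t = x * np.cos(theta) + y * np.sin(theta)   # coordinate along the grating
    envelope = np.exp(-(x**2 + y**2) / (2 * sigma**2))
    carrier = np.cos(2 * np.pi * x_t / wavelength)
    return envelope * carrier

g = gabor()  # horizontal-frequency filter; vary theta for other orientations
```

Sweeping theta and wavelength yields the oriented, frequency-tuned filter bank that both trained CNN first layers and V1 simple cells appear to approximate.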
dredmorbius | almost 9 years ago
Define your terms. WTF is "Deep Learning"?