> Driverless long-haul trucks are apparently just a few years away, and the main worry now is not so much the safety of these trucks but the specter of unemployment facing millions of people currently employed as truck drivers.<p>No, no they're not. We have some lane tracking in good weather and the like, but we are still <i>decades</i> (or more) away from the full level-5 autonomy that would make drivers behind the wheel unnecessary.<p>But it only goes to show that not even computer science experts are immune to marketing hype and well-funded PR campaigns. :)<p>As for the unreasonable ineffectiveness - it's not just in systems research. ML can be very effective in some areas (especially when there is a ton of training data), but many areas of human endeavor are hard to model via function-approximation techniques like those used in most of ML.
I think this is because the kinds of problems that arise in system design are logical and symbolic in nature, and the current crop of "AI" has no symbolic reasoning capabilities. All the current hype is about pattern matching - very good pattern matching, but pattern matching nonetheless. Constructing a compiler or a JIT is more like what mathematicians do: set down some axioms and explore the resulting theoretical landscape. None of the current hype is about theorem proving or the kinds of inductive constructions that crop up in the process of proving theorems or designing compilers and JITs.<p>For an example of the kind of logical problem optimizers solve, take a look at: <a href="https://github.com/google/souper" rel="nofollow">https://github.com/google/souper</a>.<p>So I don't see how you can take the current neural nets and get them to design a more efficient CPU architecture or a better JIT.
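To make the theorem-proving flavor concrete, here is a minimal sketch of the kind of query a tool like Souper poses to an SMT solver (this is my own illustration using the z3-solver Python package, not Souper's actual interface): given a candidate peephole rewrite, ask whether it holds for <i>every</i> possible input.
<pre><code>
# Minimal sketch: checking candidate peephole rewrites with an SMT solver.
# Illustrative only; Souper works over LLVM IR with its own representation.
# Requires the z3-solver package.
from z3 import BitVec, prove

x = BitVec('x', 32)

# Valid rewrite: multiplying by 2 equals shifting left by 1 for every
# 32-bit value, so an optimizer may apply it unconditionally.
prove(x * 2 == x << 1)    # prints "proved"

# Tempting but invalid rewrite: folding x + 1 > x to `true` fails because
# signed overflow yields a counterexample (x = 2147483647).
prove(x + 1 > x)          # prints a counterexample
</code></pre>
A learned pattern matcher can suggest such rewrites, but deciding whether one is correct for all inputs is exactly the kind of symbolic question described above.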
Cute title, but the post doesn't really address either reasonableness or effectiveness; it mostly claims that the potential has not yet been realized. It's a pet peeve of mine to see these hackneyed joke titles referencing famous papers - "considered harmful" is another case in point. Let's just stick to descriptive titles.
The perceptron scheme for branch prediction (full paper) [1] probably works because it uses far more memory for branch history than the usual approaches. It's not doing a better job with comparable resources.<p>[1] <a href="https://www.cs.utexas.edu/~lin/papers/hpca01.pdf" rel="nofollow">https://www.cs.utexas.edu/~lin/papers/hpca01.pdf</a>
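For readers who haven't seen the scheme, a rough sketch of the predictor from [1] looks like the following (table size, indexing, and history length are simplified placeholders of mine; the training-threshold formula is the one from the paper):
<pre><code>
# Rough sketch of a perceptron branch predictor in the spirit of
# Jimenez & Lin (HPCA 2001). Hashing and sizing are simplified.
HISTORY_LEN = 32
NUM_PERCEPTRONS = 1024
THRESHOLD = int(1.93 * HISTORY_LEN + 14)    # training threshold from the paper

# One weight vector (bias + one weight per history bit) per table entry.
weights = [[0] * (HISTORY_LEN + 1) for _ in range(NUM_PERCEPTRONS)]
history = [1] * HISTORY_LEN                 # global history: 1 = taken, -1 = not taken

def predict(pc):
    w = weights[pc % NUM_PERCEPTRONS]
    y = w[0] + sum(wi * hi for wi, hi in zip(w[1:], history))
    return y >= 0, y                        # (predict taken?, confidence)

def train(pc, taken):
    predicted, y = predict(pc)
    t = 1 if taken else -1
    w = weights[pc % NUM_PERCEPTRONS]
    # Update weights only on a misprediction or when confidence is low.
    if predicted != taken or abs(y) <= THRESHOLD:
        w[0] += t
        for i in range(HISTORY_LEN):
            w[i + 1] += t * history[i]
    # Shift the outcome into the global history register.
    history.pop(0)
    history.append(t)
</code></pre>
The memory point is visible here: each table entry stores HISTORY_LEN + 1 small counters rather than a single two-bit counter, which is what lets the predictor correlate with much longer histories.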
Indirectly they have helped quite a bit. Some of the most advanced mathematical and symbolic solvers (MIP, IP, LP, CP, SAT, SMT) have slowly been incorporating machine learning to advance their capabilities. Use cases inside these solvers include branch prediction, branch selection, constraint evaluation order, solver type selection, search strategy selection, cost estimation for column generation strategies, problem classification, solve time estimation, etc.<p>And since advances in our ability to solve symbolic and mathematical problems have directly enabled the current research in PLT and formal systems, I see no reason to discount the impact ML has had in pushing that frontier.
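As a toy illustration of one of these use cases (my own sketch, not any real solver's API): a backtracking SAT search where the choice of branching variable is delegated to a scoring function that could just as well be a trained model.
<pre><code>
# Toy backtracking SAT search with a pluggable branching heuristic.
# Purely illustrative; production solvers (and their ML integrations)
# are far more sophisticated. Clauses are lists of DIMACS-style literals.
def solve(clauses, assignment, score):
    clauses = simplify(clauses, assignment)
    if clauses is None:                 # some clause became empty: conflict
        return None
    if not clauses:                     # every clause satisfied
        return assignment
    # Branch selection: classically a hand-tuned heuristic (VSIDS etc.);
    # here, whatever variable the scoring function rates highest.
    unassigned = {abs(l) for c in clauses for l in c} - assignment.keys()
    var = max(unassigned, key=lambda v: score(v, clauses))
    for value in (True, False):
        result = solve(clauses, {**assignment, var: value}, score)
        if result is not None:
            return result
    return None

def simplify(clauses, assignment):
    remaining = []
    for clause in clauses:
        if any(assignment.get(abs(l)) == (l > 0) for l in clause):
            continue                    # clause already satisfied
        rest = [l for l in clause if abs(l) not in assignment]
        if not rest:
            return None                 # clause falsified
        remaining.append(rest)
    return remaining

# A hand-written stand-in for a learned scorer: prefer frequent variables.
occurrence_score = lambda v, clauses: sum(1 for c in clauses for l in c if abs(l) == v)
print(solve([[1, -2], [2, 3], [-1, -3]], {}, occurrence_score))
</code></pre>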
Perhaps this paper provides an explanation?
<a href="https://arxiv.org/pdf/1608.08225.pdf" rel="nofollow">https://arxiv.org/pdf/1608.08225.pdf</a><p>"The exceptional simplicity of physics-based functions
hinges on properties such as symmetry, locality, compositionality and polynomial log-probability,
and we explore how these properties translate into exceptionally simple neural networks approximating
both natural phenomena such as images and abstract representations thereof such as drawings."
Branch prediction is an interesting use, although I'm wondering how expensive a misprediction really is.<p>But this:<p>"<i>Another example is the use of regression techniques from machine learning to build models of program behavior. If I replace 64-bit arithmetic with 32-bit arithmetic in a program, how much does it change the output of the program and how much does it reduce energy consumption? For many programs, it is not feasible to build analytical models to answer these kinds of questions (among other things, the answers are usually dependent on the input values), but if you have a lot of training data, Xin Sui has shown that you can often use regression to build (non-linear) proxy functions to answer these kinds of questions fairly accurately.</i>"<p>I'm not sure whether I am fascinated or horrified.
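For the curious, a hedged sketch of what such a proxy model could look like (the features, data, and model choice here are entirely synthetic stand-ins of mine, not the cited work's setup):
<pre><code>
# Sketch of a learned proxy function: predict the output error and energy
# reduction of a 32-bit variant from features of the input, instead of
# running and measuring both versions. The data below is synthetic.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Hypothetical per-run features: input magnitude, iteration count, density.
X = rng.uniform(0, 1, size=(500, 3))
# Hypothetical measurements: (relative output error, energy reduction).
y = np.column_stack([
    0.1 * X[:, 0] ** 2 + 0.01 * rng.normal(size=500),
    0.3 - 0.1 * X[:, 1] + 0.02 * rng.normal(size=500),
])

proxy = RandomForestRegressor(n_estimators=100).fit(X, y)

# Later: answer "what would 32-bit arithmetic do on this input?" cheaply.
print(proxy.predict([[0.8, 0.5, 0.2]]))   # -> [estimated error, estimated savings]
</code></pre>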
Keith Winstein & collaborators have done good work on using ML to train TCP congestion control for different scenarios: <a href="http://web.mit.edu/remy/" rel="nofollow">http://web.mit.edu/remy/</a>
I guess computers are much more deterministic than what ML needs in order to be useful.<p>ML, in a very inaccurate way, can be seen as:<p>1. We have observations and conclusions.<p>2. We don't know exactly how those observations lead to the conclusions.<p>3. The assumed procedure that leads from observations to conclusions is called a model.<p>4. With enough (observation, conclusion) pairs, we can train a model that is good enough to make good decisions on future observations.<p>The problem for traditional computer science is that the system is so deterministic that we know EXACTLY how it works at the instruction level, while ML is good at dealing with problems that are inherently probabilistic.
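A toy version of those four steps, with a deliberately simple model assumption (nearest neighbour), just to make the vocabulary concrete:
<pre><code>
# Toy version of the steps above: (observation, conclusion) pairs, an
# assumed model (1-nearest-neighbour), and predictions on new observations.
pairs = [((1.0, 2.0), "A"), ((8.0, 9.0), "B"), ((1.5, 1.8), "A"), ((7.5, 8.2), "B")]

def model(observation):
    # The model assumption: nearby observations lead to the same conclusion.
    # "Training" here is simply remembering the pairs.
    def distance(pair):
        return sum((a - b) ** 2 for a, b in zip(observation, pair[0]))
    return min(pairs, key=distance)[1]

print(model((1.2, 1.9)))   # -> "A"
print(model((8.3, 8.8)))   # -> "B"
</code></pre>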
The latter part of the article seems to focus on deep methods not influencing lower-level system architecture and design. Perhaps the reason is that those systems and problems are fundamentally different from the dynamical systems in which NNs are finding so much success. In short, compilers, programming languages, and architecture are formal, exact systems, while driving, TTS, STT, image association, etc. are nowhere near as controlled environments.
To contrast this opinion: a promising research project from CMU uses an RNN to manage a database based on "forecasted" workloads (I know it's not <i>quite</i> architecture, but still): <a href="http://pelotondb.io/" rel="nofollow">http://pelotondb.io/</a>
Perhaps the biggest hurdle in this regard is the approach to machine learning. Nearly everything I have seen on machine learning is a primer on big data followed by a series of algorithms for making the best and smartest decision upon that mountain of data.<p>This is completely the wrong approach. Machine learning can be done on a dime, provided the proper nurturing and environment, but you have to be willing to make some concessions.<p>First and foremost, you have to be able to write a program that can make a decision. A simple "if" condition is sufficient.<p>Secondly, that decision is open to modification by checking the evaluation (the "if" condition) against its result. In this regard the logic is fluid, as opposed to a series of static conditions written by humans hoping to devise organic decisions.<p>Finally, the decision is allowed to be completely wrong. Wrong decisions are better than either no decision or the same decision without deviation. This is how humans learn, and it should be no surprise that computers would benefit from the same approach.<p>The key to getting this right is bounds checking and simplicity. A decision must find a terminal point at which to stop improving upon its outcome, and a narrow set of boundaries must be enforced to prevent unnecessary deviation. It is perfectly acceptable if some grand master must occasionally prod the infantile program in the right direction. This is also something that people do for other people who are learning.<p>If you can do that, you have machine learning. You don't need big data to get this. You certainly don't need complex transportation machines or voice-activated software to validate it. AI on a dime. If you can do it on a dime, you can certainly do it with a multi-billion-dollar budget and thousands of developers.
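A toy sketch of that loop, with every name and constant being an illustrative placeholder of mine: one "if" whose threshold is nudged by feedback on whether its last decision was right, and clamped to bounds so it cannot wander off.
<pre><code>
# Toy "machine learning on a dime": a single decision (an if-condition)
# whose threshold is adjusted by feedback, kept within bounds, and allowed
# to be wrong along the way. All constants are illustrative placeholders.
LOWER, UPPER = 0.0, 1.0      # boundaries the threshold may never leave
STEP = 0.05                  # how far one piece of feedback can move it

threshold = 0.5              # the initial human-written "if" condition

def decide(signal):
    return signal > threshold           # the decision itself

def learn(signal, was_correct):
    """Nudge the threshold only when a decision turned out wrong."""
    global threshold
    if was_correct:
        return
    if decide(signal):                  # said "yes" but should have said "no"
        threshold = min(UPPER, signal + STEP)
    else:                               # said "no" but should have said "yes"
        threshold = max(LOWER, signal - STEP)

# The feedback loop: an external check (the "grand master", or a measurement)
# says whether each decision was right, and the program corrects itself.
for signal, truth in [(0.4, True), (0.7, True), (0.35, False), (0.6, True)]:
    guess = decide(signal)
    learn(signal, guess == truth)
print(threshold)    # roughly 0.35 after the mistaken first decision
</code></pre>
No training corpus, no gradient descent: just a decision, feedback on it, and bounds that keep the deviation in check.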
OT: How and why is this page overriding my control key, and how can I stop it from doing that?<p>I use ctrl+scroll wheel to zoom and it is very annoying when that behavior is overridden.