
Level-Up Your Machine Learning

336 points by cjrd almost 11 years ago

16 comments

xiaoma almost 11 years ago
I think this is fantastic advice.

As someone who has spent an embarrassing amount of time on various independent education, one of the key things I've taken from it is just how efficient textbooks are. Not only can you read more quickly than people can speak, but reading is also active by nature. I've often found my attention wandering during videos, but it's just not possible to read without putting in a minimum amount of focus. It's also a lot easier to modulate your reading speed based on how easy the material is for you than it is to do the same during a lecture video.

Some general thoughts on MOOCs:

Coursera and edX tend to be great for small, self-contained topics, and the automated graders for programming assignments are great as well. The forums are also useful, though not ideal (since there are no practice questions students can get help with that don't fall under the honor code).

Where modern MOOCs really fall down is prerequisites. It's surprisingly difficult to do something like structure an entire CS degree from Coursera classes. Though many classes are taught by famous CS professors, they are from different institutions that break material into courses in different ways. Worse still, a lot of the classes are either watered down or shortened, or both.

MIT's OpenCourseWare archives are actually a lot better for this. There are no certificates and no credentials, but nearly all the material is freely available. The one biggest inefficiency, though, is all the time spent in the lectures. At least they can be played back at a higher speed, but the lectures really do take a lot more time and cover less than the textbooks. For courses that have good textbooks, I think the best approach is to skip the lectures except in portions where you feel you need more review.

Finally, Khan Academy is fantastic for answering specific, mechanical questions (e.g. how to calculate eigenvalues), but a bit light on material. I'd use it as a supplement for the other resources.
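(As an aside, the kind of mechanical eigenvalue question mentioned above is also easy to sanity-check in code. A quick sketch, using only the 2x2 closed form from the characteristic polynomial, with an illustrative matrix chosen here, not taken from any of the recommended books:)

```python
import math

def eig2x2(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]] via the characteristic
    polynomial: lambda^2 - (a + d)*lambda + (a*d - b*c) = 0."""
    tr, det = a + d, a * d - b * c
    disc = math.sqrt(tr * tr - 4 * det)  # real for symmetric matrices
    return ((tr - disc) / 2, (tr + disc) / 2)

# Symmetric matrix [[2, 1], [1, 2]]: trace 4, determinant 3,
# so the eigenvalues are the roots of lambda^2 - 4*lambda + 3.
print(eig2x2(2, 1, 1, 2))  # (1.0, 3.0)
```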
rfrey almost 11 years ago
I love textbooks and spend more of my children's inheritance on them than I should.

But what MOOCs give me is the *exercises*. I often think I understand a problem, but it's only after getting 2.1/10 on a Coursera quiz that I realize I've missed a key step or concept.

Many textbooks have exercises, but few have solutions. I've been working through Sutton and Barto's Reinforcement Learning, for example (again), and although I do the exercises and programming questions, I never know if I've gotten it *right*. My experience with MOOCs suggests I probably haven't in a large number of cases.

The best of both worlds is when I can follow a MOOC with the textbook to gain more depth, for example with the PGM course on Coursera.
grayclhn almost 11 years ago
Two free books that I haven't seen mentioned, from more of a stats perspective:

* James, Witten, Hastie, and Tibshirani's *An Introduction to Statistical Learning, with Applications in R*: http://www-bcf.usc.edu/~gareth/ISL/

* Hastie, Tibshirani, and Friedman's *The Elements of Statistical Learning* (more advanced): http://statweb.stanford.edu/~tibs/ElemStatLearn/
scottlocklin almost 11 years ago
PGM is a tough book. I'm not sure it's the right book for "level 3" unless you want to be a level-3 who is good at PGMs.

The problem with ML is there are so many different kinds. Bishop's book is a decent lightweight survey, but it doesn't come close to covering all the interesting fields. You could read that and Hastie/Tibshirani's book and still know almost nothing about online training (hugely important for "big data" and time series), reinforcement learning (mentioned, but not in any depth), agent learning, "compression" sequence-prediction techniques, time-series-oriented techniques (recurrent ANNs for starters, but there is a ton to know here, and most interesting data is time ordered), image recognition tools, conformal prediction, speech recognition tools, ML in the presence of lots of noise, and unsupervised learning. I don't own PGM, but it probably wouldn't help much in these matters either. I know guys who are probably level 4 at machine learning who don't know about most of these subjects. On the other hand, Peter Flach's book "Machine Learning" at least mentions them and makes pointers to other resources.

"Deep learning" is becoming kind of a buzzword for a big basket of tricks. I think it's worth knowing about drop-out training and the tricks used to do semi-supervised learning, but the buzzword is silly. Technically, "deep learning" just means "improved gradient descent." I figure level-4 is anyone making progress coming up with new techniques.

That said, reading good books is one way to make progress. Knowing the right people is the other way.
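(For anyone who hasn't seen it spelled out: the gradient descent loop that all those "improved" variants build on fits in a few lines. A minimal sketch on a toy quadratic, not taken from any of the books discussed here:)

```python
# Plain gradient descent on f(x) = (x - 3)^2, whose gradient is 2*(x - 3).
# Momentum, adaptive learning rates, drop-out, etc. are all tricks
# layered on top of this same basic loop.
def grad_descent(grad, x0, lr=0.1, steps=100):
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)  # step downhill along the gradient
    return x

x_min = grad_descent(lambda x: 2 * (x - 3), x0=0.0)
print(round(x_min, 4))  # 3.0 -- the minimizer of (x - 3)^2
```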
tel almost 11 years ago
PRML is great. I haven't read PGM, but I took a relatively intensive course on it which had great lecture notes. Which I'd also like to suggest: lecture notes are often "skeletal books" which can bring you up to speed on a topic quickly, given that you (a) are willing to work a bit more and (b) can fill in the missing fleshy bits with your own experience.

I'd also really like to suggest DGL (http://books.google.com/books/about/A_Probabilistic_Theory_of_Pattern_Recogn.html?id=5uCTngEACAAJ) and Bickel and Doksum (http://www.amazon.com/Mathematical-Statistics-Basic-Selected-Topics/dp/0132306379). These are two of my *favorite* core ML/stats books.
vkhuc almost 11 years ago
There are some good (free) books that haven't been mentioned yet:

1) "Data Mining and Analysis: Fundamental Concepts and Algorithms" by Zaki and Meira: http://www.cs.rpi.edu/~zaki/PaperDir/DMABOOK.pdf

This book covers many ML topics with concrete examples.

2) "Computer Vision: Models, Learning, and Inference" by Simon Prince: http://web4.cs.ucl.ac.uk/staff/s.prince/book/book.pdf

Despite being a CV book, the first half of it reads like a statistics book, with examples from CV that are very easy to follow.
dimatura almost 11 years ago
I would also suggest K. Murphy's Machine Learning for the journeyman level. At the intermediate apprentice-journeyman level, Alpaydin's Introduction to Machine Learning is very friendly.
kashifr almost 11 years ago
From my own journey, I would say that a good place to start for graphical models might be "Bayesian Reasoning and Machine Learning" by Barber. It's free (http://web4.cs.ucl.ac.uk/staff/D.Barber/pmwiki/pmwiki.php?n=Brml.Online). I haven't read through it, but I've heard good things. However, it doesn't cover some basic things like SVMs, RVMs, neural networks...

For those I'd suggest "Pattern Recognition and Machine Learning" by Bishop. I've read through this and it's really well organized and thought out. For more mathematically advanced ML I'd suggest "Foundations of Machine Learning" by Mohri. For a good reference for anything else, I'd suggest "Machine Learning: A Probabilistic Perspective" by Murphy. For more depth on graphical models, look at "Probabilistic Graphical Models: Principles and Techniques" by Koller.

On the NLP front there are the standard texts "Speech and Language Processing" by Jurafsky and "Foundations of Statistical Natural Language Processing" by Manning.

I also like "An Introduction to Statistical Learning" by James, Witten, Hastie, and Tibshirani.
cipher0 almost 11 years ago
Great recommendations. Some people might also find this interesting as a general guideline to "data science": http://nirvacana.com/thoughts/becoming-a-data-scientist/

[edit] Scroll down and look at the map.
eli_gottlieb almost 11 years ago
What I'd really appreciate is ideas on how to learn or review the core math concepts. I haven't actually *done* any multivariable calculus, vector/matrix calculus, or linear algebra in *years*, even though I took them in undergrad.
joaomsa almost 11 years ago
Wholeheartedly agree with the author's sentiments on the value of textbooks. Not because of the medium itself, but because they're (usually) accompanied by well-thought-out examples and practice problems.

When initially starting a dense subject such as PGM, having my hand held through the introductory material, with incremental practice problems as the topic elaborated, helped me get a much more intimate grasp. Initially, only reading superficially and watching lectures, I kept getting stumped trying to form a cohesive mental map of all the interleaved concepts.
GabrielF00 almost 11 years ago
What are the HN community's thoughts on Learning from Data by Abu-Mostafa, Magdon-Ismail, and Lin (http://amlbook.com/)? The lectures from their course are here: http://work.caltech.edu/lectures.html

I haven't started it yet, but this book was recommended by some folks at my company.
lowglow almost 11 years ago
I'm in the middle of PGM right now. It's actually really easy to follow if you put some time into it. I'm reading PRML next. I didn't realize there was a 'path' to learning ML, though; thanks for that.

We could use some more ML recs on https://books.techendo.com/
kp25 almost 11 years ago
I would like to start my ML journey in Python, then get to R.

How about learning things in Python? Any good recommendations?
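(To give a flavor of how little code a first Python ML experiment needs, here is a toy nearest-centroid classifier in plain Python. The data and labels are made up for illustration; real work would reach for NumPy or scikit-learn:)

```python
def centroid(points):
    """Mean of a list of 2-D points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def classify(x, centroids):
    """Return the label whose class centroid is nearest to x (squared
    Euclidean distance; no sqrt needed for comparison)."""
    return min(centroids, key=lambda label: (x[0] - centroids[label][0]) ** 2
                                          + (x[1] - centroids[label][1]) ** 2)

# Two toy classes: points near the origin ("a") and points near (5, 5) ("b").
train = {"a": [(0, 0), (1, 0), (0, 1)], "b": [(5, 5), (6, 5), (5, 6)]}
centroids = {label: centroid(pts) for label, pts in train.items()}

print(classify((1, 1), centroids))  # a
print(classify((4, 5), centroids))  # b
```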
jpeterson almost 11 years ago
For a really nice introductory book, try "Machine Learning" by Tom Mitchell.
orasis almost 11 years ago
Textbooks? Really?

How about starting with a great lecturer, like:

Nando de Freitas: https://www.youtube.com/channel/UC0z_jCi0XWqI8awUuQRFnyw

David MacKay: http://videolectures.net/course_information_theory_pattern_recognition/

or the (sometimes too dense) Andrew Ng: https://www.coursera.org/course/ml