Back around 2008, SVMs were all the rage in computer vision. We would use hand-designed visual features with a linear SVM on top; that was how object detectors were built (remember DPM?).

Funny how SVMs are just max-margin loss functions, yet we took it for granted that you needed domain expertise to craft features like HOG/SIFT by hand.

By 2018, we use ConvNets to learn BOTH the features and the classifier. In fact, in a modern CNN it's hard to say where the features end and the classifier begins.
If you want something closer to an ELI5 version, I recommend this tutorial [1].

Disclaimer: written by me.

[1] https://blog.statsbot.co/support-vector-machines-tutorial-c1618e635e93
I notice this doesn't mention hinge loss, which is by far the simpler way of arriving at the SVM. Hinge loss is just max(0, 1 - t*y), where y is the output of the linear model and t = ±1 is the label. It takes the common-sense approach of not penalizing points that are correctly classified and far enough from the decision boundary, and penalizing linearly after that.

In primal form, an SVM is literally just a linear model with hinge loss instead of log loss (logistic regression) or squared loss (ordinary linear regression). For apparently historical reasons, it is usually derived from the "hard-margin" SVM in dual form, motivated by maximizing the margin. This is complicated and not very intuitive.

It also leads people to conflate the kernel trick with the dual form, when in fact they have nothing to do with each other: you can use the kernel trick in the primal SVM just fine.

Stochastic gradient descent can also be used for primal methods, while it doesn't work in the dual. That makes the primal much faster for large problems.
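For concreteness, here is a minimal sketch of the primal approach described above: a linear model trained by stochastic subgradient descent on the hinge loss with an L2 penalty. The toy data, learning rate, and regularization strength are illustrative choices of mine, not anything from the linked material:

    # Primal linear SVM: hinge loss + L2 penalty, fit by stochastic subgradient descent.
    import numpy as np

    rng = np.random.default_rng(0)

    # Toy 2-D data: two Gaussian blobs with labels t in {-1, +1}.
    X = np.vstack([rng.normal(-2.0, 1.0, (50, 2)), rng.normal(2.0, 1.0, (50, 2))])
    t = np.hstack([-np.ones(50), np.ones(50)])

    w, b = np.zeros(2), 0.0
    lam, lr = 0.01, 0.1   # regularization strength and learning rate (arbitrary here)

    for epoch in range(100):
        for i in rng.permutation(len(X)):
            y = X[i] @ w + b              # output of the linear model
            if t[i] * y < 1:              # inside the margin: hinge loss is active
                w -= lr * (lam * w - t[i] * X[i])
                b += lr * t[i]
            else:                         # outside the margin: only the L2 term contributes
                w -= lr * lam * w

    print("training accuracy:", np.mean(np.sign(X @ w + b) == t))

Swapping the dot product for a kernel expansion gives you a kernelized primal as mentioned above; nothing about the dual is needed for that.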
There's been some work on variational Bayesian formulations of SVMs in the last few years. These can give actual uncertainty estimates and do automatic hyperparameter tuning. This one in particular is very cool:

https://arxiv.org/pdf/1707.05532.pdf
It's interesting how quickly support vector machines went from the hot new thing for classifying images to an afterthought once deep learning started getting great results.
Bullet point on page 2: "Optimal hyperplane for linearly separable patterns"

I think the author may be working from a very different definition of the word "idiot".
I have a question!

The PDF says that the optimization problem in SVMs has a nice property: it is quadratic, which means there is a single global minimum to converge to, not lots of local minima as in neural networks. So it seems SVMs won't get stuck at a suboptimal solution.

Is that not a problem in DNNs now? Or is the dimensionality so high that local minima don't stop the optimizer, because there's always another way around them?
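For reference, the quadratic problem the PDF most likely has in mind is the standard soft-margin dual (a sketch in LaTeX; C is the usual box constraint, t_i the ±1 labels):

    \max_{\alpha}\; \sum_{i=1}^{n} \alpha_i
      - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n}
        \alpha_i \alpha_j\, t_i t_j\, x_i^{\top} x_j
    \quad \text{s.t.} \quad 0 \le \alpha_i \le C, \qquad \sum_{i=1}^{n} \alpha_i t_i = 0

The objective is a concave quadratic in the multipliers, so any local optimum is global; the saddle points and local minima of a deep network's loss surface have no analogue here.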