TechEcho

9 comments

minimaxir, almost 9 years ago

Wait, Spark has built-in model hyperparameter selection (http://spark.apache.org/docs/latest/ml-tuning.html), which was not mentioned in the article. What advantages does your service offer?

Relatedly, why are you advocating using MLlib/RDDs when they have been deprecated in favor of ML/DataFrames (http://spark.apache.org/docs/latest/ml-guide.html)?

apathy, almost 9 years ago

I could give a shit about the hyperparameter tuning (CV... it Works For Me) but your writeup of Gaussian processes and why they are called kriging in spatial stats is awesome.

http://blog.sigopt.com/post/130275376068/sigopt-fundamentals-intuition-behind-gaussian

a1k0n, almost 9 years ago

So SigOpt was tuning rank (number of latent factors), number of iterations to run the algorithm (in my experience alternating least squares generally converges within 10-20 iterations, but there'd be no downside to running it longer unless it's overfitting), and the regularization strength.

What optimal parameters did it find for these?
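For readers who want to see how the iteration count and regularization strength interact, here is a minimal sketch of alternating least squares in plain Python. It is not the article's code and not Spark's implementation: the rank is fixed at 1 so each update has a scalar closed form, and the ratings, the function names `objective` and `als_rank1`, and the default values are all made up for illustration.

```python
def objective(ratings, u_f, v_f, reg):
    """Regularized squared error of a rank-1 factorization."""
    sq_err = sum((r - u_f[u] * v_f[i]) ** 2 for (u, i), r in ratings.items())
    penalty = reg * (sum(x * x for x in u_f.values())
                     + sum(x * x for x in v_f.values()))
    return sq_err + penalty


def als_rank1(ratings, reg=0.1, iterations=20):
    """Rank-1 ALS on a dict {(user, item): rating}; reg must be > 0.

    Returns (user_factors, item_factors, history), where history holds the
    regularized objective after each full sweep.
    """
    users = sorted({u for u, _ in ratings})
    items = sorted({i for _, i in ratings})
    u_f = {u: 1.0 for u in users}
    v_f = {i: 1.0 for i in items}
    history = []
    for _ in range(iterations):
        # Item factors fixed: each user factor is a 1-D ridge regression.
        for u in users:
            num = sum(r * v_f[i] for (uu, i), r in ratings.items() if uu == u)
            den = sum(v_f[i] ** 2 for (uu, i) in ratings if uu == u) + reg
            u_f[u] = num / den
        # User factors fixed: same closed-form update for each item factor.
        for i in items:
            num = sum(r * u_f[u] for (u, ii), r in ratings.items() if ii == i)
            den = sum(u_f[u] ** 2 for (u, ii) in ratings if ii == i) + reg
            v_f[i] = num / den
        history.append(objective(ratings, u_f, v_f, reg))
    return u_f, v_f, history


# Hypothetical toy ratings, purely for illustration.
ratings = {
    ("a", "x"): 5.0, ("a", "y"): 3.0,
    ("b", "x"): 4.0, ("b", "z"): 1.0,
    ("c", "y"): 2.0, ("c", "z"): 1.0,
}
u_f, v_f, history = als_rank1(ratings, reg=0.1, iterations=20)
for sweep, value in enumerate(history, start=1):
    print(sweep, round(value, 6))
```

Each update exactly minimizes the regularized objective over one block of variables, so the printed training objective can only go down or stay flat from sweep to sweep. That is why extra iterations never hurt the training fit; whether they hurt held-out error is a separate question that this sketch does not measure.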

Zephyr314, almost 9 years ago

I'm one of the co-founders of SigOpt (YC W15) and am happy to answer any questions about this post (or anything about SigOpt).

More info on the methods behind SigOpt can be found at https://sigopt.com/research.

apathy, almost 9 years ago

Oh, also, for students: https://sigopt.com/edu

I'm worried this is going to be like good Scotch for me.

blahi, almost 9 years ago

There is a package, mlrMBO, created by the great guys who created mlr (absolutely awesome for building pipelines, you will ditch caret in a second!). Not on Spark obviously, but thought some might find it useful.

https://github.com/mlr-org/mlrMBO

tachim, almost 9 years ago

How does SigOpt compare to GPs?

visarga, almost 9 years ago

https://sigopt.com/pricing

- Individual: $1,000/month
- Enterprise: Custom pricing

I am not a multi-million $ company, so I guess it's useless for me.

idewanck, almost 9 years ago

Post author here, happy to answer any questions as well.

Bayesian Optimization for Collaborative Filtering with MLlib
