
How to put machine learning models into production

120 points by Aaronmacaron over 4 years ago

8 comments

londons_explore over 4 years ago
Too many people focus on "properly" putting ML into production...

I'd like to propose an alternative... Build a model (once) on your dev machine. Copy it to S3. Do CPU inference in some microservice. Get the production system to query your microservice, and if it doesn't reply within some (very short) timeout, fall back to whatever behaviour your company was using before ML came along.

If the results of your ML can be saved (e.g. a per-customer score), save the output values for each customer and don't even run the ML in real time at all!

Don't handle retraining the model. Don't bother with high reliability or failover. Don't page anyone if it breaks.

By doing this, you get rid of 80% of the effort required to deploy an ML system, yet still get 80% of the gains. Sure, retraining the model hourly might be optimal, but for most businesses the gains simply don't pay for the complexity and ongoing maintenance.

Insider knowledge says some very big companies deploy the above strategy very successfully...
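The client-side half of this pattern fits in a few lines. A minimal sketch, assuming a hypothetical scoring endpoint and fallback value (neither comes from the comment; the service names are made up):

```python
import json
import urllib.error
import urllib.request

# Whatever heuristic the business used before ML came along (hypothetical value).
FALLBACK_SCORE = 0.5

def get_score(customer_id: str,
              endpoint: str = "http://ml-scorer.internal/score",
              timeout: float = 0.05) -> float:
    """Query the ML microservice; on any error or slow reply, fall back."""
    try:
        with urllib.request.urlopen(
                f"{endpoint}?customer={customer_id}", timeout=timeout) as resp:
            return json.load(resp)["score"]
    except (urllib.error.URLError, TimeoutError, OSError, KeyError, ValueError):
        # Service down, slow, or returned garbage: use the old behaviour.
        return FALLBACK_SCORE
```

Because every failure mode degrades to the pre-ML behaviour, nothing here needs failover or paging, which is the point of the comment.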
simonebrunozzi over 4 years ago
Overall, a well-written article.

If you're interested in ML Ops, I have a shameless plug to share: on November 19th I host a free online panel, "Rage Against the Machine Learning", with industry experts. [0]

[0]: https://cotacapital.zoom.us/webinar/register/8116020076218/WN_DIIptnvUQhi0AkSze_XhAw
gerbler over 4 years ago
There's a great paper from Google about this, "Machine Learning: The High Interest Credit Card of Technical Debt" [0], which discusses why you should use a framework to deploy ML models (the authors are involved in developing TFX).

In my experience, spending time explaining results to the business is also a very time-consuming part of deploying a model.

[0]: https://research.google/pubs/pub43146/
calebkaiser over 4 years ago
I was expecting this to be more about running inference in production, though the information in the article itself was interesting on its own.

There does seem to be a dearth of writing on the actual topic of deploying models as prediction APIs, however. I work on an open source ML deployment platform ( https://github.com/cortexlabs/cortex ) and the problems we spend the most time on / that teams struggle with the most don't seem to be written about very often, at least in depth (e.g. how do you optimize inference costs? When should you use batch vs realtime? How do you integrate retraining, validation, and deployment into a CI/CD pipeline for your ML service?).

Not taking away from the article, of course; it is well written and interesting imo.
dtjohnnyb over 4 years ago
I've recently come across the MLOps community here: https://mlops.community/

The meetups are all on YouTube and cover great topics like putting models into production, but also more interesting ones (to me) like ML observability and feature stores.

Their Slack channel is great too; I learned a lot about the reality of using Kubeflow vs the Medium-article hype.
steve_g over 4 years ago
As a practical detail, I'm wondering if it always makes sense to wrap your predictor in a simple if-then based predictor. If your learned model makes bad predictions in certain specific cases, you can "cheat" with Boolean logic. This could also be useful when the business has a special case that doesn't follow the main patterns.

Any thoughts on that?
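A thin wrapper along the lines steve_g describes might look like this sketch (the rule list and the stand-in model are made up for illustration):

```python
def predict_with_overrides(model_predict, features, rules):
    """Try each hand-written (condition, value) rule before the learned model."""
    for condition, forced_value in rules:
        if condition(features):
            return forced_value  # business special case wins
    return model_predict(features)

# Toy example: force a score of 0.0 for a known-bad segment.
rules = [(lambda f: f.get("segment") == "trial", 0.0)]
model = lambda f: 0.9  # stand-in for the real learned model
```

Calling `predict_with_overrides(model, {"segment": "trial"}, rules)` returns the forced 0.0, while any other segment falls through to the model. Keeping the rules outside the model also makes them easy to audit and remove once the model is retrained.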
fphhotchips over 4 years ago
The title doesn't really match the article in my mind. To me, it talks about everything *but* actually deploying a machine learning model in production. In particular, there are a lot of words about where training data is stored. In my experience, the training data is really more part of the development process than the actual productionisation of the model.

That said, there is a piece here on TFX, which is valuable in this context. I also think the advice about going with proprietary tools that speed up the process is good. Tools like Microsoft's AI tooling, Dataiku and H2O are good in that context.

I would have liked to have seen some discussion around when you should deploy a model as an API vs generating batch predictions and storing them. I've done both on a test bench, but I don't really know how well the API scales.
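For what it's worth, the batch alternative mentioned at the end is often just a scheduled job that scores everyone and writes the results to a lookup store; a minimal sketch (all names hypothetical):

```python
def batch_score(model_predict, customers):
    """Score every customer once; online serving becomes a dictionary lookup."""
    return {c["id"]: model_predict(c["features"]) for c in customers}

# Online path never touches the model, so it scales like any cache read:
#   score = precomputed.get(customer_id, FALLBACK)
```

The trade-off is staleness: predictions are only as fresh as the last batch run, whereas an API scores on current features.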
sandGorgon over 4 years ago
Is anyone running TFX in production at their company? How has the experience been?

Since seemingly everyone is on K8s, I'm wondering if Kubeflow isn't the more natural fit.