Ask HN: Cheap hosting for ML API with intermittent usage

1 point by mtoohig, over 4 years ago

Does a VPS/cloud provider exist that charges by the minute or allows flexible RAM/CPU allocation? Currently, I have a $5/month VM on Linode for my app, but now I want to add some ML features; however, running the ML API on the same VM as my app causes OOM errors and kills the process.

Of course I could purchase a larger VM at a greater fixed cost per month, but since this API will only be called a few hundred times per month, there must be an option for per-minute or per-call pricing while the API mostly sits idle waiting for a request.

I am aware of serverless, but the time to load the ML models for each call seems like it would take way too long to get a response. If I have a misunderstanding about serverless, please let me know.

And if it matters for any of the answers: I'm using FastAPI and Celery for the web side and task queue. I have YOLOv3 to detect objects of interest in an image, then I pass the object image to another model for OCR to predict the text it finds. I'm new to ML, so I've got a lot to learn and appreciate all the feedback.

1 comment

mindhash, over 4 years ago

Check out Algorithmia, if that works for you.

Instead of looking for ML-specific services, look at Docker or container options. It's better if they have instance warm-ups; that way you can launch the VM when necessary.

Also see if you can split your solution into two parts: one that gathers intelligence in an offline batch mode, and a second that uses this data to respond faster. This would work with some algorithms only, not so much for deep learning.

Explore deep learning compression techniques to reduce the size of the model.

There aren't many services for one-off models. It doesn't make economic sense for cloud providers.
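One possible reading of the two-part split is sketched below, assuming some inputs are known ahead of time: an offline batch job runs the slow pipeline and stores results keyed by a content hash, and the online path checks that store before falling back to the slow pipeline. `slow_pipeline`, `build_cache`, and `respond` are hypothetical names, and the pipeline body is a placeholder, not real detection or OCR.

```python
import hashlib
from typing import Dict, Iterable


def image_key(image_bytes: bytes) -> str:
    """Stable key for an image, derived from its content."""
    return hashlib.sha256(image_bytes).hexdigest()


def slow_pipeline(image_bytes: bytes) -> str:
    """Hypothetical stand-in for detection followed by OCR."""
    return f"text-for-{len(image_bytes)}-bytes"


def build_cache(images: Iterable[bytes]) -> Dict[str, str]:
    """Offline batch step: run the slow pipeline once per known image."""
    return {image_key(img): slow_pipeline(img) for img in images}


def respond(image_bytes: bytes, cache: Dict[str, str]) -> str:
    """Online step: answer from the cache when possible, else compute and store."""
    key = image_key(image_bytes)
    if key not in cache:
        cache[key] = slow_pipeline(image_bytes)
    return cache[key]
```

This only pays off when the same inputs recur. For freshly uploaded images every request misses the cache, which matches the caveat that the approach suits deep learning less well.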