TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Cheapest GPU provider to host fine-tuned models?

1 pointsby DidISayTooMuchover 1 year ago
Who provides cheapest GPU inferencing and hosting of fine-tuned models (7B size)? I already have the finetuned model ready, just looking for a cheap place to host and run inferencing.<p>I&#x27;ve looked at Replicate and Together.ai, they both provide really the best tools in this space, but hosting is expensive. Together costs about 1.4&#x2F;hr to host a 7B model. Replicate is more expensive.<p>Ideally, I wouldn&#x27;t be charged for idle time and only active time (replicate does this already, but your finetuned model needs to be based off of a limited set of base models)<p>Any recommendations?

1 comment

sjkoelleover 1 year ago
Following - we host our own models for a variety of architectures in vocal synthesis, and have tried using Replicate and Mystic as well.<p>Roll your own k8s? Predibase?
评论 #38658368 未加载