TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: How can I monetize a load balancer for ML applications?

2 点作者 stormbeard8 个月前
I have a solution that solves a set of problems that keep showing up in ML workloads. The kind of systems I&#x27;m talking about are ones where:<p>- You have a GPU attached to each instance.<p>- Each request takes anywhere from 10ms to 2min.<p>- There&#x27;s a hard limit on the number of in-flight requests&#x2F;queries (I assume because of the GPUs).<p>Normally, I see people fronting the instances with software load balancers, but this doesn&#x27;t work very well for reasons. Assuming I have a solution in the form of a fancy load balancer, how would I go about monetizing it? Let&#x27;s assume the solution is non-trivial to create, but very straightforward to use (essentially a drop-in replacement).<p>I ask because I don&#x27;t think I can just &quot;sell a fancy load balancer&quot; like it&#x27;s the late 90s or something. Modern companies appear to always have more complicated products and I just want to sell a straightforward piece of infrastructure that solves a fairly hard problem.<p>Thanks in advance.

4 条评论

sargstuff8 个月前
Is there accessible documentation which covers installation &amp; non-functional requirements (aka hardware&#x2F;software requirements &amp; how to setup&#x2F;use the solution)
评论 #41525123 未加载
sargstuff8 个月前
what was done to access &quot;doesn&#x27;t work very well for reasons&quot;? aka monitored systems in questions and saw ...... ?????<p>What were the &quot;reasons&quot; for &quot;doesn&#x27;t work very well? aka trying to do goolgle search type work on 2mb intel 486 oover a 2mb network and expecting to be able to compete with google is never going to work out.<p>What type of load balancing? Load balancing typically has to be tuned&#x2F;adjusted based on end usage requirements&#x2F;production environment (not just per factory setting)
评论 #41525042 未加载
talldayo8 个月前
&gt; but this doesn&#x27;t work very well for reasons.<p>Which reasons? In my experience&#x2F;exposure, people are perfectly happy with Proxmox on a big GPU-laden boxen.
评论 #41525095 未加载
JSDevOps8 个月前
Why doesn’t software load balancers work? Also surely you just implement queuing?