TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Will it be a viable business model to offer llama.cpp as a service?

2 点作者 nancyp超过 1 年前
Wrapped in a nicer UI for b2b SaaS model.

4 条评论

version_five超过 1 年前
Short answer, possibly, especially if it was say part of a rag system or some other architecture like that. There is room for more. Nothing particularly special about llama.cpp as the llm back end though. It&#x27;s optimized for running on lower-end hardware which matters less if you&#x27;re serving models as a service. But it has many strengths.<p>Llama.cpp &#x2F; ggml is the open core of ggml.ai founded by GG and funded by the guy from github, so they have some monetization plan for it.
fbnbr超过 1 年前
Why would you need it as cpp? Do you mean just llama or specifically a version small enough running on edge?
throw03172019超过 1 年前
Do you mean just a non-fine tuned chat bot based on llama 70b? Probably not.
quickthrower2超过 1 年前
I suspect no.