Ask HN: Does your business prefer token-based pricing?

1 point by eskibars 11 months ago
Currently, my product offers retrieval-augmented generation as a service. Anyone building LLM-based apps has probably noticed that usage costs are typically charged on a per-token basis, which seems very "fair" in that each token takes some time, and more tokens = more time = higher underlying costs.

However, in discussions I've had with some CIOs, the feedback is that this yields a very unpredictable billing cycle and that they have a hard time mapping tokens to business outcomes. This is why we originally priced by request + storage and modeled average token consumption across our users. We end up with larger margins on smaller requests/responses and smaller margins on larger requests/responses, which is obviously less fair but more predictable.

Curious how you feel about being charged per token. Are we making the right call by making things more predictable, or is it better to be less predictable but more directly reflect the underlying costs?
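
A minimal sketch of the margin effect described above, using made-up numbers. The per-token cost, the flat per-request price, and the function names are all assumptions for illustration; none of them come from the post, and storage charges are left out for simplicity.

```python
# Hypothetical numbers for illustration only; none come from the original post.
COST_PER_1K_TOKENS = 0.002      # assumed underlying cost to the provider, in USD
FLAT_PRICE_PER_REQUEST = 0.01   # assumed flat price charged per request, in USD


def underlying_cost(tokens: int) -> float:
    """Provider's cost for one request, assumed to scale linearly with tokens."""
    return tokens / 1000 * COST_PER_1K_TOKENS


def flat_margin(tokens: int) -> float:
    """Margin as a fraction of revenue under flat per-request pricing."""
    return (FLAT_PRICE_PER_REQUEST - underlying_cost(tokens)) / FLAT_PRICE_PER_REQUEST


def break_even_tokens() -> float:
    """Request size at which the flat price exactly covers the underlying cost."""
    return FLAT_PRICE_PER_REQUEST / COST_PER_1K_TOKENS * 1000


if __name__ == "__main__":
    # Small requests earn a large margin; large requests earn a small one.
    for tokens in (500, 2000, 4000):
        print(f"{tokens:>5} tokens: cost ${underlying_cost(tokens):.4f}, "
              f"margin {flat_margin(tokens):.0%}")
    print(f"break-even at about {break_even_tokens():.0f} tokens per request")
```

Under these assumed numbers the customer's bill is the same for every request, but the margin shrinks as requests grow and turns negative past the break-even size. Per-token pricing would keep the margin constant and move that variability onto the customer's bill instead.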

No comments yet.