Ask HN: What is currently the best way to host AI models?

1 point by rashidujang, about 1 year ago
AI infrastructure moves really quickly and best practices are constantly evolving, so I was wondering what HN's opinion is on the best way to host custom AI models for inference in Q1 2024.

1 comment

PaulHoule, about 1 year ago
A hard question to answer because we don't know your use case, what kind of models you are using, whether privacy is a consideration, all of that. I mean, it is one thing to do super low-power inference for wake words or something like that on the edge, another to do it on a customer's phone or PC, and another to have a huge model that runs on a cluster.