Original author here: thanks for posting.

I'm glad this is making the rounds, since I haven't seen a lot written about the "AI-DevOps" or infrastructure side of actually running an at-scale AI service. Many of the AI inference engines that offer an OpenAI-compatible API (like vLLM, llama.cpp, etc.) make it very approachable and cost effective. Today, this vLLM service backs all of our batching microservices, which scrape content and generate text for more than 40,000 repos on GitHub.

I'm happy to answer any and all questions!
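For anyone wondering what "OpenAI-compatible API" buys you in practice: a minimal sketch below, where the stock openai Python client is simply pointed at a local vLLM server. The port, model name, and prompt are placeholders for illustration, not our production setup.

    # Sketch: talking to a vLLM server through the standard OpenAI client.
    # Assumes vLLM was started with something like:
    #   vllm serve meta-llama/Llama-3.1-8B-Instruct --port 8000
    from openai import OpenAI

    client = OpenAI(
        base_url="http://localhost:8000/v1",  # point at vLLM instead of api.openai.com
        api_key="not-needed",                 # vLLM ignores the key unless --api-key is set
    )

    response = client.chat.completions.create(
        model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model name
        messages=[{"role": "user", "content": "Summarize this repo's README."}],
    )
    print(response.choices[0].message.content)

Because the wire format is the same, swapping between a hosted provider and your own inference cluster is mostly a matter of changing base_url, which is a big part of why this approach is so cost effective.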