TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

LLM Stack for 2024 – Initial Survey

3 pointsby yujianabout 1 year ago

3 comments

stevekaramabout 1 year ago
I think one of the biggest struggles small startups and practitioners are facing is lack of a good option between &quot;I wonder if this works&quot; and &quot;ready for prime time.&quot; Running locally is an option with consumer hardware but is cost prohibitive for a team. Cloud providers are full of complications and hidden costs. Tools like Friendli and Bento are good but ambiguous on costs and get difficult to price end-to-end once you need the full stack of options. Hugging Face inference endpoints and other tools still seem like the best option around along with cloud DBs like Zilliz.<p>That said, it&#x27;s no wonder people just pay extra for the simplicity of a slightly smarter endpoint like OpenAI. Sure, over time the costs are insane and you lack any flexibility to create a truly targeted solution, but it <i>feels</i> like an all-in-one easy fix.
yujianabout 1 year ago
Hi everyone, I put together this survey of tools for the LLM Stack in 2024. I&#x27;ve linked the friend-link for the Medium article in the URL. I&#x27;d love feedback from you guys about any tools I&#x27;ve missed.<p>If you&#x27;re a Medium member and want to support my writing, feel free to use the regular link - <a href="https:&#x2F;&#x2F;medium.com&#x2F;plain-simple-software&#x2F;the-llm-app-stack-2024-eac28b9dc1e7" rel="nofollow">https:&#x2F;&#x2F;medium.com&#x2F;plain-simple-software&#x2F;the-llm-app-stack-2...</a>
cybereporterabout 1 year ago
This is great! Out of curiosity, what&#x27;s the difference between choosing a dedicated vector database vs. a traditional database with vector indices (e.g. pgvector with postgres?
评论 #39432798 未加载