TechEcho

LoRAX: Open-Source Serving for 100s of Fine-Tuned LLMs in Production

8 points by magdyks over 1 year ago

2 comments

magdyks over 1 year ago
A great framework for serving many fine-tuned LLMs in production by quickly swapping adapters on the same base model (e.g., Llama-2-70b).
abhaym over 1 year ago
Whoa, this looks pretty cool. One question though: is there increased latency when you have multiple adapters on a single base model?