TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Mockingbird is an LLM that outperforms GPT4 on RAG

9 点作者 eskibars10 个月前
Hi HN!<p>I lead product at Vectara and we&#x27;ve just released a new LLM in our platform that outperforms GPT4 and Gemini 1.5 Pro on RAG tasks.<p>Vectara is a Retrieval Augmented Generation (RAG) platform primarily deployed as a SaaS service which includes a generous free tier so you can try it for free.<p>The way we&#x27;ve been able to offer a &quot;better but cheaper&quot; is that we focus a lot of our attention on taking smaller models (which can be hosted in a cost efficient way) and fine tuning them to specific tasks: in this case RAG. This ends up with a model that is <i>less</i> capable of arbitrary tasks like crafting creative stories, but for many enterprises we&#x27;ve learned they don&#x27;t see the &quot;creativity&quot; of LLMs as a positive, as they also result in hallucinations.<p>Would love feedback!

1 comment

llm-apprentice10 个月前
I think we all recognize that specialized models are the future. Great to see Vectara already on that path for RAG (enterprises)!