TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Retrieval powered by object storage: AMA

15 点作者 jeffchuber7 个月前
Hey everyone - I&#x27;m Jeff from the Chroma team. Chroma is the most popular open-source vector database.<p>Chroma now has a fully-distributed and serverless version backed by object-storage (also Apache 2.0).<p>More on why and how we built this here:<p><a href="https:&#x2F;&#x2F;www.trychroma.com&#x2F;engineering&#x2F;serverless" rel="nofollow">https:&#x2F;&#x2F;www.trychroma.com&#x2F;engineering&#x2F;serverless</a><p>AMA about AI, retrieval, systems, Rust, and more.

4 条评论

waawaawaa7 个月前
If I wanted to build a rag around a generative transformer, how does the retrieved vector plug into the transformer? Is it cross attention from an encoder or a prefix on the decoder, or sonething else? Does it plug in directly as the retrieved vector or is it somehow re-embedded?
SunlitCat7 个月前
Okay! I have a question, more like an observation! (Sorry I am maybe not your target audience, but to get things started)<p>Many buzzwords used and after reading your project page, I requested the help of ChatGPT to make any sense out of it!<p>Is what Chatgpt tells me about your interesting project right?<p>This is what it has to say:<p>Chroma&#x27;s serverless system is designed to help developers build AI applications without managing servers. It allows you to efficiently store and retrieve data using vectors (for AI models) and text queries, all without needing to handle infrastructure. The system is scalable, cost-effective, and optimized for AI workloads, so developers can focus on their projects rather than backend management.
评论 #41804105 未加载
johntash7 个月前
It seems like a really cool idea.<p>Is this (or will it be) part of the oss version of chroma? And would it work with the embedded version of chroma? From the article, it looks like it might require more server-side components now vs chromadb today
评论 #41805419 未加载
swyx7 个月前
what do you think about ColBERT? doing embedding on every token feels wrong in ways i can&#x27;t articulate - is there a time to use it and not use it, or is it the one retrieval technique to rule them all?
评论 #41804096 未加载