This title was a little misleading to me IMO because (maybe my skill issue) I associated "inferencing" with "generation".

After reading the article, it seems Pinecone now supports in-DB vectorization, a feature shared by:

- DataStax Astra DB: https://www.datastax.com/blog/simplifying-vector-embedding-generation-with-astra-vectorize (since May 2024)

- Weaviate: https://weaviate.io/blog/introducing-weaviate-embeddings (as of yesterday)
This post has some more technical info: https://www.pinecone.io/blog/integrated-inference/

Makes a lot of sense to me to combine embedding, retrieval, and reranking. I can imagine this being a way for them to differentiate themselves from the popular databases that have added support for vector search.
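To make that concrete, here is a rough sketch of the combined flow using the Pinecone Python client's inference namespace. The model names, index name, and response shapes are written from memory of their docs, so treat the specifics as assumptions rather than verified code:

    from pinecone import Pinecone

    pc = Pinecone(api_key="YOUR_API_KEY")
    docs = ["Pinecone now hosts embedding models", "Vector DBs keep adding features"]

    # 1. Embed documents with a Pinecone-hosted model (no separate embedding service)
    embeddings = pc.inference.embed(
        model="multilingual-e5-large",           # assumed hosted embedding model
        inputs=docs,
        parameters={"input_type": "passage"},
    )

    # 2. Upsert the resulting vectors into an existing index ("my-index" is hypothetical)
    index = pc.Index("my-index")
    index.upsert(vectors=[
        {"id": str(i), "values": e["values"], "metadata": {"text": t}}
        for i, (e, t) in enumerate(zip(embeddings, docs))
    ])

    # 3. Embed the query, retrieve candidates, then rerank them via the same client
    query = "which databases host their own embedding models?"
    q_emb = pc.inference.embed(
        model="multilingual-e5-large",
        inputs=[query],
        parameters={"input_type": "query"},
    )
    candidates = index.query(vector=q_emb[0]["values"], top_k=10, include_metadata=True)
    reranked = pc.inference.rerank(
        model="bge-reranker-v2-m3",              # assumed hosted reranker
        query=query,
        documents=[m["metadata"]["text"] for m in candidates["matches"]],
        top_n=3,
    )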
Can someone please explain how this works?

I assumed that a specific flavour of LLM was needed, an "embedding model", to generate the vectors. Is this announcement that Pinecone is adding their own?

Is it better or worse than the models here, for example? https://ollama.com/search?c=embedding
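For what it's worth, an embedding model just maps text to fixed-size vectors whose distances reflect semantic similarity; retrieval is then nearest-neighbor search over those vectors. A minimal sketch with sentence-transformers (my own choice of library and model, nothing to do with the announcement):

    from sentence_transformers import SentenceTransformer, util

    # A small, common open embedding model; any other would work the same way
    model = SentenceTransformer("all-MiniLM-L6-v2")

    docs = ["Pinecone announces integrated inference", "Recipe for sourdough bread"]
    query = "vector database hosting its own embedding models"

    # encode() turns text into dense vectors (384 dimensions for this model)
    doc_vectors = model.encode(docs)
    query_vector = model.encode(query)

    # Cosine similarity scores; the first document should score noticeably higher
    print(util.cos_sim(query_vector, doc_vectors))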
Nothing new, Marqo has been doing this for a while now with their all-in-one platform to train, embed, retrieve, and evaluate.

I've played around with Weaviate & Astra DB, but Marqo is the best and easiest solution imo.
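The Marqo pattern, roughly as I remember it from their README (model name and fields here are just their quickstart example, so double-check the details), is to name a model at index creation and let the client embed documents as they are added:

    import marqo

    mq = marqo.Client(url="http://localhost:8882")

    # The index is created with an embedding model; documents are vectorized on ingest
    mq.create_index("my-first-index", model="hf/e5-base-v2")

    mq.index("my-first-index").add_documents([
        {"Title": "The Travels of Marco Polo",
         "Description": "A 13th-century travelogue describing Polo's journeys"},
        {"Title": "Extravehicular Mobility Unit (EMU)",
         "Description": "The EMU is a spacesuit that provides environmental protection and life support for astronauts"},
    ], tensor_fields=["Description"])

    # Search embeds the query with the same model and runs vector retrieval
    results = mq.index("my-first-index").search(q="What is the best outfit to wear on the moon?")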
txtai (https://github.com/neuml/txtai) has had inline vectorization since 2020. It supports Transformers, llama.cpp, and LLM API services. It also has inline integration with LLM models and a built-in RAG pipeline.
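For comparison, the txtai version of inline vectorization looks roughly like this, assuming a recent release (the import path and defaults have changed across versions):

    from txtai import Embeddings

    # content=True stores the original text alongside the vectors
    embeddings = Embeddings(content=True)

    # index() vectorizes the documents inline with the default transformer model
    embeddings.index([
        "US tops 5 million confirmed virus cases",
        "Canada's last fully intact ice shelf has suddenly collapsed",
        "Beijing mobilises invasion craft along coast as Taiwan tensions escalate",
    ])

    # Semantic search over the indexed documents
    print(embeddings.search("natural disasters", 1))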