TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Vector Databases: Feature or Product?

12 点作者 AnhTho_FR3 个月前

3 条评论

marcusae3133 个月前
This is the same conversation we have when we add any specialized data store. Shortly after MongoDB, Cassandra, Couchbase, Elasticsearch, and Solr, critics asked couldn’t we just contort RDBMS to handle documents for flexible queries or searches. The history tells a different story. The scaling properties and feature expectations of the specialized stores culminating in tens of billions of market cap, and much more in terms of infrastructure spend.<p>Could the incumbents simply tack on vector features? Sure, that’s the JSONB story of Postgres. It’s the regex story of all RDBMS offerings after Oracle’s acquisition of Endeca suggested a real commercial opportunity for search-specific databases.<p>Vector-first storage engines have a place in the market as much as the tack on solutions do. PGVector will be good enough for most users, and Weaviate (or Milvus) will be better suited for the most ambitious, or those seeking the best dev ex.
评论 #43175876 未加载
hazalmestci3 个月前
There’s definitely some hype in any new data infrastructure trend (see: graph databases, time-series databases). But the problem vector DBs solve—retrieving context efficiently for LLMs—seems real enough. Maybe the question is whether LLM-native applications will be big enough to sustain a separate category. Is the industry moving toward general-purpose DBs incorporating vector search? or specialized vector DBs will still have a place in some time?
mjasay3 个月前
I know I&#x27;m biased on this, but it has always seemed obvious that vector search would be subsumed into other databases. At MongoDB we&#x27;ve made it easy to manage operational and vector data in the same place, simplifying data architecture. While I think we do this better than others, it&#x27;s also true that other vendors and communities (like Postgres with pgvector) have added vector capabilities and, frankly, always were going to do so. It&#x27;s just a natural extension. I don&#x27;t want to be dismissive of purpose-built vector databases, but they&#x27;re going to have to evolve to suit more general-purpose workloads. This could happen as Neo4j has done, e.g., making graphs a more general way of thinking about data. It will be interesting to see how it plays out.