vector-databases

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Vector Databases

向量数据库

Store and search embeddings for RAG, semantic search, and similarity applications.

为RAG、语义搜索和相似性应用存储并搜索嵌入向量。

Comparison

对比

Database	Best For	Filtering	Scale	Managed Option
Chroma	Local dev, prototyping	Yes	< 1M	No
FAISS	Max speed, GPU, batch	No	Billions	No
Qdrant	Production, hybrid search	Yes	Millions	Yes
Pinecone	Fully managed	Yes	Billions	Yes (only)
Weaviate	Hybrid search, GraphQL	Yes	Millions	Yes

数据库	最佳适用场景	支持过滤	规模	托管选项
Chroma	本地开发、原型开发	是	< 100万	无
FAISS	极致速度、GPU、批量处理	否	数十亿	无
Qdrant	生产环境、混合搜索	是	数百万	有
Pinecone	全托管	是	数十亿	有（仅托管）
Weaviate	混合搜索、GraphQL	是	数百万	有

Chroma

Embedded vector database for prototyping. No server needed.

Strengths: Zero-config, auto-embedding, metadata filtering, persistent storage Limitations: Not for production scale, single-node only

Key concept: Collections hold documents + embeddings + metadata. Auto-embeds text if no vectors provided.

用于原型开发的嵌入式向量数据库，无需服务器。

优势：零配置、自动生成嵌入向量、元数据过滤、持久化存储 局限性：不适合生产级规模，仅支持单节点

核心概念：集合（Collections）存储文档+嵌入向量+元数据。若未提供向量，会自动为文本生成嵌入向量。

FAISS (Facebook AI)

FAISS（Facebook AI）

Pure vector similarity - no metadata, no filtering, maximum speed.

Index types:

Flat: Exact search, small datasets (< 10K)
IVF: Inverted file, medium datasets (10K - 1M)
HNSW: Graph-based, good recall/speed tradeoff
PQ: Product quantization, memory efficient for billions

Strengths: Fastest, GPU support, scales to billions Limitations: No filtering, no metadata, vectors only

Key concept: Choose index based on dataset size. Trade accuracy for speed with approximate search.

纯向量相似性搜索 - 不支持元数据、不支持过滤，速度最快。

索引类型：

Flat：精确搜索，适用于小型数据集（<1万）
IVF：倒排文件，适用于中型数据集（1万 - 100万）
HNSW：基于图结构，召回率与速度的平衡最优
PQ：乘积量化，针对数十亿级向量的内存优化

优势：速度最快、支持GPU、可扩展至数十亿级向量 局限性：不支持过滤、无元数据，仅支持向量

核心概念：根据数据集大小选择索引类型。通过近似搜索在准确率与速度间做权衡。

Qdrant

Production-ready with rich filtering and hybrid search.

Strengths: Payload filtering, horizontal scaling, cloud option, gRPC API Limitations: More complex setup than Chroma

Key concept: "Payloads" are metadata attached to vectors. Filter during search, not after.

适用于生产环境，支持丰富的过滤和混合搜索。

优势：负载过滤、水平扩展、云服务选项、gRPC API 局限性：设置比Chroma更复杂

核心概念：“负载（Payloads）”是附加在向量上的元数据。在搜索过程中进行过滤，而非事后过滤。

Index Algorithm Concepts

索引算法概念

Algorithm	How It Works	Trade-off
Flat	Compare to every vector	Perfect recall, slow
IVF	Cluster vectors, search nearby clusters	Good recall, fast
HNSW	Graph of neighbors	Best recall/speed ratio
PQ	Compress vectors	Memory efficient, lower recall

算法	工作原理	权衡点
Flat	与每个向量逐一比较	召回率100%，速度慢
IVF	将向量聚类，仅搜索邻近簇	召回率良好，速度快
HNSW	基于邻居节点构建图结构	召回率与速度的平衡最佳
PQ	压缩向量	内存高效，召回率较低

Decision Guide

决策指南

Requirement	Recommendation
Quick prototype	Chroma
Metadata filtering	Chroma, Qdrant, Pinecone
Billions of vectors	FAISS
GPU acceleration	FAISS
Production deployment	Qdrant or Pinecone
Fully managed	Pinecone
On-premise control	Qdrant, Chroma

需求	推荐方案
快速原型开发	Chroma
元数据过滤	Chroma、Qdrant、Pinecone
数十亿级向量	FAISS
GPU加速	FAISS
生产环境部署	Qdrant 或 Pinecone
全托管	Pinecone
本地部署控制	Qdrant、Chroma

Resources

资源

Chroma: https://docs.trychroma.com
FAISS: https://github.com/facebookresearch/faiss
Qdrant: https://qdrant.tech/documentation/
Pinecone: https://docs.pinecone.io

Chroma: https://docs.trychroma.com
FAISS: https://github.com/facebookresearch/faiss
Qdrant: https://qdrant.tech/documentation/
Pinecone: https://docs.pinecone.io