alicloud-ai-search-milvus
Original:🇺🇸 English
Translated
1 scriptsChecked / no sensitive code detected
Use AliCloud Milvus (serverless) with PyMilvus to create collections, insert vectors, and run filtered similarity search. Optimized for Claude Code/Codex vector retrieval flows.
4installs
Sourcecinience/alicloud-skills
Added on
NPX Install
npx skill4agent add cinience/alicloud-skills alicloud-ai-search-milvusTags
Translated version includes tags in frontmatterSKILL.md Content
View Translation Comparison →Category: provider
AliCloud Milvus (Serverless) via PyMilvus
This skill uses standard PyMilvus APIs to connect to AliCloud Milvus and run vector search.
Prerequisites
- Install SDK (recommended in a venv to avoid PEP 668 limits):
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install --upgrade pymilvus- Provide connection via environment variables:
- (e.g.
MILVUS_URI)http://<host>:19530 - (
MILVUS_TOKEN)<username>:<password> - (default:
MILVUS_DB)default
Quickstart (Python)
python
import os
from pymilvus import MilvusClient
client = MilvusClient(
uri=os.getenv("MILVUS_URI"),
token=os.getenv("MILVUS_TOKEN"),
db_name=os.getenv("MILVUS_DB", "default"),
)
# 1) Create a collection
client.create_collection(
collection_name="docs",
dimension=768,
)
# 2) Insert data
items = [
{"id": 1, "vector": [0.01] * 768, "source": "kb", "chunk": 0},
{"id": 2, "vector": [0.02] * 768, "source": "kb", "chunk": 1},
]
client.insert(collection_name="docs", data=items)
# 3) Search
query_vectors = [[0.01] * 768]
res = client.search(
collection_name="docs",
data=query_vectors,
limit=5,
filter='source == "kb" and chunk >= 0',
output_fields=["source", "chunk"],
)
print(res)Script quickstart
bash
python skills/ai/search/alicloud-ai-search-milvus/scripts/quickstart.pyEnvironment variables:
MILVUS_URIMILVUS_TOKEN- (optional)
MILVUS_DB - (optional)
MILVUS_COLLECTION - (optional)
MILVUS_DIMENSION
Optional args: , , , .
--collection--dimension--limit--filterNotes for Claude Code/Codex
- Insert is async; wait a few seconds before searching newly inserted data.
- Keep vector aligned with your embedding model.
dimension - Use filters to enforce tenant scoping or dataset partitions.
Error handling
- Auth errors: check and instance permissions.
MILVUS_TOKEN - Dimension mismatch: ensure all vectors match collection dimension.
- Network errors: verify VPC/public access settings on the instance.
References
-
PyMilvusexamples for AliCloud Milvus
MilvusClient -
Source list:
references/sources.md