Loading...
Loading...
Understand images with Alibaba Cloud Model Studio Qwen VL models (qwen3-vl-plus/qwen3-vl-flash and latest aliases). Use when building image Q&A, visual analysis, OCR-like extraction, chart/table reading, or screenshot understanding workflows.
npx skill4agent add cinience/alicloud-skills alicloud-ai-multimodal-qwen-vlpython3 -m venv .venv
. .venv/bin/activate
python -m pip install requestsDASHSCOPE_API_KEYdashscope_api_key~/.alibabacloud/credentialsqwen3-vl-plusqwen3-vl-flashqwen3-vl-plus-latestqwen3-vl-plus-2025-12-19qwen3-vl-flash-latestqwen-vl-max-latestqwen-vl-plus-latestpromptimagedata:modelqwen3-vl-plusmax_tokens512temperature0.2detailautolowhighautojson_modeschemamax_retries429/5xx2retry_backoff_s1.5textmodelusagepython skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/scripts/analyze_image.py \
--request '{"prompt":"请概括这张图里的主要内容","image":"https://example.com/demo.jpg"}' \
--print-responsepython skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/scripts/analyze_image.py \
--request '{"prompt":"提取图片中的关键信息","image":"./samples/invoice.png","model":"qwen3-vl-plus"}' \
--print-responsepython skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/scripts/analyze_image.py \
--request '{"prompt":"提取字段: title, amount, date","image":"./samples/invoice.png"}' \
--json-mode \
--print-responsepython skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/scripts/analyze_image.py \
--request '{"prompt":"提取发票字段","image":"./samples/invoice.png"}' \
--schema skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/references/examples/invoice.schema.json \
--print-responsecurl -sS https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completions \
-H "Authorization: Bearer $DASHSCOPE_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model":"qwen3-vl-plus",
"messages":[
{
"role":"user",
"content":[
{"type":"image_url","image_url":{"url":"https://example.com/demo.jpg"}},
{"type":"text","text":"请描述这张图并列出可执行动作"}
]
}
],
"max_tokens":512,
"temperature":0.2
}'--outputoutput/ai-multimodal-qwen-vl/python tests/ai/multimodal/alicloud-ai-multimodal-qwen-vl-test/scripts/smoke_test_qwen_vl.py \
--image output/ai-image-qwen-image/images/vl_test_cat.png| Error | Likely cause | Action |
|---|---|---|
| 401/403 | Missing or invalid key | Check |
| 400 | Invalid request schema or unsupported image source | Validate |
| 429 | Rate limit | Retry with exponential backoff and lower concurrency. |
| 5xx | Temporary backend issue | Retry with backoff and idempotent request design. |
-latestreferences/sources.mdreferences/api_reference.md