alicloud-ai-multimodal-qvq

Category: provider

Model Studio QVQ Visual Reasoning

Validation

bash
mkdir -p output/alicloud-ai-multimodal-qvq
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txt
Pass criteria: the command exits with status 0 and output/alicloud-ai-multimodal-qvq/validate.txt is generated.
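
The same check can also be driven from Python, which may be handy where shell chaining with `&&` is awkward. The following is a minimal sketch, not part of the skill itself: the helper name `validate_script` is hypothetical, and it relies on the standard-library `py_compile.compile` with `doraise=True` so that a compile failure raises instead of only printing a message.

```python
import py_compile
from pathlib import Path

SCRIPT = Path("skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py")
OUTPUT_DIR = Path("output/alicloud-ai-multimodal-qvq")


def validate_script(script: Path = SCRIPT, output_dir: Path = OUTPUT_DIR) -> Path:
    """Byte-compile `script` and write the marker file on success (hypothetical helper)."""
    output_dir.mkdir(parents=True, exist_ok=True)  # equivalent of `mkdir -p`
    # doraise=True turns a compile failure into py_compile.PyCompileError
    py_compile.compile(str(script), doraise=True)
    marker = output_dir / "validate.txt"
    marker.write_text("py_compile_ok\n", encoding="utf-8")
    return marker
```

Calling `validate_script()` from the repository root mirrors the shell command: the marker file is only written when compilation succeeds.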

Critical model names

Use one of these exact model strings:
  • qvq-plus
  • qvq-max
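
Because the strings must match exactly, a small guard can catch typos or casing mistakes before a request is built. This is an illustrative sketch only; `ALLOWED_MODELS` and `check_model_name` are hypothetical names, and the set contains just the two strings listed above.

```python
ALLOWED_MODELS = frozenset({"qvq-plus", "qvq-max"})


def check_model_name(model: str) -> str:
    """Return `model` unchanged if it is an exact match, else raise ValueError."""
    if model not in ALLOWED_MODELS:
        raise ValueError(
            f"unsupported model {model!r}; expected one of {sorted(ALLOWED_MODELS)}"
        )
    return model
```

The comparison is deliberately strict: `"QVQ-Max"` or a value with trailing whitespace is rejected rather than normalized.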

Typical use

  • Mathematical reasoning from screenshots
  • Diagram and chart reasoning
  • Visually grounded multi-step problem solving

Quick start

bash
python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py \
  --output output/alicloud-ai-multimodal-qvq/request.json
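
To call the script from another Python program, something like the sketch below could work. It assumes only what the command above shows (the script path and the `--output` flag); the helper name `prepare_request` is hypothetical, and treating the generated file as JSON is an assumption based on its `.json` name, not on documented behavior.

```python
import json
import subprocess
import sys
from pathlib import Path

SCRIPT = Path("skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py")
OUTPUT = Path("output/alicloud-ai-multimodal-qvq/request.json")


def prepare_request(script: Path = SCRIPT, output: Path = OUTPUT):
    """Run the preparation script and return the parsed request (hypothetical helper)."""
    output.parent.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if the script exits non-zero
    subprocess.run([sys.executable, str(script), "--output", str(output)], check=True)
    # Assumption: the generated file is valid JSON, as its .json name suggests
    return json.loads(output.read_text(encoding="utf-8"))
```

Using `sys.executable` keeps the child process on the same interpreter as the caller, which avoids surprises with virtual environments.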

Notes

  • Use skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ for standard image understanding.
  • Use QVQ when the task explicitly needs stronger reasoning over visual evidence.
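
The routing rule in these notes can be expressed as a tiny helper. This is a sketch under stated assumptions: `pick_skill` and the boolean flag `needs_visual_reasoning` are hypothetical, and deciding whether a task needs stronger reasoning is left to the caller.

```python
QWEN_VL_SKILL = "skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/"
QVQ_SKILL = "skills/ai/multimodal/alicloud-ai-multimodal-qvq/"


def pick_skill(needs_visual_reasoning: bool) -> str:
    """Return the skill directory suggested by the notes (hypothetical helper)."""
    # QVQ only when the task explicitly needs stronger reasoning over visual evidence;
    # everything else falls back to standard image understanding with Qwen-VL.
    return QVQ_SKILL if needs_visual_reasoning else QWEN_VL_SKILL
```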

References

  • references/sources.md