aliyun-pixverse-generation

Category: provider

Model Studio Aishi Video Generation

Validation

bash
mkdir -p output/aliyun-pixverse-generation
python -m py_compile skills/ai/video/aliyun-pixverse-generation/scripts/prepare_aishi_request.py && echo "py_compile_ok" > output/aliyun-pixverse-generation/validate.txt
Pass criteria: the command exits with status 0 and output/aliyun-pixverse-generation/validate.txt is generated.

Output And Evidence

  • Save normalized request payloads, the chosen model variant, and task polling snapshots under output/aliyun-pixverse-generation/.
  • Record region, resolution/size, duration, and whether audio generation was enabled.
Use Aishi when the user explicitly wants the non-Wan PixVerse family for video generation.
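
The evidence items listed in this section could be collected into a single JSON record. The following is a minimal sketch, not the skill's own implementation: the function name save_evidence, the file name evidence.json, and the field names in the record are all assumptions made for illustration.

```python
import json
import time
from pathlib import Path


def save_evidence(out_dir, request_payload, model, region, size_or_resolution,
                  duration, audio_enabled, poll_snapshots):
    """Write one JSON evidence record into out_dir and return its path."""
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    record = {
        "saved_at": time.strftime("%Y-%m-%dT%H:%M:%SZ", time.gmtime()),
        "model": model,
        "region": region,
        "size_or_resolution": size_or_resolution,
        "duration": duration,
        "audio_enabled": audio_enabled,
        "request": request_payload,
        "poll_snapshots": poll_snapshots,
    }
    # "evidence.json" is a hypothetical file name, not one the skill defines.
    target = out_path / "evidence.json"
    target.write_text(json.dumps(record, indent=2, ensure_ascii=False),
                      encoding="utf-8")
    return target
```

Calling it with out_dir set to output/aliyun-pixverse-generation keeps the record next to the other artifacts.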
  • 将标准化请求负载、选定的模型变体以及任务轮询快照保存至
    output/aliyun-pixverse-generation/
    目录下。
  • 记录区域、分辨率/尺寸、时长以及是否启用音频生成。
当用户明确要求使用非Wan系列的PixVerse家族模型进行视频生成时,请使用爱诗(Aishi)。

Critical model names

Use one of these exact model strings:
  • pixverse/pixverse-v5.6-t2v
  • pixverse/pixverse-v5.6-it2v
  • pixverse/pixverse-v5.6-kf2v
  • pixverse/pixverse-v5.6-r2v
Selection guidance:
  • Use pixverse/pixverse-v5.6-t2v for text-only generation.
  • Use pixverse/pixverse-v5.6-it2v for first-frame image-to-video.
  • Use pixverse/pixverse-v5.6-kf2v for first-frame + last-frame transitions.
  • Use pixverse/pixverse-v5.6-r2v for multi-image character/style consistency.
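
The selection guidance above can be expressed as a small lookup. This is a sketch under stated assumptions: the function choose_model and its parameters are hypothetical, and the precedence (reference images first, then frames) is one reasonable reading of the guidance rather than documented behavior.

```python
MODELS = {
    "t2v": "pixverse/pixverse-v5.6-t2v",
    "it2v": "pixverse/pixverse-v5.6-it2v",
    "kf2v": "pixverse/pixverse-v5.6-kf2v",
    "r2v": "pixverse/pixverse-v5.6-r2v",
}


def choose_model(has_first_frame=False, has_last_frame=False, reference_count=0):
    """Map the kind of input available to one of the exact model strings."""
    if reference_count > 0:
        # Multi-image character/style consistency.
        return MODELS["r2v"]
    if has_last_frame and not has_first_frame:
        # The guidance names no variant for a last frame on its own.
        raise ValueError("a last frame needs a matching first frame (kf2v)")
    if has_first_frame and has_last_frame:
        return MODELS["kf2v"]
    if has_first_frame:
        return MODELS["it2v"]
    # No images at all: text-only generation.
    return MODELS["t2v"]
```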

Prerequisites

  • This model family is currently available only in the China mainland (Beijing) region.
  • Install SDK or call HTTP directly:
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
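
The two credential sources above could be resolved as in the following sketch. It assumes, without confirmation from this document, that the environment variable takes precedence and that the credentials file is INI-style with the key stored under some section such as [default]. The function name resolve_api_key is hypothetical.

```python
import configparser
import os
from pathlib import Path


def resolve_api_key(credentials_path=None, environ=None):
    """Return the API key from the environment or the credentials file, else None."""
    environ = os.environ if environ is None else environ
    key = environ.get("DASHSCOPE_API_KEY")
    if key:
        return key

    path = (Path(credentials_path) if credentials_path
            else Path.home() / ".alibabacloud" / "credentials")
    if not path.is_file():
        return None

    # Assumption: INI layout. Interpolation is disabled so a literal "%" in a
    # key is not treated as a substitution marker.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        return None
    for section in parser.sections():
        if parser.has_option(section, "dashscope_api_key"):
            return parser.get(section, "dashscope_api_key")
    return None
```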

Normalized interface (video.generate)

Request

  • model (string, required)
  • prompt (string, optional for it2v, required for other variants)
  • media (array<object>, optional)
  • size (string, optional): direct pixel size such as 1280*720, used by t2v and r2v
  • resolution (string, optional): 360P / 540P / 720P / 1080P, used by it2v and kf2v
  • duration (int, required): 5 / 8 / 10, except that 1080P supports only 5 / 8
  • audio (bool, optional)
  • watermark (bool, optional)
  • seed (int, optional)
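
The field rules above lend themselves to a pre-submission check. The sketch below is illustrative only: validate_request is a hypothetical helper, and treating size on it2v/kf2v (or resolution on t2v/r2v) as an error is an assumption about how strictly the rules should be applied.

```python
import re

MODEL_VARIANTS = {
    "pixverse/pixverse-v5.6-t2v": "t2v",
    "pixverse/pixverse-v5.6-it2v": "it2v",
    "pixverse/pixverse-v5.6-kf2v": "kf2v",
    "pixverse/pixverse-v5.6-r2v": "r2v",
}
RESOLUTIONS = {"360P", "540P", "720P", "1080P"}
DURATIONS = {5, 8, 10}


def validate_request(req):
    """Return a list of problems; an empty list means the rules above are met."""
    variant = MODEL_VARIANTS.get(req.get("model"))
    if variant is None:
        return ["model must be one of: " + ", ".join(sorted(MODEL_VARIANTS))]

    problems = []
    if variant != "it2v" and not req.get("prompt"):
        problems.append(f"prompt is required for {variant}")

    if variant in ("t2v", "r2v"):
        if "resolution" in req:
            problems.append(f"{variant} uses size, not resolution")
        size = req.get("size")
        if size is not None and not re.fullmatch(r"\d+\*\d+", str(size)):
            problems.append("size must look like 1280*720")
    else:
        if "size" in req:
            problems.append(f"{variant} uses resolution, not size")
        resolution = req.get("resolution")
        if resolution is not None and resolution not in RESOLUTIONS:
            problems.append("resolution must be 360P, 540P, 720P, or 1080P")

    duration = req.get("duration")
    if duration not in DURATIONS:
        problems.append("duration must be 5, 8, or 10")
    elif req.get("resolution") == "1080P" and duration == 10:
        problems.append("1080P supports only durations 5 and 8")
    return problems
```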

Response

  • task_id (string)
  • task_status (string)
  • video_url (string, when finished)

Endpoint and execution model

  • Submit task: POST https://dashscope.aliyuncs.com/api/v1/services/aigc/video-generation/video-synthesis
  • Poll task: GET https://dashscope.aliyuncs.com/api/v1/tasks/{task_id}
  • HTTP calls are async only and must set the header X-DashScope-Async: enable.
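
The submit-then-poll flow above might look like the following standard-library sketch. Several details are assumptions not stated in this document: the Bearer authorization header, the task_status being nested under an output object in the poll response, and the set of terminal status strings. The request body is passed in by the caller because its exact shape is not specified here. All function names are hypothetical.

```python
import json
import time
import urllib.request

BASE_URL = "https://dashscope.aliyuncs.com/api/v1"
SUBMIT_URL = BASE_URL + "/services/aigc/video-generation/video-synthesis"
# Assumption: statuses after which polling should stop.
TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "CANCELED", "UNKNOWN"}


def build_submit_request(api_key, body):
    """Build the async POST; the X-DashScope-Async header is mandatory."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": "Bearer " + api_key,  # assumed auth scheme
            "Content-Type": "application/json",
            "X-DashScope-Async": "enable",
        },
    )


def build_poll_request(api_key, task_id):
    """Build the GET that fetches the current task snapshot."""
    return urllib.request.Request(
        f"{BASE_URL}/tasks/{task_id}",
        method="GET",
        headers={"Authorization": "Bearer " + api_key},
    )


def send(request, timeout=30):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def wait_for_task(api_key, task_id, fetch=send, interval=10.0, max_polls=60):
    """Poll until a terminal status is seen; return the last snapshot."""
    for _ in range(max_polls):
        snapshot = fetch(build_poll_request(api_key, task_id))
        # Assumption: the status lives at output.task_status.
        status = snapshot.get("output", {}).get("task_status")
        if status in TERMINAL_STATUSES:
            return snapshot
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

The fetch parameter exists so the polling loop can be exercised with a stub instead of real network calls.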

Quick start

Text-to-video:
bash
python skills/ai/video/aliyun-pixverse-generation/scripts/prepare_aishi_request.py \
  --model pixverse/pixverse-v5.6-t2v \
  --prompt "A compact robot walks through a rainy neon alley." \
  --size "1280*720" \
  --duration 5
Image-to-video:
bash
python skills/ai/video/aliyun-pixverse-generation/scripts/prepare_aishi_request.py \
  --model pixverse/pixverse-v5.6-it2v \
  --prompt "The turtle swims slowly as the camera rises." \
  --media image_url=https://example.com/turtle.webp \
  --resolution 720P \
  --duration 5

Operational guidance

  • t2v and r2v use size; it2v and kf2v use resolution.
  • For kf2v, provide exactly one first_frame and one last_frame.
  • For r2v, you can pass up to 7 reference images.
  • Aishi returns a task ID first; do not treat the initial response as the final video result.
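
The media rules for kf2v and r2v can be checked before a request is built. In this sketch the shape of a media entry is an assumption: each entry is taken to be a dict with a hypothetical "type" key holding values such as first_frame or last_frame, which may not match the real payload. The helper name check_media is also hypothetical.

```python
from collections import Counter

MAX_REFERENCE_IMAGES = 7


def check_media(variant, media):
    """Return a list of problems with the media list for the given variant."""
    # Assumption: every media entry is a dict carrying a "type" key.
    counts = Counter(item.get("type") for item in media)
    problems = []
    if variant == "kf2v":
        if counts["first_frame"] != 1:
            problems.append("kf2v needs exactly one first_frame")
        if counts["last_frame"] != 1:
            problems.append("kf2v needs exactly one last_frame")
    elif variant == "r2v":
        if len(media) > MAX_REFERENCE_IMAGES:
            problems.append(f"r2v accepts at most {MAX_REFERENCE_IMAGES} reference images")
    return problems
```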

Output location

  • Default output: output/aliyun-pixverse-generation/request.json
  • Override base dir with OUTPUT_DIR.
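
One way to read the override rule above is sketched below. It assumes that OUTPUT_DIR is an environment variable and that it replaces only the leading output directory, leaving the skill subdirectory and file name unchanged; the script may resolve the path differently. The function name request_output_path is hypothetical.

```python
import os
from pathlib import Path


def request_output_path(environ=None):
    """Resolve where request.json is written, honoring OUTPUT_DIR if set."""
    environ = os.environ if environ is None else environ
    # Assumption: OUTPUT_DIR swaps out the "output" base directory only.
    base = Path(environ.get("OUTPUT_DIR") or "output")
    return base / "aliyun-pixverse-generation" / "request.json"
```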

References

  • references/sources.md