alicloud-ai-audio-tts-voice-design

Category: provider

Model Studio Qwen TTS Voice Design

Use voice design models to create controllable synthetic voices from natural language descriptions.

Critical model names

Use one of these exact model strings:
  • qwen3-tts-vd-2026-01-26
  • qwen3-tts-vd-realtime-2025-12-16
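
Because the model strings must match exactly, a small guard can catch typos before a request is sent. The sketch below is illustrative only: the constant and the `check_model` helper are hypothetical names, not part of the DashScope SDK.

```python
# Model strings copied verbatim from the list above.
VOICE_DESIGN_MODELS = (
    "qwen3-tts-vd-2026-01-26",
    "qwen3-tts-vd-realtime-2025-12-16",
)


def check_model(name: str) -> str:
    """Return name unchanged if it is a listed model string, else raise ValueError.

    Hypothetical helper for illustration; not an SDK function.
    """
    if name not in VOICE_DESIGN_MODELS:
        raise ValueError(
            f"unknown voice design model {name!r}; "
            f"expected one of {VOICE_DESIGN_MODELS}"
        )
    return name
```

Calling `check_model("qwen3-tts-vd")` raises `ValueError`, since the date suffix is part of the exact string.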

Prerequisites

  • Install the SDK in a virtual environment:
      python3 -m venv .venv
      . .venv/bin/activate
      python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
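
The lookup order described above (environment variable first, then the credentials file) might be sketched as follows. This is a sketch under assumptions: the source does not specify the layout of ~/.alibabacloud/credentials, so the code assumes an INI-style file and accepts `dashscope_api_key` from any section. `resolve_api_key` is a hypothetical helper, not an SDK function.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_api_key(
    env: Optional[Mapping[str, str]] = None,
    credentials_path: Optional[Path] = None,
) -> Optional[str]:
    """Return the API key from the environment, else from the credentials file."""
    env = os.environ if env is None else env
    key = env.get("DASHSCOPE_API_KEY")
    if key:
        return key

    if credentials_path is None:
        credentials_path = Path.home() / ".alibabacloud" / "credentials"
    if not credentials_path.is_file():
        return None

    # Assumption: the file is INI-formatted. Interpolation is disabled so
    # that a '%' inside a key is read literally.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(credentials_path, encoding="utf-8")
    except configparser.Error:
        return None  # not a parseable INI file

    # Iterating the parser yields DEFAULT first, then each named section.
    for section in parser:
        value = parser.get(section, "dashscope_api_key", fallback=None)
        if value:
            return value
    return None
```

Passing `env` and `credentials_path` explicitly keeps the function easy to test without touching the real environment.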

Normalized interface (tts.voice_design)

Request

  • voice_prompt (string, required): target voice description
  • text (string, required): text to synthesize
  • stream (bool, optional): request streaming output
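
As an illustration of the request shape, the three fields can be assembled and checked before serialization. This is a minimal sketch, not the helper script's actual implementation: `build_request` is a hypothetical name, and defaulting `stream` to `False` is an assumption, since the source only marks it optional.

```python
import json
from typing import Any, Dict


def build_request(voice_prompt: str, text: str, stream: bool = False) -> Dict[str, Any]:
    """Build a normalized tts.voice_design request dict.

    Raises ValueError if a required string field is missing or blank.
    """
    for name, value in (("voice_prompt", voice_prompt), ("text", text)):
        if not isinstance(value, str) or not value.strip():
            raise ValueError(f"{name} is required and must be a non-empty string")
    if not isinstance(stream, bool):
        raise TypeError("stream must be a bool")
    return {"voice_prompt": voice_prompt, "text": text, "stream": stream}


if __name__ == "__main__":
    request = build_request(
        voice_prompt="A warm female host voice, clear articulation, medium pace",
        text="这是音色设计演示",
    )
    # ensure_ascii=False keeps non-ASCII text readable in the JSON output.
    print(json.dumps(request, ensure_ascii=False, indent=2))
```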

Response

  • audio_url (string) or streaming PCM chunks
  • voice_id (string)
  • request_id (string)
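
A response check along these lines could confirm the fields listed above. It is a sketch under assumptions the source does not confirm: that `voice_id` and `request_id` are present in both modes, and that `audio_url` is expected only when the response was not streamed. `validate_response` is a hypothetical helper.

```python
from typing import Any, Mapping


def validate_response(response: Mapping[str, Any], streamed: bool = False) -> None:
    """Raise ValueError if the normalized response lacks an expected field."""
    # Assumption: both identifiers are returned in streaming and
    # non-streaming mode.
    for field in ("voice_id", "request_id"):
        value = response.get(field)
        if not isinstance(value, str) or not value:
            raise ValueError(f"response field {field!r} must be a non-empty string")

    # Assumption: audio_url is only present when audio was not delivered
    # as streaming PCM chunks.
    if not streamed:
        audio_url = response.get("audio_url")
        if not isinstance(audio_url, str) or not audio_url:
            raise ValueError("response field 'audio_url' must be a non-empty string")
```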

Operational guidance

  • Write voice prompts that specify tone, pace, emotion, and timbre.
  • Build a reusable voice prompt library for product consistency.
  • Validate the generated voice on short utterances before running long scripts.
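
One way to keep prompts consistent is to store the four attributes as structured data and render the prompt string from them. The sketch below is illustrative: `VoiceSpec`, `VOICE_LIBRARY`, the entry name, and the rendered phrasing are all hypothetical, and the service does not require any particular prompt format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceSpec:
    """The four attributes a voice prompt should pin down."""

    timbre: str
    tone: str
    pace: str
    emotion: str

    def to_prompt(self) -> str:
        # Render a single natural-language description from the fields.
        return (
            f"{self.timbre}, {self.tone} tone, "
            f"{self.pace} pace, {self.emotion} emotion"
        )


# A small reusable library keyed by product-facing voice name.
VOICE_LIBRARY = {
    "host_warm": VoiceSpec(
        timbre="A warm female host voice",
        tone="clear and friendly",
        pace="medium",
        emotion="calm",
    ),
}

# A short test utterance to try with each voice before long scripts.
SMOKE_TEST_TEXT = "Hello, and welcome."
```

Rendering every library entry against the same short utterance gives a quick comparison before committing to a long script.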

Local helper script

Prepare a normalized request JSON and validate the response schema:
    .venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
      --voice-prompt "A warm female host voice, clear articulation, medium pace" \
      --text "这是音色设计演示"

Output location

  • Default output: output/ai-audio-tts-voice-design/audio/
  • Override the base dir with OUTPUT_DIR.
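
The override might resolve as shown below. This is a sketch under an assumption: the source says OUTPUT_DIR overrides the "base dir", which is read here as replacing only the leading `output` component while keeping the skill-specific subpath. `resolve_output_dir` is a hypothetical helper.

```python
import os
from pathlib import Path
from typing import Mapping, Optional

DEFAULT_BASE = "output"
# Assumption: this subpath is kept when the base dir is overridden.
SKILL_SUBDIR = Path("ai-audio-tts-voice-design") / "audio"


def resolve_output_dir(env: Optional[Mapping[str, str]] = None) -> Path:
    """Return the audio output directory, honoring OUTPUT_DIR if it is set."""
    env = os.environ if env is None else env
    base = Path(env.get("OUTPUT_DIR") or DEFAULT_BASE)
    return base / SKILL_SUBDIR
```

The function only computes the path. Callers that write files can create the directory with `resolve_output_dir().mkdir(parents=True, exist_ok=True)`.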

References

  • references/sources.md