alicloud-ai-audio-tts-voice-design
Category: provider
Model Studio Qwen TTS Voice Design
Use voice design models to create controllable synthetic voices from natural language descriptions.
Critical model names
Use one of these exact model strings:
- `qwen3-tts-vd-2026-01-26`
- `qwen3-tts-vd-realtime-2025-12-16`
Prerequisites
- Install the SDK in a virtual environment:
```bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
```
- Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`.
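The lookup order described above could be sketched as follows. This is a minimal illustration, not the skill's own loader: `resolve_api_key` is a hypothetical helper, and it assumes the credentials file is INI-formatted with a `[default]` section, which the text above does not confirm.

```python
import configparser
import os
from pathlib import Path


def resolve_api_key(credentials_path=None, profile="default"):
    """Return the DashScope API key, or None if it cannot be found."""
    # 1. The environment variable takes priority.
    key = os.environ.get("DASHSCOPE_API_KEY")
    if key:
        return key.strip()

    # 2. Fall back to the credentials file.
    #    Assumption: INI layout with a [default] section.
    path = Path(credentials_path or Path.home() / ".alibabacloud" / "credentials")
    if not path.is_file():
        return None
    parser = configparser.ConfigParser()
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        # The file exists but is not in the assumed INI format.
        return None
    if parser.has_option(profile, "dashscope_api_key"):
        return parser.get(profile, "dashscope_api_key").strip()
    return None
```

Calling `resolve_api_key()` with no arguments checks the environment first and then the default credentials path.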
Normalized interface (tts.voice_design)
Request
- `voice_prompt` (string, required): target voice description
- `text` (string, required)
- `stream` (bool, optional)
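A normalized request with these three fields might be assembled as in the sketch below. `build_voice_design_request` is a hypothetical helper, not part of the DashScope SDK. Including a `model` key in the request, and choosing the realtime model when `stream` is true, are both assumptions made for illustration.

```python
import json

# Exact model strings listed in this document.
MODEL_BATCH = "qwen3-tts-vd-2026-01-26"
MODEL_REALTIME = "qwen3-tts-vd-realtime-2025-12-16"


def build_voice_design_request(voice_prompt, text, stream=False):
    """Validate the fields and return a normalized request dict."""
    if not isinstance(voice_prompt, str) or not voice_prompt.strip():
        raise ValueError("voice_prompt is required and must be a non-empty string")
    if not isinstance(text, str) or not text.strip():
        raise ValueError("text is required and must be a non-empty string")
    if not isinstance(stream, bool):
        raise ValueError("stream must be a bool")
    return {
        # Assumption: the realtime model is the one to use when streaming.
        "model": MODEL_REALTIME if stream else MODEL_BATCH,
        "voice_prompt": voice_prompt.strip(),
        "text": text,
        "stream": stream,
    }


request = build_voice_design_request(
    "A warm female host voice, clear articulation, medium pace",
    "这是音色设计演示",
)
print(json.dumps(request, ensure_ascii=False, indent=2))
```

Rejecting empty fields before any network call keeps malformed requests from consuming quota.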
Response
- `audio_url` (string) or streaming PCM chunks
- `voice_id` (string)
- `request_id` (string)
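A response check against these fields could look like the following sketch. `validate_voice_design_response` is a hypothetical name. The sketch assumes that a streaming response delivers PCM chunks in place of `audio_url`, so that field is only required in the non-streaming case.

```python
def validate_voice_design_response(response, stream=False):
    """Return a list of schema problems; an empty list means no problems were found."""
    problems = []
    required = ["voice_id", "request_id"]
    if not stream:
        # Assumption: audio_url is only present when the audio is not streamed.
        required.append("audio_url")
    for field in required:
        value = response.get(field)
        if not isinstance(value, str) or not value:
            problems.append(f"{field} must be a non-empty string")
    return problems


ok = {"audio_url": "https://example.com/a.wav", "voice_id": "v-1", "request_id": "r-1"}
print(validate_voice_design_response(ok))
```

Returning a list of problems rather than raising on the first one makes it easier to log every missing field at once.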
Operational guidance
- Write voice prompts with tone, pace, emotion, and timbre constraints.
- Build a reusable voice prompt library for product consistency.
- Validate generated voice in short utterances before long scripts.
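One way to keep voice prompts reusable is to store the four constraints as structured fields and render the prompt text from them. The sketch below is illustrative only: `VoiceSpec`, `VOICE_LIBRARY`, and `short_sample` are hypothetical names, and the prompt wording is an example rather than a format the models require.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceSpec:
    """One reusable voice description, split into the four constraints."""
    timbre: str
    tone: str
    pace: str
    emotion: str

    def to_prompt(self):
        # Render the constraints as a single natural language description.
        return f"{self.timbre}, {self.tone} tone, {self.pace} pace, {self.emotion} emotion"


# A small library keyed by product role, so every script uses the same wording.
VOICE_LIBRARY = {
    "host_warm": VoiceSpec("A warm female host voice", "friendly", "medium", "calm"),
    "narrator_deep": VoiceSpec("A deep male narrator voice", "formal", "slow", "serious"),
}


def short_sample(script, max_chars=60):
    """Return the first line of a script, truncated, for a quick listening check."""
    lines = script.strip().splitlines()
    first_line = lines[0] if lines else ""
    return first_line[:max_chars]


print(VOICE_LIBRARY["host_warm"].to_prompt())
```

Synthesizing `short_sample(script)` first gives a cheap check of the voice before committing to the full script.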
Local helper script
Prepare a normalized request JSON and validate the response schema:
```bash
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "这是音色设计演示"
```
Output location
- Default output: `output/ai-audio-tts-voice-design/audio/`
- Override base dir with `OUTPUT_DIR`.
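The override could be resolved as in this sketch. `resolve_audio_dir` is a hypothetical helper, and it assumes `OUTPUT_DIR` replaces only the leading `output` component while the `ai-audio-tts-voice-design/audio` suffix stays fixed; the helper script may behave differently.

```python
import os
from pathlib import Path


def resolve_audio_dir(env=None):
    """Return the audio output directory, honoring an OUTPUT_DIR override."""
    env = os.environ if env is None else env
    # Assumption: OUTPUT_DIR replaces the "output" base directory only.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "ai-audio-tts-voice-design" / "audio"


print(resolve_audio_dir({}))
print(resolve_audio_dir({"OUTPUT_DIR": "/tmp/tts-run"}))
```

Passing the environment as a parameter keeps the function easy to test without mutating `os.environ`.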
References
references/sources.md