aliyun-qwen-tts-voice-design
Category: provider
Model Studio Qwen TTS Voice Design
Use voice design models to create controllable synthetic voices from natural language descriptions.
Critical model names
Use one of these exact model strings:
- `qwen3-tts-vd-2026-01-26`
- `qwen3-tts-vd-realtime-2026-01-15`
Prerequisites
- Install SDK in a virtual environment:

```bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
```

- Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`.
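The key lookup described above can be sketched in Python as follows. This is a minimal sketch, not the skill's own loader: the function name `resolve_api_key` is hypothetical, and it assumes the credentials file is INI-style with `dashscope_api_key` stored under some section, which the source does not confirm.

```python
import configparser
import os
from pathlib import Path
from typing import Optional


def resolve_api_key(credentials_path: Optional[Path] = None) -> Optional[str]:
    """Return the API key from the environment, else from the credentials file."""
    # The environment variable is checked first.
    key = os.environ.get("DASHSCOPE_API_KEY")
    if key and key.strip():
        return key.strip()

    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    if not path.is_file():
        return None

    # Assumption: the file is INI-style. Interpolation is disabled so that
    # keys containing '%' are read verbatim.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        return None

    # Look in the DEFAULT section first, then in every named section.
    for section in [parser.default_section, *parser.sections()]:
        value = parser.get(section, "dashscope_api_key", fallback=None)
        if value and value.strip():
            return value.strip()
    return None
```

Returning `None` instead of raising lets the caller decide how to report a missing key.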
Normalized interface (tts.voice_design)
Request
- `voice_prompt` (string, required) target voice description
- `text` (string, required)
- `stream` (bool, optional)
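The request fields above could be modeled as a small validated dataclass. This is an illustrative sketch only: the class name `VoiceDesignRequest` is hypothetical, and the `False` default for `stream` is an assumption, since the source only says the field is optional.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class VoiceDesignRequest:
    """Normalized tts.voice_design request (hypothetical helper type)."""

    voice_prompt: str      # required: target voice description
    text: str              # required: text to synthesize
    stream: bool = False   # optional; the False default is an assumption

    def __post_init__(self) -> None:
        # Reject missing or blank required strings early.
        for name in ("voice_prompt", "text"):
            value = getattr(self, name)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"{name} is required and must be a non-empty string")
        if not isinstance(self.stream, bool):
            raise ValueError("stream must be a bool")

    def to_json(self) -> str:
        """Serialize the request as a JSON object with the three documented fields."""
        return json.dumps(asdict(self), ensure_ascii=False)
```

For example, `VoiceDesignRequest(voice_prompt="A warm female host voice", text="This is a voice-design demo").to_json()` yields a JSON object with `voice_prompt`, `text`, and `stream`.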
Response
- `audio_url` (string) or streaming PCM chunks
- `voice_id` (string)
- `request_id` (string)
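A response check against the fields listed above might look like the sketch below. The function name `validate_response` is hypothetical, and it assumes that `audio_url` is absent when audio arrives as streaming PCM chunks, which the source implies but does not state outright.

```python
from typing import Any, Dict, List


def validate_response(payload: Dict[str, Any], streamed: bool = False) -> List[str]:
    """Return a list of schema problems; an empty list means the payload passes."""
    problems: List[str] = []
    required = ["voice_id", "request_id"]
    # Assumption: audio_url is only expected for non-streaming responses.
    if not streamed:
        required.insert(0, "audio_url")
    for field in required:
        value = payload.get(field)
        if not isinstance(value, str) or not value:
            problems.append(f"{field} must be a non-empty string")
    return problems
```

Collecting every problem, rather than stopping at the first, makes the result easier to log in an evidence file.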
Operational guidance
- Write voice prompts with tone, pace, emotion, and timbre constraints.
- Build a reusable voice prompt library for product consistency.
- Validate generated voice in short utterances before long scripts.
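One way to keep prompts consistent is to store the four constraints as structured data and render the prompt string from them. The sketch below is illustrative: `VoiceProfile`, `VOICE_LIBRARY`, and the prompt wording are hypothetical, and the models do not require any particular prompt template.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceProfile:
    """One reusable entry in a voice prompt library (hypothetical structure)."""

    timbre: str
    tone: str
    pace: str
    emotion: str

    def to_prompt(self) -> str:
        # Render all four constraints in a fixed order so prompts stay comparable.
        return f"{self.timbre}, {self.tone} tone, {self.pace} pace, {self.emotion} emotion"


# A small library keyed by a stable name; the entry is an example only.
VOICE_LIBRARY = {
    "host_warm": VoiceProfile(
        timbre="A warm female host voice",
        tone="clear and friendly",
        pace="medium",
        emotion="calm",
    ),
}

print(VOICE_LIBRARY["host_warm"].to_prompt())
```

Keeping the library in version control makes it possible to trace which prompt produced a given `voice_id`.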
Local helper script
Prepare a normalized request JSON and validate response schema:
```bash
.venv/bin/python skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"
```
Output location
- Default output: `output/ai-audio-tts-voice-design/audio/`
- Override base dir with `OUTPUT_DIR`.
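The directory resolution could be sketched as below. The function name `resolve_audio_dir` is hypothetical, and it assumes that `OUTPUT_DIR` is an environment variable that replaces only the leading `output` component of the default path; the helper script may interpret it differently.

```python
import os
from pathlib import Path


def resolve_audio_dir() -> Path:
    """Return the audio output directory, creating it if needed."""
    # Assumption: OUTPUT_DIR replaces the leading "output" path component.
    base = Path(os.environ.get("OUTPUT_DIR") or "output")
    audio_dir = base / "ai-audio-tts-voice-design" / "audio"
    audio_dir.mkdir(parents=True, exist_ok=True)
    return audio_dir
```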
Validation
```bash
mkdir -p output/aliyun-qwen-tts-voice-design
for f in skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-qwen-tts-voice-design/validate.txt
```

Pass criteria: command exits 0 and `output/aliyun-qwen-tts-voice-design/validate.txt` is generated.
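The same check can be run from Python with the standard `py_compile` module. This sketch is a stricter variant, not a drop-in equivalent: the function name `validate_scripts` is hypothetical, and unlike the shell loop it writes the marker file only when every script compiles.

```python
import py_compile
import sys
from pathlib import Path


def validate_scripts(scripts_dir: Path, marker: Path) -> bool:
    """Byte-compile every *.py file; write the marker only if all of them compile."""
    ok = True
    for script in sorted(scripts_dir.glob("*.py")):
        try:
            # doraise=True turns syntax errors into PyCompileError.
            py_compile.compile(str(script), doraise=True)
        except py_compile.PyCompileError as exc:
            print(exc.msg, file=sys.stderr)
            ok = False
    if ok:
        marker.parent.mkdir(parents=True, exist_ok=True)
        marker.write_text("py_compile_ok\n", encoding="utf-8")
    return ok
```

A caller would pass the `scripts` directory and the `validate.txt` path used in the shell version.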
Output And Evidence
- Save artifacts, command outputs, and API response summaries under `output/aliyun-qwen-tts-voice-design/`.
- Include key parameters (region/resource id/time range) in evidence files for reproducibility.
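An evidence file that records the key parameters next to a response summary could be written as in the sketch below. The function name `write_evidence` and the JSON layout (`recorded_at`, `parameters`, `response_summary`) are hypothetical; the source does not prescribe a file format.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Any, Dict


def write_evidence(out_dir: Path, name: str,
                   params: Dict[str, Any], summary: Dict[str, Any]) -> Path:
    """Write one JSON evidence record and return its path."""
    out_dir.mkdir(parents=True, exist_ok=True)
    record = {
        # A UTC timestamp makes runs comparable across machines.
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "parameters": params,          # e.g. region, resource id, time range
        "response_summary": summary,   # e.g. request_id, voice_id
    }
    path = out_dir / f"{name}.json"
    path.write_text(json.dumps(record, ensure_ascii=False, indent=2), encoding="utf-8")
    return path
```

A call would pass `Path("output/aliyun-qwen-tts-voice-design")` as `out_dir`, with the request parameters and the returned `request_id` and `voice_id` as the two dictionaries.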
Workflow
- Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
- Run one minimal read-only query first to verify connectivity and permissions.
- Execute the target operation with explicit parameters and bounded scope.
- Verify results and save output/evidence files.
References
references/sources.md