aliyun-qwen-tts-voice-design

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Category: provider

分类：provider

Model Studio Qwen TTS Voice Design

模型工作室 Qwen TTS 音色设计

Use voice design models to create controllable synthetic voices from natural language descriptions.

使用音色设计模型通过自然语言描述创建可控的合成语音。

Critical model names

关键模型名称

Use one of these exact model strings:

```
qwen3-tts-vd-2026-01-26
```
```
qwen3-tts-vd-realtime-2026-01-15
```

请使用以下精确的模型字符串之一：

```
qwen3-tts-vd-2026-01-26
```
```
qwen3-tts-vd-realtime-2026-01-15
```

Prerequisites

前置条件

Install SDK in a virtual environment:

bash

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set

DASHSCOPE_API_KEY

in your environment, or add

dashscope_api_key

~/.alibabacloud/credentials

在虚拟环境中安装SDK：

bash

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

在你的环境中设置

DASHSCOPE_API_KEY

，或者将

dashscope_api_key

添加到

~/.alibabacloud/credentials

文件中。

Normalized interface (tts.voice_design)

标准化接口 (tts.voice_design)

Request

请求参数

```
voice_prompt
```
(string, required) target voice description
```
text
```
(string, required)
```
stream
```
(bool, optional)

```
voice_prompt
```
(字符串，必填) 目标音色描述
```
text
```
(字符串，必填)
```
stream
```
(布尔值，选填)

Response

响应参数

```
audio_url
```
(string) or streaming PCM chunks
```
voice_id
```
(string)
```
request_id
```
(string)

```
audio_url
```
(字符串) 或流式PCM数据块
```
voice_id
```
(字符串)
```
request_id
```
(字符串)

Operational guidance

操作指引

Write voice prompts with tone, pace, emotion, and timbre constraints.
Build a reusable voice prompt library for product consistency.
Validate generated voice in short utterances before long scripts.

编写音色提示词时包含语调、语速、情绪和音色约束。
构建可复用的音色提示词库以保证产品一致性。
在处理长脚本前先通过短句验证生成的语音效果。

Local helper script

本地辅助脚本

Prepare a normalized request JSON and validate response schema:

bash

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"

准备标准化请求JSON并验证响应格式：

bash

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"

Output location

输出位置

Default output:
```
output/ai-audio-tts-voice-design/audio/
```
Override base dir with
```
OUTPUT_DIR
```
.

默认输出路径：
```
output/ai-audio-tts-voice-design/audio/
```
可通过
```
OUTPUT_DIR
```
环境变量覆盖基础目录。

Validation

校验

bash

mkdir -p output/aliyun-qwen-tts-voice-design
for f in skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-qwen-tts-voice-design/validate.txt

Pass criteria: command exits 0 and

output/aliyun-qwen-tts-voice-design/validate.txt

is generated.

bash

mkdir -p output/aliyun-qwen-tts-voice-design
for f in skills/ai/audio/aliyun-qwen-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-qwen-tts-voice-design/validate.txt

通过标准：命令退出码为0且生成

output/aliyun-qwen-tts-voice-design/validate.txt

文件。

Output And Evidence

输出与凭证

Save artifacts, command outputs, and API response summaries under
```
output/aliyun-qwen-tts-voice-design/
```
.
Include key parameters (region/resource id/time range) in evidence files for reproducibility.

将产物、命令输出和API响应摘要保存在
```
output/aliyun-qwen-tts-voice-design/
```
目录下。
在凭证文件中包含关键参数（区域/资源ID/时间范围）以便复现结果。

Workflow

工作流程

Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
Run one minimal read-only query first to verify connectivity and permissions.
Execute the target operation with explicit parameters and bounded scope.
Verify results and save output/evidence files.

确认用户意图、区域、标识符，以及操作是只读还是会修改数据。
首先运行一个最小化的只读查询验证连通性和权限。
使用明确参数和有限范围执行目标操作。
验证结果并保存输出/凭证文件。

References

参考

```
references/sources.md
```

```
references/sources.md
```