alicloud-ai-audio-tts-voice-design

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Category: provider

分类：服务商

Model Studio Qwen TTS Voice Design

Model Studio Qwen TTS 语音设计

Use voice design models to create controllable synthetic voices from natural language descriptions.

使用语音设计模型，通过自然语言描述创建可控制的合成语音。

Critical model names

关键模型名称

Use one of these exact model strings:

```
qwen3-tts-vd-2026-01-26
```
```
qwen3-tts-vd-realtime-2025-12-16
```

请使用以下精确的模型字符串之一：

```
qwen3-tts-vd-2026-01-26
```
```
qwen3-tts-vd-realtime-2025-12-16
```

Prerequisites

前提条件

Install SDK in a virtual environment:

bash

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set

DASHSCOPE_API_KEY

in your environment, or add

dashscope_api_key

~/.alibabacloud/credentials

在虚拟环境中安装SDK：

bash

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

在环境变量中设置

DASHSCOPE_API_KEY

，或者将

dashscope_api_key

添加到

~/.alibabacloud/credentials

文件中。

Normalized interface (tts.voice_design)

标准化接口（tts.voice_design）

Request

请求参数

```
voice_prompt
```
(string, required) target voice description
```
text
```
(string, required)
```
stream
```
(bool, optional)

```
voice_prompt
```
（字符串，必填）：目标语音描述
```
text
```
（字符串，必填）
```
stream
```
（布尔值，可选）

Response

响应结果

```
audio_url
```
(string) or streaming PCM chunks
```
voice_id
```
(string)
```
request_id
```
(string)

```
audio_url
```
（字符串）或流式PCM数据块
```
voice_id
```
（字符串）
```
request_id
```
（字符串）

Operational guidance

操作指南

Write voice prompts with tone, pace, emotion, and timbre constraints.
Build a reusable voice prompt library for product consistency.
Validate generated voice in short utterances before long scripts.

在语音提示中明确语气、语速、情感和音色要求。
构建可复用的语音提示库，确保产品语音一致性。
在处理长脚本前，先通过短句验证生成的语音效果。

Local helper script

本地辅助脚本

Prepare a normalized request JSON and validate response schema:

bash

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "这是音色设计演示"

准备标准化的请求JSON并验证响应格式：

bash

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "这是音色设计演示"

Output location

输出位置

Default output:
```
output/ai-audio-tts-voice-design/audio/
```
Override base dir with
```
OUTPUT_DIR
```
.

默认输出路径：
```
output/ai-audio-tts-voice-design/audio/
```
可通过
```
OUTPUT_DIR
```
环境变量覆盖基础目录。

References

参考资料

```
references/sources.md
```

```
references/sources.md
```