alicloud-ai-audio-tts-voice-clone

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese
Category: provider
类别:服务提供商

Model Studio Qwen TTS Voice Clone

Model Studio Qwen TTS 语音克隆

Use voice cloning models to replicate timbre from enrollment audio samples.
使用语音克隆模型从注册音频样本中复刻音色。

Critical model names

关键模型名称

Use one of these exact model strings:
  • qwen3-tts-vc-2026-01-22
  • qwen3-tts-vc-realtime-2026-01-15
请使用以下精确的模型字符串之一:
  • qwen3-tts-vc-2026-01-22
  • qwen3-tts-vc-realtime-2026-01-15

Prerequisites

前置条件

  • Install SDK in a virtual environment:
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set
    DASHSCOPE_API_KEY
    in your environment, or add
    dashscope_api_key
    to
    ~/.alibabacloud/credentials
    .
  • 在虚拟环境中安装SDK:
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • 在环境变量中设置
    DASHSCOPE_API_KEY
    ,或在
    ~/.alibabacloud/credentials
    中添加
    dashscope_api_key

Normalized interface (tts.voice_clone)

标准化接口(tts.voice_clone)

Request

请求参数

  • text
    (string, required)
  • voice_sample
    (string | bytes, required) enrollment sample
  • voice_name
    (string, optional)
  • stream
    (bool, optional)
  • text
    (字符串,必填)
  • voice_sample
    (字符串 | 字节,必填):注册样本
  • voice_name
    (字符串,可选)
  • stream
    (布尔值,可选)

Response

响应内容

  • audio_url
    (string) or streaming PCM chunks
  • voice_id
    (string)
  • request_id
    (string)
  • audio_url
    (字符串)或流式PCM数据块
  • voice_id
    (字符串)
  • request_id
    (字符串)

Operational guidance

操作指南

  • Use clean speech samples with low background noise.
  • Respect consent and policy requirements for cloned voices.
  • Persist generated
    voice_id
    and reuse for future synthesis requests.
  • 使用低背景噪音的清晰语音样本。
  • 遵守克隆语音的相关许可和政策要求。
  • 保存生成的
    voice_id
    ,以便在后续合成请求中复用。

Local helper script

本地辅助脚本

Prepare a normalized request JSON and validate response schema:
bash
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "欢迎来到语音复刻演示" \
  --voice-sample "https://example.com/voice-sample.wav"
准备标准化的请求JSON并验证响应格式:
bash
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "欢迎来到语音复刻演示" \
  --voice-sample "https://example.com/voice-sample.wav"

Output location

输出位置

  • Default output:
    output/ai-audio-tts-voice-clone/audio/
  • Override base dir with
    OUTPUT_DIR
    .
  • 默认输出路径:
    output/ai-audio-tts-voice-clone/audio/
  • 可通过
    OUTPUT_DIR
    覆盖基础目录。

References

参考资料

  • references/sources.md
  • references/sources.md