alicloud-ai-audio-tts-voice-clone

Original🇺🇸 English
Translated
1 scriptsChecked / no sensitive code detected

Voice cloning workflows with Alibaba Cloud Model Studio Qwen TTS VC models. Use when creating cloned voices from sample audio and synthesizing text with cloned timbre.

4installs
Added on

NPX Install

npx skill4agent add cinience/alicloud-skills alicloud-ai-audio-tts-voice-clone

Tags

Translated version includes tags in frontmatter
Category: provider

Model Studio Qwen TTS Voice Clone

Use voice cloning models to replicate timbre from enrollment audio samples.

Critical model names

Use one of these exact model strings:
  • qwen3-tts-vc-2026-01-22
  • qwen3-tts-vc-realtime-2026-01-15

Prerequisites

  • Install SDK in a virtual environment:
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set
    DASHSCOPE_API_KEY
    in your environment, or add
    dashscope_api_key
    to
    ~/.alibabacloud/credentials
    .

Normalized interface (tts.voice_clone)

Request

  • text
    (string, required)
  • voice_sample
    (string | bytes, required) enrollment sample
  • voice_name
    (string, optional)
  • stream
    (bool, optional)

Response

  • audio_url
    (string) or streaming PCM chunks
  • voice_id
    (string)
  • request_id
    (string)

Operational guidance

  • Use clean speech samples with low background noise.
  • Respect consent and policy requirements for cloned voices.
  • Persist generated
    voice_id
    and reuse for future synthesis requests.

Local helper script

Prepare a normalized request JSON and validate response schema:
bash
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "欢迎来到语音复刻演示" \
  --voice-sample "https://example.com/voice-sample.wav"

Output location

  • Default output:
    output/ai-audio-tts-voice-clone/audio/
  • Override base dir with
    OUTPUT_DIR
    .

References

  • references/sources.md