Found 29 Skills
Select and create the perfect AI voice for your content using ElevenLabs, Qwen3-TTS, and other platforms—matching voice characteristics to brand personality and audience. Use when: Choosing an AI voice for video narration; Creating a consistent brand voice across content; Cloning a voice for scalable production; Comparing voice synthesis platforms; Designing voice characteristics by description
MiMo V2.5 TTS text-to-speech. Generate speech using the Xiaomi MiMo V2.5 TTS series models. This skill is activated when text needs to be converted to speech, voice messages need to be sent, content needs to be read aloud, or users ask to 'speak it out' or give a 'voice reply'. It supports three modes—preset voice, voice design, and voice cloning—as well as natural-language control and a director mode. Style tags control tone, emotion, and dialect, and preset voices support singing.
Generate and manage persona-aware voice assets for short-form video production, including voice design, script-specific audio takes, and future reusable voice identities. Use this when persona registries and scripts already exist and you need local audio assets, voice manifests, and reviewable voice iterations without losing continuity across many videos.
Generate videos using Flyworks (a.k.a. HiFly) Digital Humans. Create talking-photo videos from images, use public avatars with TTS, or clone voices for custom audio.
Generate character voices using TTS, voice cloning, and lip-sync tools. Supports Chatterbox, F5-TTS, TTS Audio Suite, RVC, and ElevenLabs. Use when creating speech audio for characters or syncing audio to video.
Build text-to-speech applications using Qwen3-TTS, a powerful speech generation system supporting voice clone, voice design, and custom voice synthesis. Use when creating TTS applications, generating speech from text, cloning voices from audio samples, designing new voices via natural language descriptions, or fine-tuning TTS models. Supports 10 languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian).
Expert in voice synthesis, TTS, voice cloning, podcast production, speech processing, and voice UI design via ElevenLabs integration. Specializes in vocal clarity, loudness standards (LUFS), de-essing, dialogue mixing, and voice transformation. Activate on 'TTS', 'text-to-speech', 'voice clone', 'voice synthesis', 'ElevenLabs', 'podcast', 'voice recording', 'speech-to-speech', 'voice UI', 'audiobook', 'dialogue'. NOT for spatial audio (use sound-engineer), music production (use DAW tools), game audio middleware (use sound-engineer), sound effects generation (use sound-engineer with ElevenLabs SFX), or live concert audio.
Vox single-entry voice orchestration skill. Handles environment checks, CLI installation, on-demand model downloads, ASR transcription, voice cloning, pipeline execution, and task troubleshooting through natural language. Use when users describe only the goal without providing specific commands.
Text-to-speech conversion using the GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) the user requests audio/voice output with the "tts" trigger or keyword; (2) content needs to be spoken rather than read (multitasking, accessibility, podcast, driving, cooking); (3) pre-cloned voices should be used for speech.
Minimal voice cloning TTS smoke test for Model Studio Qwen TTS VC.
Convert text into speech with Kokoro or Noiz, including simple and timeline-aligned modes.
Expert skill for Voicebox, the open-source local voice cloning and TTS studio built with Tauri, React, and FastAPI.