Search Results: voice-cloning

Found 34 Skills

ai-voice-cloning

AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capabilities: multiple voices, emotions, accents, long-form narration, conversation. Use for: voiceovers, audiobooks, podcasts, video narration, accessibility. Triggers: voice cloning, tts, text to speech, ai voice, voice generation, voice synthesis, voice over, narration, speech synthesis, ai narrator, elevenlabs alternative, natural voice, realistic speech, voice ai

🇺🇸|EnglishTranslated

829

AI & Machine Learningskills-shell/skills

ai-voice-cloning

AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion/non-verbal steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (22+ premium voices, 32 languages), Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capabilities: multiple voices, emotions, accents, long-form narration, conversation, voice transformation, delivery mode control, character voices. Use for: voiceovers, audiobooks, podcasts, video narration, accessibility, gaming NPCs, avatar audio, UGC. Triggers: voice cloning, tts, text to speech, ai voice, voice generation, voice synthesis, voice over, narration, speech synthesis, ai narrator, elevenlabs, eleven labs, natural voice, realistic speech, voice ai, voice changer, inworld, inworld tts, character voice, npc voice

🇺🇸|EnglishTranslated

AI & Machine Learningdigitalsamba/claude-code-...

elevenlabs

Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue, sound effects from descriptions, background music, soundtrack generation, voice cloning, or any audio synthesis task.

🇺🇸|EnglishTranslated

AI & Machine Learningnotedit/happy-skills

tts-skill

MiniMax TTS API - Text-to-Speech, Voice Cloning, Voice Design

🇨🇳|ChineseTranslated

1 scripts/Checked

AI & Machine Learningnoizai/skills

tts

Convert text into speech with Kokoro or Noiz, including simple and timeline-aligned modes.

🇺🇸|EnglishTranslated

4 scripts/Attention

AI & Machine Learningcatoncat/vox-cli

vox

Vox single-entry voice orchestration skill. Used to complete environment guarding, CLI installation, on-demand model download, ASR transcription, voice cloning, pipeline execution and task troubleshooting through natural language. It is used when users only describe the target without providing specific commands.

🇨🇳|ChineseTranslated

5 scripts/Attention

AI & Machine Learningcnemri/google-genai-skill...

speech-use

Generate (TTS), Transcribe (STT), and Clone voices using Google's GenAI and Cloud Speech SDKs. Supports Gemini-TTS, Chirp 3, and Instant Custom Voice.

🇺🇸|EnglishTranslated

3 scripts/Checked

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-tts-voice-clone

Voice cloning workflows with Alibaba Cloud Model Studio Qwen TTS VC models. Use when creating cloned voices from sample audio and synthesizing text with cloned timbre.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningnoizai/skills

characteristic-voice

Make generated speech feel companion-like with fillers, emotional tuning, and preset speaking styles.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningpika-labs/pika-plugins

podcast

Two-host podcast video for any URL or free-form topic — 1 minute, 4 acts × ~15s, native multi-shot dialogue, optional voice cloning for Host A. Use when the user asks to "make a podcast", "podcast about [thing]", "podcast review of [url]", "two-host explainer", "interview-style clip", "two people talking on camera", "I/me and X talk about Y", or "interview with [persona] about [topic]". Native audio is the deliverable; captions are skipped by default because podcast dialogue mistranscribes domain terms.

🇺🇸|EnglishTranslated

AI & Machine Learningnoizai/skills

chat-with-anyone

Chat with any real person or fictional character in their own voice by automatically finding their speech online, extracting a clean reference sample, and generating audio replies. Use when the user says "我想跟xxx聊天", "你来扮演xxx跟我说话", "让xxx给我讲讲这篇文章", or similar.

🇺🇸|EnglishTranslated

AI & Machine Learningaradotso/trending-skills

voicebox-voice-synthesis

Expert skill for Voicebox — the open-source local voice cloning and TTS studio built with Tauri, React, and FastAPI

🇺🇸|EnglishTranslated