Search Results: audio-generation

Found 32 Skills

text-to-speech

Convert text to natural speech using Sarvam AI's Bulbul v3 model. Handles audio generation, voiceovers, and voice interfaces for 11 Indian languages with 30+ voices. Supports REST, HTTP streaming, WebSocket, and pronunciation dictionaries. Use when generating spoken audio from text.

🇺🇸|EnglishTranslated

AI & Machine Learningnoizai/skills

sound-fx

Use this skill whenever the user wants to generate sound effects, ambient audio, or short audio clips from a text description. Triggers include: any mention of 'sound effect', 'sfx', 'generate sound', 'make a sound', 'audio effect', 'ambient sound', 'foley', 'sound clip', 'noise', or requests to produce a specific sound (e.g. 'make a gunshot sound', 'generate thunder', 'create the sound of rain'). Also use when the user describes an action or scenario and wants the corresponding audio (e.g. 'someone getting spanked', 'a door slamming', 'cartoon boing'). Do NOT use for speech synthesis, music generation with melody/lyrics, or voice cloning.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningveniceai/skills

venice-audio-music

Async music / audio-track generation via Venice. Covers the /audio/quote + /audio/queue + /audio/retrieve + /audio/complete lifecycle, lyrics vs instrumental, voice selection, duration, language, speed, model capability probing, and webhook-free polling.

🇺🇸|EnglishTranslated

AI & Machine Learningboomsystel-code/openclaw-...

audio-cog

AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music generation, background music, sound design. Professional audio creation with AI.

🇺🇸|EnglishTranslated

AI & Machine Learningnoizai/skills

chat-with-anyone

Chat with any real person or fictional character in their own voice by automatically finding their speech online, extracting a clean reference sample, and generating audio replies. Use when the user says "我想跟xxx聊天", "你来扮演xxx跟我说话", "让xxx给我讲讲这篇文章", or similar.

🇺🇸|EnglishTranslated

AI & Machine Learningnexu-io/open-design

speech

Generate spoken audio from text using OpenAI's API with built-in voices. Useful for narrated explainers, lecture audio, and quick voiceover tracks.

🇺🇸|EnglishTranslated

AI & Machine Learningsteipete/clawdis

sag

ElevenLabs text-to-speech with mac-style say UX.

🇺🇸|EnglishTranslated

AI & Machine Learninginference-sh-skills/skill...

elevenlabs-sound-effects

Generate AI sound effects from text descriptions with ElevenLabs via inference.sh CLI. Capabilities: text-to-sound-effect, custom duration, royalty-free audio. Use for: video production, game audio, podcasts, films, presentations, social media. Triggers: sound effects, sfx, sound generation, ai sound effects, generate sound, foley, audio effects, sound design, text to sound, elevenlabs sound, eleven labs sfx, ambient sound, cinematic sound, game sound effects

🇺🇸|EnglishTranslated

AI & Machine Learningakrindev/google-studio-sk...

gemini-tts

Generate speech from text using Google Gemini TTS models via scripts/. Use for text-to-speech, audio generation, voice synthesis, multi-speaker conversations, and creating audio content. Supports multiple voices and streaming. Triggers on "text to speech", "TTS", "generate audio", "voice synthesis", "speak this text".

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningpexoai/pexo-skills

videoagent-audio-studio

Tired of juggling multiple audio APIs? This skill gives you one-command access to TTS, music generation, sound effects, and voice cloning. Use when you want to generate any audio without managing multiple API keys.

🇺🇸|EnglishTranslated

3 scripts/Attention

AI & Machine Learningchanjing-ai/chan-skills

chanjing-tts

Use Chanjing TTS API to convert text to speech

🇺🇸|EnglishTranslated

AI & Machine Learningfal-ai-community/skills

fal-video-edit

Edit existing videos using AI — remix style, edit content, upscale resolution, remove background, or add audio/sound effects. Use when the user requests "Edit video", "Remix video", "Upscale video", "Remove video background", "Add sound to video", "Video to audio".

🇺🇸|EnglishTranslated

2 scripts/Attention