Search Results: ai-audio

Found 10 Skills

music

Generate music using ElevenLabs Music API. Use when creating instrumental tracks, songs with lyrics, background music, jingles, or any AI-generated music composition. Supports prompt-based generation, composition plans for granular control, and detailed output with metadata.

🇺🇸|EnglishTranslated

AI & Machine Learningaffaan-m/everything-claud...

fal-ai-media

Unified media generation via fal.ai MCP — image, video, and audio. Covers text-to-image (Nano Banana), text/image-to-video (Seedance, Kling, Veo 3), text-to-speech (CSM-1B), and video-to-audio (ThinkSound). Use when the user wants to generate images, videos, or audio with AI.

🇺🇸|EnglishTranslated

AI & Machine Learningqianwen-ai/qianwen-ai

qianwen-audio-tts

[QianWen] Synthesize speech from text with Qwen TTS models. TRIGGER when: user wants to convert text to speech, create voiceovers, generate audio narration, read text aloud, build TTS applications, mentions speech synthesis/voice generation/audio output from text, or explicitly invokes this skill by name (e.g. use qianwen-audio-tts). DO NOT TRIGGER when: user wants speech recognition/ASR, text generation without audio, non-Qwen audio tasks.

🇺🇸|EnglishTranslated

4 scripts/Checked

AI & Machine Learningveniceai/skills

venice-audio-music

Async music / audio-track generation via Venice. Covers the /audio/quote + /audio/queue + /audio/retrieve + /audio/complete lifecycle, lyrics vs instrumental, voice selection, duration, language, speed, model capability probing, and webhook-free polling.

🇺🇸|EnglishTranslated

AI & Machine Learningaradotso/trending-skills

voicebox-voice-synthesis

Expert skill for Voicebox — the open-source local voice cloning and TTS studio built with Tauri, React, and FastAPI

🇺🇸|EnglishTranslated

AI & Machine Learningminimax-ai/skills

minimax-music-gen

Use when user wants to generate music, songs, or audio tracks. Triggers on phrases like "generate a song", "make music", "create a track", "写首歌", "生成音乐", "来一首歌", "帮我做首歌", "纯音乐", "cover", "唱一首", or any request involving music creation, song writing, lyrics generation, or audio production. Also triggers when user provides lyrics and wants them turned into a song, or describes a mood/scene and wants background music. Even casual requests like "给我来点音乐" or "I want a chill beat" should trigger this skill. Do NOT use for music playback of existing files, music theory questions, or music recommendation without generation.

🇺🇸|EnglishTranslated

6 scripts/Attention

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-asr-realtime-test

Minimal realtime ASR smoke test for Model Studio Qwen ASR Realtime.

🇺🇸|EnglishTranslated

AI & Machine Learninggooglecloudplatform/verte...

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP servers (Veo, Lyria, Gemini TTS, NanoBanana).

🇺🇸|EnglishTranslated

AI & Machine Learningframersai/agentos-skills

audio-generation

Music and sound effects generation — 8 providers with fallback chains, user-configurable preferences, local and cloud options.

🇺🇸|EnglishTranslated

AI & Machine Learningnexu-io/open-design

audio-jingle

Audio generation skill — jingles, beds, voiceover, and sound effects. Routes music requests to Suno V5 / Udio / Lyria, speech to MiniMax TTS / FishAudio / ElevenLabs V3, and SFX to ElevenLabs SFX or AudioCraft. Output is one MP3/WAV file saved to the project folder.

🇺🇸|EnglishTranslated