Audio generation skill — jingles, beds, voiceover, and sound effects. Routes music requests to Suno V5 / Udio / Lyria, speech to MiniMax TTS / FishAudio / ElevenLabs V3, and SFX to ElevenLabs SFX or AudioCraft. Output is one MP3/WAV file saved to the project folder.
npx skill4agent add nexu-io/open-design audio-jingle

| audioKind | Models we route to | Plan focus |
|---|---|---|
| music | Suno V5 (default), Udio, Lyria 2 | genre + tempo + instrumentation |
| speech | MiniMax TTS (default), FishAudio, ElevenLabs V3 | script + voice + pacing |
| sfx | ElevenLabs SFX (default), AudioCraft | texture + impact + duration |
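
The default routing can be sketched as a small lookup. This is only an illustration: the function name `default_model` is hypothetical, the kind names are taken from the `--audio-kind` values (`music`, `speech`, `sfx`), and the function prints the display names used on this page, which may not match the identifiers the CLI expects for `--model`.

```shell
# Hypothetical helper: map an audioKind to the default model named on this page.
# Prints a display name; the real --model identifier may differ.
default_model() {
  case "$1" in
    music)  echo "Suno V5" ;;
    speech) echo "MiniMax TTS" ;;
    sfx)    echo "ElevenLabs SFX" ;;
    *)      echo "unknown audioKind: $1" >&2; return 1 ;;
  esac
}
```

An unrecognized kind returns a non-zero status, so a caller can stop and ask for the kind instead of guessing.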
audio-jingle/
├── SKILL.md
└── example.html

Metadata fields: audioKind, audioModel, audioDuration, voice. If audioKind is "(unknown — ask)", ask before generating. For minimax-tts, --voice takes a voice_id (for example male-qn-qingse).

node "$OD_BIN" media generate \
--project "$OD_PROJECT_ID" \
--surface audio \
--audio-kind "<music|speech|sfx>" \
--model "<audioModel from metadata>" \
--duration <audioDuration seconds> \
[--voice "<provider voice id (speech only)>"] \
--output "<short-slug>-<duration>s.mp3" \
--prompt "<assembled prompt from Step 2; for speech, the literal script>"

{"file": {"name": "...", ...}}

--voice: voice_id, e.g. male-qn-qingse
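
The optional `--voice` flag and the `<short-slug>-<duration>s.mp3` naming can be sketched as a dry-run helper that prints the argument list instead of invoking the CLI. The function name `build_audio_args` and its parameter order are hypothetical; only the flags shown in the command above are used, and `--voice` is added for speech only.

```shell
# Hypothetical helper: print the arguments for `media generate`, one per line,
# without running anything (a dry run for inspection).
# Usage: build_audio_args <kind> <model> <duration> <slug> <prompt> [voice]
build_audio_args() {
  kind=$1 model=$2 duration=$3 slug=$4 prompt=$5 voice=${6:-}
  set -- media generate \
    --project "${OD_PROJECT_ID:-}" \
    --surface audio \
    --audio-kind "$kind" \
    --model "$model" \
    --duration "$duration"
  # --voice applies to speech only, and only when a voice id was supplied.
  if [ "$kind" = speech ] && [ -n "$voice" ]; then
    set -- "$@" --voice "$voice"
  fi
  # Output name follows the <short-slug>-<duration>s.mp3 pattern.
  set -- "$@" --output "${slug}-${duration}s.mp3" --prompt "$prompt"
  printf '%s\n' "$@"
}

# Example call (the model id and voice id here are assumed values):
# build_audio_args speech minimax-tts 15 promo-vo "Welcome to the show." male-qn-qingse
```

Printing one argument per line makes the list easy to review, but a prompt containing newlines would span several lines, so treat the output as a preview, not as input to another command.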