Search Results: dio

Found 1,612 Skills

AI & Machine Learningnexu-io/open-design

venice-audio-music

Music generation queueing, retrieval, and completion endpoints via Venice.ai. Suited for jingles, background loops, and prototype scoring.

🇺🇸|EnglishTranslated

AI & Machine Learningjianshuo/claude-skills

wjs-transcribing-audio

Use when the user has audio or video and wants a timestamped transcript (SRT) in the source language. Routes by source language — Chinese defaults to Volcano (豆包) ASR; other languages (Spanish, English, Portuguese, French, Italian, Japanese, Korean, etc.) use OpenAI Whisper API with word-level timestamps and self-assembled cues. Outputs SRT with punctuation-bounded cues capped for on-screen reading. Triggers — "转写", "转成字幕", "做 SRT", "transcribe", "make subtitles", "speech to text", "出字幕".

🇺🇸|EnglishTranslated

AI & Machine Learningpostplusai/postplus-skill...

audio-generation

Control audio generation requests before execution. Use this when the user asks for TTS, persona voice, voice change, translated dub, cloned voice take, podcast audio, or lip-sync audio handoff and the skill must classify the request before handing execution to voice-batch-runner or a video workflow.

🇺🇸|EnglishTranslated

AI & Machine Learningframersai/agentos-skills

audio-generation

Music and sound effects generation — 8 providers with fallback chains, user-configurable preferences, local and cloud options.

🇺🇸|EnglishTranslated

Testing & QAsupabase/supabase

e2e-studio-tests

Run e2e tests in the Studio app. Use when asked to run e2e tests, run studio tests, playwright tests, or test the feature.

🇺🇸|EnglishTranslated

Frontend Developmentb-open-io/prompts

ui-audio-theme

Generate cohesive UI audio themes with subtle, minimal sound effects for applications. This skill should be used when users want to create a set of coordinated interface sounds for wallet apps, dashboards, or web applications - generating sounds mapped to UI interaction constants like button clicks, notifications, and navigation transitions using ElevenLabs API.

🇺🇸|EnglishTranslated

1 scripts/Checked

Tools & Utilitiesbitwize-music-studio/clau...

sheet-music-publisher

Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.

🇺🇸|EnglishTranslated

AI & Machine Learningoimiragieo/agent-studio

token-saver-context-compression

Search-aware context compression workflow for agent-studio. Use pnpm hybrid search + token-saver compression, then persist distilled learnings via MemoryRecord.

🇺🇸|EnglishTranslated

13 scripts/Attention

Testing & QAcinience/alicloud-skills

alicloud-ai-audio-tts-voice-design-test

Minimal voice design TTS smoke test for Model Studio Qwen TTS VD.

🇺🇸|EnglishTranslated

Data Processingdkyazzentwatwa/chatgpt-sk...

audio-normalizer

Use when asked to normalize audio volume, match loudness, or apply peak/RMS normalization to audio files.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-cosyvoice-voice-design

Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from a voice prompt plus preview text before using the returned voice_id in TTS.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningmaxgent-ai/maxgent-plugin

audio-transcribe

Speech-to-text transcription using Whisper with word-level timestamps. Use when users ask to transcribe audio or video to text, generate subtitles, or recognize speech.

🇺🇸|EnglishTranslated

1 scripts/Checked