Search Results: audio-processing

Found 24 Skills

ffmpeg-patterns

FFmpeg video and audio processing patterns. Use when transcoding video/audio, extracting clips, adding filters, merging media, creating thumbnails, or batch processing media files.

🇺🇸 | English | Translated

AI & Machine Learning | trpc-group/trpc-agent-go

whisper

Transcribe audio files to text using OpenAI Whisper

🇺🇸 | English | Translated

1 script | Checked

Tools & Utilities | wlzh/skills

text-to-speech

Text-to-speech tool based on Edge TTS. Supports script parsing, emotion tagging, and post-processing.

🇨🇳 | Chinese | Translated

1 script | Checked

Tools & Utilities | winsorllc/upgraded-carniv...

ffmpeg-tools

Production-grade FFmpeg video/audio processing. Convert, compress, trim, merge, resize, and extract audio from media files with progress tracking, comprehensive error handling, and safety limits.

🇺🇸 | English | Translated

4 scripts | Attention

Product & Design | benzema216/dreamina-claud...

music-to-storyboard

Generate a shot-by-shot storyboard, including camera movements, from music analysis.

🇺🇸 | English | Translated

AI & Machine Learning | ninehills/skills

video-reader

Read, watch, and listen to video/audio files. Use Gemini for native video understanding, or extract key frames + Whisper transcription as fallback. Use when a user sends a video/audio and asks about its content, what's in it, what someone said, etc.

🇺🇸 | English | Translated

AI & Machine Learning | developerpaxs/paxs-skills

paxs-api

Connect to the PAXS AI platform to create meetings, upload recordings, and generate transcriptions and meeting notes. Use this skill when a user wants to transcribe audio, create meeting notes, or interact with the PAXS platform.

🇺🇸 | English | Translated

1 script | Checked

Tools & Utilities | rameerez/claude-code-star...

transcribe-video

Generate subtitles (SRT/VTT) and plain text transcripts from video or audio files using AWS Transcribe. Use when creating captions, extracting spoken content, generating transcripts for notes, or making video content searchable.

🇺🇸 | English | Translated

AI & Machine Learning | runwayml/skills

runwayml

Generate AI videos, images, and audio with the Runway API. Use when generating video from images, text-to-video, video-to-video, character performance, text-to-image, text-to-speech, sound effects, or voice processing with Runway.

🇺🇸 | English | Translated

AI & Machine Learning | zainhas/togetherai-skills

together-audio

Text-to-speech (TTS) and speech-to-text (STT) via Together AI. TTS models include Orpheus, Kokoro, Cartesia Sonic, Rime, and MiniMax, with REST, streaming, and WebSocket support. STT models include Whisper and Voxtral. Use when users need voice synthesis, audio generation, speech recognition, transcription, or real-time voice applications.

🇺🇸 | English | Translated

2 scripts | Checked

AI & Machine Learning | chanjing-ai/chan-skills

chanjing-tts-voice-clone

Use the Chanjing TTS API to synthesize speech from text with a user-provided voice.

🇺🇸 | English | Translated

AI & Machine Learning | cinience/alicloud-skills

alicloud-ai-audio-livetranslate

Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation, and speech-to-speech or speech-to-text translation flows.

🇺🇸 | English | Translated

1 script | Checked