Search Results: audio-processing

Found 48 Skills

Tools & Utilitiesdigitalsamba/claude-code-...

ffmpeg

Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.

🇺🇸|EnglishTranslated

AI & Machine Learningninehills/skills

video-reader

Read, watch, and listen to video/audio files. Use Gemini for native video understanding, or extract key frames + Whisper transcription as fallback. Use when a user sends a video/audio and asks about its content, what's in it, what someone said, etc.

🇺🇸|EnglishTranslated

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-asr-realtime-test

Minimal realtime ASR smoke test for Model Studio Qwen ASR Realtime.

🇺🇸|EnglishTranslated

AI & Machine Learningdeveloperpaxs/paxs-skills

paxs-api

Connect to PAXS AI platform to create meetings, upload recordings, and generate transcriptions and meeting notes. Use this skill when a user wants to transcribe audio, create meeting notes, or interact with the PAXS platform.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningrunwayml/skills

runwayml

Generate AI videos, images, and audio with Runway API. Use when generating video from images, text-to-video, video-to-video, character performance, text-to-image, text-to-speech, sound effects, or voice processing with Runway.

🇺🇸|EnglishTranslated

Tools & Utilitieswinsorllc/upgraded-carniv...

ffmpeg-tools

Production-grade FFmpeg video/audio processing. Convert, compress, trim, merge, resize, and extract audio from media files with progress tracking, comprehensive error handling, and safety limits.

🇺🇸|EnglishTranslated

4 scripts/Attention

AI & Machine Learningcinience/alicloud-skills

aliyun-cosyvoice-voice-clone

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningframersai/agentos-skills

google-cloud-tts

Text-to-speech synthesis via Google Cloud Text-to-Speech API — MP3 output, configurable language and voice, voice listing.

🇺🇸|EnglishTranslated

AI & Machine Learningtrpc-group/trpc-agent-go

whisper

Transcribe audio files to text using OpenAI Whisper

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-livetranslate

Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation, and speech-to-speech or speech-to-text translation flows.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningeachlabs/skills

audio-visualization

Generate audio visualization videos using each::sense AI. Create waveforms, spectrum analyzers, particle effects, 3D visualizations, and beat-synced animations from audio files.

🇺🇸|EnglishTranslated

Tools & Utilitiesrameerez/claude-code-star...

transcribe-video

Generate subtitles (SRT/VTT) and plain text transcripts from video or audio files using AWS Transcribe. Use when creating captions, extracting spoken content, generating transcripts for notes, or making video content searchable.

🇺🇸|EnglishTranslated