Search Results: transcription

Found 103 Skills

AI & Machine Learninginfquest/vibe-ops-plugin

audio-transcribe

Convert audio/video to text using Whisper, with support for word-level timestamps. Use this when users need speech-to-text conversion, audio-to-text transcription, video-to-text extraction, subtitle generation, transcribe audio, speech to text, generate subtitles, or speech recognition.

🇨🇳|ChineseTranslated

1 scripts/Checked

AI & Machine Learningeachlabs/skills

eachlabs-voice-audio

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the user needs TTS, transcription, or voice conversion.

🇺🇸|EnglishTranslated

AI & Machine Learningsteipete/clawdis

openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningassemblyai/assemblyai-ski...

assemblyai

Use when implementing speech-to-text, audio transcription, real-time streaming STT, audio intelligence features, or voice AI using AssemblyAI APIs or SDKs. Use when user mentions AssemblyAI, voice agents, transcription, speaker diarization, PII redaction of audio, LLM Gateway for audio understanding, or applying LLMs to transcripts. Also use when building voice agents with LiveKit or Pipecat that need speech-to-text, or when the user is working with any audio/video processing pipeline that could benefit from transcription, even if they don't mention AssemblyAI by name.

🇺🇸|EnglishTranslated

AI & Machine Learningsickn33/antigravity-aweso...

voice-ai-engine-development

Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider support

🇺🇸|EnglishTranslated

5 scripts/Checked

AI & Machine Learningdaymade/claude-code-skill...

transcript-fixer

Corrects speech-to-text transcription errors in meeting notes, lectures, and interviews using dictionary rules and AI. Learns patterns to build personalized correction databases. Use when working with transcripts containing ASR/STT errors, homophones, or Chinese/English mixed content requiring cleanup.

🇺🇸|EnglishTranslated

51 scripts/Checked

Automationceeon/videocut-skills

videocut:字幕

Subtitle Generation and Burning. Volcengine Transcription → Dictionary Error Correction → Review → Burning. Trigger words: add subtitles, generate subtitles, subtitles

🇨🇳|ChineseTranslated

1 scripts/Attention

AI & Machine Learningagentiveau/myagentive

deepgram-transcription

Transcribe audio and video files using the Deepgram API. This skill should be used when the user requests transcription of audio files (mp3, wav, m4a, aac) or video files (mp4, mov, avi, etc.). Handles large video files by extracting audio first to reduce upload size and processing time.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningvm0-ai/vm0-skills

openai

OpenAI API via curl. Use this skill for GPT chat completions, DALL-E image generation, Whisper audio transcription, embeddings, and text-to-speech.

🇺🇸|EnglishTranslated

AI & Machine Learningcat-xierluo/legal-skills

funasr-transcribe

Use local FunASR service to transcribe audio or video files into timestamped Markdown files, supporting common formats such as mp4, mov, mp3, wav, m4a, etc. This skill should be used when users need speech-to-text conversion, meeting minutes, video subtitles, or podcast transcription.

🇨🇳|ChineseTranslated

4 scripts/Attention

Tools & Utilitiesheygen-com/skills

video-understand

Understand video content locally using ffmpeg frame extraction and Whisper transcription. No API keys needed. Use when: (1) Understanding what a video contains, (2) Transcribing video audio locally, (3) Extracting key frames for visual analysis, (4) Getting video content without API keys.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningelevenlabs/skills

voice-isolator

Remove background noise and isolate vocals/speech from audio using ElevenLabs Voice Isolator (audio isolation) API. Use when cleaning up noisy recordings, removing music or background ambience from dialogue, isolating speech from field recordings, preparing audio for transcription, extracting vocals, or any "denoise / clean up / isolate voice" task.

🇺🇸|EnglishTranslated