Search Results: speaker-diarization

Found 6 Skills

transcribe

Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningcat-xierluo/legal-skills

funasr-transcribe

Use local FunASR service to transcribe audio or video files into timestamped Markdown files, supporting common formats such as mp4, mov, mp3, wav, m4a, etc. This skill should be used when users need speech-to-text conversion, meeting minutes, video subtitles, or podcast transcription.

🇨🇳|ChineseTranslated

4 scripts/Attention

AI & Machine Learningframersai/agentos-skills

streaming-stt-deepgram

Real-time streaming speech-to-text via Deepgram WebSocket API — sub-300 ms latency, Nova-2 model, speaker diarization, auto-reconnect.

🇺🇸|EnglishTranslated

AI & Machine Learningagntswrm/agent-media

audio-transcribe

Transcribes audio to text with timestamps and optional speaker identification. Use when you need to convert speech to text, create subtitles, transcribe meetings, or process voice recordings.

🇺🇸|EnglishTranslated

AI & Machine Learningeachlabs/skills

subtitle-generation

Generate subtitles and captions for videos using each::sense AI. Create auto-generated subtitles, multi-language captions, animated TikTok-style text, SRT/VTT exports, speaker diarization, and burned-in subtitles.

🇺🇸|EnglishTranslated

AI & Machine Learningvanman2024/ai-dev-marketp...

stt-integration

ElevenLabs Speech-to-Text transcription workflows with Scribe v1 supporting 99 languages, speaker diarization, and Vercel AI SDK integration. Use when implementing audio transcription, building STT features, integrating speech-to-text, setting up Vercel AI SDK with ElevenLabs, or when user mentions transcription, STT, Scribe v1, audio-to-text, speaker diarization, or multi-language transcription.

🇺🇸|EnglishTranslated

5 scripts/Attention