Search Results: speech-to-text

Found 59 Skills

AI & Machine Learningcnemri/google-genai-skill...

speech-use

Generate (TTS), Transcribe (STT), and Clone voices using Google's GenAI and Cloud Speech SDKs. Supports Gemini-TTS, Chirp 3, and Instant Custom Voice.

🇺🇸|EnglishTranslated

3 scripts/Checked

AI & Machine Learningdokhacgiakhoa/antigravity...

voice-ai-engine-development

Architecting real-time Voice AI agents.

🇺🇸|EnglishTranslated

AI & Machine Learning958877748/skills

groq-stt

Transcribe audio files using Groq API (Whisper models). Use when user needs to transcribe audio to text.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningteam-telnyx/skills

telnyx-ai-inference-python

Access Telnyx LLM inference APIs, embeddings, and AI analytics for call insights and summaries. This skill provides Python SDK examples.

🇺🇸|EnglishTranslated

AI & Machine Learningpostplusai/postplus-skill...

video-transcription

Transcribe video files directly into timed transcripts and subtitle-ready artifacts using hosted Whisper video-to-text. Use this when the input is a video and the goal is speech extraction, caption generation, or edit-prep timing.

🇺🇸|EnglishTranslated

12 scripts/Attention

AI & Machine Learningcinience/alicloud-skills

alicloud-ai-audio-asr

Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningaahl/skills

qwen-asr

Transcribe audio files using Qwen ASR. Use when the user sends voice messages and wants them converted to text.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningteam-telnyx/skills

telnyx-ai-inference-curl

Access Telnyx LLM inference APIs, embeddings, and AI analytics for call insights and summaries. This skill provides REST API (curl) examples.

🇺🇸|EnglishTranslated

AI & Machine Learningjeremylongshore/claude-co...

deepgram-install-auth

Install and configure Deepgram SDK/CLI authentication. Use when setting up a new Deepgram integration, configuring API keys, or initializing Deepgram in your project. Trigger with phrases like "install deepgram", "setup deepgram", "deepgram auth", "configure deepgram API key".

🇺🇸|EnglishTranslated

Tools & Utilitieshyperpuncher/dotagents

chough

Fast ASR CLI tool for transcribing audio/video files. Use when user wants to transcribe audio/video, generate subtitles (VTT), convert speech to text with timestamps (JSON), or optimize transcription for low memory.

🇺🇸|EnglishTranslated

AI & Machine Learningmembranedev/application-s...

assemblyai

AssemblyAI integration. Manage Transcripts, Speakers, Jobs. Use when the user wants to interact with AssemblyAI data.

🇺🇸|EnglishTranslated