Search Results: speech-to-text

Found 70 Skills

telnyx-voice-streaming-javascript

Stream call audio in real time, fork media to external destinations, and transcribe speech live. Use for real-time analytics and AI integrations. This skill provides JavaScript SDK examples.

🇺🇸 | English | Translated

Tools & Utilities | yzfly/douyin-mcp-server

douyin-video

Download Douyin videos without watermarks and extract their transcripts. Retrieves watermark-free download links from Douyin share links, downloads the videos, extracts voice transcripts, and automatically saves them to files. Use for obtaining Douyin video information, downloading watermark-free videos, and batch-extracting video transcripts. Triggered when users need to process Douyin video links or extract video content.

🇨🇳 | Chinese | Translated

1 script | Attention

AI & Machine Learning | agentiveau/myagentive

deepgram-transcription

Transcribe audio and video files using the Deepgram API. This skill should be used when the user requests transcription of audio files (mp3, wav, m4a, aac) or video files (mp4, mov, avi, etc.). Handles large video files by extracting audio first to reduce upload size and processing time.

🇺🇸 | English | Translated

1 script | Checked

Frontend Development | syncfusion/react-ui-compo...

syncfusion-react-speech-to-text

Implement the Syncfusion React SpeechToText component. Use this skill to convert speech to text, manage microphone input, control listening states, process speech events, customize UI, support accessible voice-enabled forms, and handle globalization and security in React applications.

🇺🇸 | English | Translated

Backend Development | syncfusion/aspnetcore-ui-...

syncfusion-aspnetcore-speech-to-text

Implement the Syncfusion ASP.NET Core SpeechToText control for converting spoken words to text using the Web Speech API. Use this skill when implementing speech recognition with Razor Tag Helpers, converting voice to text in ASP.NET Core applications, handling microphone input, processing speech events, customizing button appearance, managing listening states, or building accessible voice-enabled forms. Covers setup, speech recognition features, Razor Tag Helper syntax, events, methods, globalization, and security.

🇺🇸 | English | Translated

AI & Machine Learning | cnemri/google-genai-skill...

speech-use

Generate speech (TTS), transcribe audio (STT), and clone voices using Google's GenAI and Cloud Speech SDKs. Supports Gemini-TTS, Chirp 3, and Instant Custom Voice.

🇺🇸 | English | Translated

3 scripts | Checked

AI & Machine Learning | team-telnyx/skills

telnyx-ai-inference-python

Access Telnyx LLM inference APIs, embeddings, and AI analytics for call insights and summaries. This skill provides Python SDK examples.

🇺🇸 | English | Translated

AI & Machine Learning | deepgram/skills

examples

Find working Deepgram integration examples with third-party platforms and frameworks. Use whenever someone wants to integrate Deepgram with Twilio, LiveKit, LangChain, Vercel AI SDK, Discord, Vonage, Pipecat, Expo, FastAPI, Cloudflare Workers, Slack, Telegram, LlamaIndex, Zoom, Next.js, Nuxt, Django, SvelteKit, NestJS, Spring Boot, CrewAI, Riverside, SignalWire, and more. Examples are full runnable integration demos, not minimal feature snippets.

🇺🇸 | English | Translated

AI & Machine Learning | membranedev/application-s...

assemblyai

AssemblyAI integration. Manage Transcripts, Speakers, Jobs. Use when the user wants to interact with AssemblyAI data.

🇺🇸 | English | Translated

AI & Machine Learning | team-telnyx/skills

telnyx-ai-inference-curl

Access Telnyx LLM inference APIs, embeddings, and AI analytics for call insights and summaries. This skill provides REST API (curl) examples.

🇺🇸 | English | Translated

AI & Machine Learning | postplusai/postplus-skill...

video-transcription

Transcribe video files directly into timed transcripts and subtitle-ready artifacts using hosted Whisper video-to-text. Use this when the input is a video and the goal is speech extraction, caption generation, or edit-prep timing.

🇺🇸 | English | Translated

12 scripts | Attention

AI & Machine Learning | deepgram/skills

recipes

Find focused, runnable Deepgram recipes for a specific feature × language. Use whenever someone wants a minimal working code snippet for ONE feature (transcribe URL, diarize, smart-format, voice agent connect, etc.) rather than a full starter app. Recipes are under 50 lines, read DEEPGRAM_API_KEY from env, and ship with a runnable example_test. Covers Python, JavaScript, Go, .NET, Java, Rust, and the Deepgram CLI.

🇺🇸 | English | Translated