Loading...
Loading...
Found 5 Skills
Convert audio/video to text using Whisper, with support for word-level timestamps. Use this when users need speech-to-text conversion, audio-to-text transcription, video-to-text extraction, subtitle generation, transcribe audio, speech to text, generate subtitles, or speech recognition.
Transcribe video files directly into timed transcripts and subtitle-ready artifacts using hosted Whisper video-to-text. Use this when the input is a video and the goal is speech extraction, caption generation, or edit-prep timing.
Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch processing (up to 100 videos), translation to 100+ languages, and multiple output formats. Use when working with YouTube videos, subtitles, captions, or video-to-text conversion.
Use when "youtube transcript", "extract subtitles", "video captions", "get transcript", "video to text"
Extract subtitles from YouTube video links and convert them into Chinese transcripts. Supports both auto-generated and manual subtitles. Usage scenarios: (1) Users provide a YouTube link and request subtitle/transcript extraction, (2) Users request to convert YouTube video content into text, (3) Users say "Help me convert this YouTube video to text" or similar requests. Trigger words: YouTube subtitles, video to text, extract subtitles, YouTube transcript, video text draft.