Found 87 Skills
Use when the user has audio or video and wants a timestamped transcript (SRT) in the source language. Routes by source language: Chinese defaults to Volcano (豆包, Doubao) ASR; other languages (Spanish, English, Portuguese, French, Italian, Japanese, Korean, etc.) use the OpenAI Whisper API with word-level timestamps and self-assembled cues. Outputs SRT with punctuation-bounded cues capped in length for on-screen reading. Triggers: "转写" (transcribe), "转成字幕" (convert to subtitles), "做 SRT" (make an SRT), "transcribe", "make subtitles", "speech to text", "出字幕" (produce subtitles).
Extract watermark-free Douyin/TikTok videos and transcribe their audio content using AI speech recognition.
Transcribe audio files using Qwen ASR. Use when the user sends voice messages and wants them converted to text.
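The first skill above mentions assembling SRT cues from word-level timestamps, with cues bounded by punctuation and capped for on-screen reading. A minimal sketch of that idea follows. It is not the skill's actual implementation: the `Word` type, the `build_cues` and `to_srt` helpers, the punctuation set, and the 42-character cap are all assumptions made for illustration, and it assumes each word token already carries its trailing punctuation.

```python
from dataclasses import dataclass

# Hypothetical punctuation set that closes a cue (assumption, not from the skill).
PUNCT = (".", "!", "?", ",", ";", ":", "。", "！", "？", "，")


@dataclass
class Word:
    text: str     # word token, assumed to include trailing punctuation
    start: float  # start time in seconds
    end: float    # end time in seconds


def fmt_ts(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def build_cues(words, max_chars=42):
    """Group words into cues, closing a cue at punctuation or at the length cap."""
    cues, current = [], []
    for w in words:
        candidate = " ".join(x.text for x in current + [w])
        # Adding this word would exceed the cap: close the current cue first.
        if current and len(candidate) > max_chars:
            cues.append(current)
            current = []
        current.append(w)
        # A word ending in punctuation closes the cue.
        if w.text.endswith(PUNCT):
            cues.append(current)
            current = []
    if current:
        cues.append(current)
    return cues


def to_srt(cues):
    """Render cues as SRT text: index, time range, cue text, blank line."""
    blocks = []
    for i, cue in enumerate(cues, 1):
        text = " ".join(w.text for w in cue)
        blocks.append(f"{i}\n{fmt_ts(cue[0].start)} --> {fmt_ts(cue[-1].end)}\n{text}\n")
    return "\n".join(blocks)
```

For space-free scripts such as Chinese or Japanese, the `" ".join(...)` calls would likely need to become `"".join(...)`, and the character cap would probably be lower.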