Found 934 Skills
Text-to-speech tool based on Edge TTS. Supports script parsing, emotion tagging, and post-processing.
Generate images with Google's Nano Banana Pro (Gemini 3 Pro Image). Use when generating AI images via Gemini API, creating professional visuals, or building image generation features. Triggers on Nano Banana Pro, Gemini 3 Pro Image, gemini-3-pro-image-preview, Google image generation.
Control Sonos speakers (discover/status/play/volume/group).
Plan, produce, and market a podcast. Use when the user says "podcast strategy", "start a podcast", "podcast marketing", "podcast growth", "podcast SEO", "show notes", "podcast monetization", "guest outreach", or asks about launching, growing, or promoting a podcast.
Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.
ElevenLabs text-to-speech with a macOS say-style UX.
Go programming expert for goroutines, channels, interfaces, modules, and concurrency patterns.
Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.
Use the Chanjing TTS API to synthesize speech from text with a user-provided voice.
Use the Chanjing TTS API to convert text to speech.
Create podcasts from topics, URLs, or text. Triggers on: "做播客", "podcast", "播客", "录一期节目", "chat about", "discuss", "debate", "dialogue", "make a podcast about".
Text-to-speech and voice narration. Triggers on: "朗读这段", "配音", "TTS", "语音合成", "text to speech", "read this aloud", "convert to speech", "voice narration", "read aloud".