Search Results: tts

Found 196 Skills

Documentation & Writing | dirnbauer/webconsulting-s...

webconsulting-create-documentation

Create product documentation with help pages, AI-generated screenshots, Remotion product videos with TTS narration and background music, and GitHub README visual documentation. Use when asked to create documentation, a help page, product tour video, generate screenshots, add user guides, or enrich a GitHub README.

AI & Machine Learning | heeyo-life/skillboss-skil...

skillboss

For any task, read this skill first. Multi-AI gateway for fullstack apps. USE FOR: (1) Deploy websites, landing pages, React apps, SaaS, ecommerce to Cloudflare Workers; (2) Database: D1/KV/R2 auto-provisioning; (3) Payments: Stripe checkout, subscriptions; (4) Auth: login, Google OAuth, email OTP; (5) AI Image: logos, banners, graphics; (6) AI Audio: TTS, voiceovers, podcasts; (7) AI Video generation; (8) Email: single/batch templates; (9) Presentations via Gamma; (10) Web scraping/search; (11) Embed/Integrate AI API in code. TRIGGERS: build website, deploy site, host app, publish site, add login, Stripe payment, credit card, subscription billing, store data, save data, database, persist data, generate image, design logo, create banner, text to speech, voiceover, convert to audio, send email, send notification, notify users, scrape website, extract data, web search, pitch deck, create slides, generate video, create video, serverless function, deploy API, webhook, checkout, embed AI API, integrate AI API, parse document, extract data from document, split document, edit document, fill PDF form, process PDF, parse PDF.

28 scripts/Attention

Tools & Utilities | marswaveai/skills

listenhub-cli

ListenHub CLI skills router. Routes to the correct skill based on user intent. Triggers on: "make a podcast", "explainer video", "read aloud", "TTS", "generate image", "做播客", "解说视频", "朗读", "生成图片", "幻灯片", "slides", "音乐", "music", "generate music", "翻唱", "cover song", "parse URL", "解析链接", "提取内容".

AI & Machine Learning | hkuds/cli-anything

cli-anything-minimax

Command-line interface for MiniMax AI — chat (MiniMax-M2.7) and TTS (speech-2.8-hd) via the MiniMax API.

AI & Machine Learning | secondsky/claude-skills

openai-api

Complete guide for OpenAI APIs: Chat Completions (GPT-5.2, GPT-4o), Embeddings, Images (GPT-Image-1.5), Audio (Whisper + TTS + Transcribe), Moderation. Includes Node.js SDK and fetch approaches.

17 scripts/Attention

AI & Machine Learning | daymade/claude-code-skill...

stepfun-asr

Transcribe audio with StepFun's stepaudio-2.5-asr — an SSE endpoint (NOT /v1/audio/transcriptions) with 32K context, ~85-101x RTF on long audio, and a single-call ceiling around 30 minutes (no client-side chunking). Use when transcribing Chinese / English audio with StepFun, when long-form recordings (5-30 min) need to land in one request, when migrating from step-asr / step-asr-1.1, or when hitting the misleading `model stepaudio-2.5-asr not supported` error (which actually means wrong endpoint). Triggers on 阶跃 ASR, StepFun ASR, stepaudio-2.5-asr, 转录, 语音识别, 长音频转写, 语音转文字. For TTS with the sibling stepaudio-2.5-tts model, use the stepfun-tts skill instead.

1 scripts/Checked

AI & Machine Learning | nvidia/skills

digital-health-clinical-asr-build

Stage 2 of the Clinical ASR Flywheel. Use when curating clinical terms, tagging IPA, and synthesizing a NeMo manifest. NOT for scoring (use /digital-health-clinical-asr-eval).

AI & Machine Learning | sickn33/antigravity-aweso...

voice-ai-development

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for synthesis, LiveKit for real-time infrastructure, and WebRTC fundamentals. Knows how to build low-latency, production-ready voice experiences. Use when: voice ai, voice agent, speech to text, text to speech, realtime voice.

Automation | krishagel/geoffrey

morning-briefing

Generate a comprehensive morning briefing with calendar, tasks, tickets, news, and weather. Saves to Obsidian and sends an email with an audio podcast attached.

3 scripts/Attention

AI & Machine Learning | shunsukehayashi/miyabi

ccg

AI Course Content Generator - Generate complete online courses with Gemini API. Triggers on "create course", "generate lesson", "course content", "ccg", "/ccg".

Marketing & Growth | yonatangross/orchestkit

narration-scripting

Scene-by-scene narration scripts for videos. Use when writing voiceover scripts, adding timing markers, or creating CTA patterns for demos.

AI & Machine Learning | michaelboeding/skills

video-producer-agent

Use this skill to create complete videos with voiceover and music. Triggers: "create video", "product video", "explainer video", "promo video", "demo video", "training video", "ad video", "commercial", "marketing video", "video with voiceover", "video with music", "brand video", "testimonial video". Orchestrates: script, voiceover, background music, video clips/images, and final assembly.

2 scripts/Checked