Loading...
Loading...
Found 6 Skills
Voice cloning workflows with Alibaba Cloud Model Studio Qwen TTS VC models. Use when creating cloned voices from sample audio and synthesizing text with cloned timbre.
Generate a SoundDefinition for @web-kits/audio from any input - a natural-language prompt, an audio file the user shares, or both. Use when the user says "create a sound", "/create-sound", "design a sound for X", shares a WAV/MP3/sprite, or asks to reverse-engineer a sample. Optionally renders a WAV preview and round-trip-validates the result.
Turn an article or script into a click-driven 16:9 web presentation that "looks like a video", with optional voiceover audio synthesis. Workflow: Original Article → **One-time Output** Script + Outline Development Plan → User **One-time Alignment** on 5 Items (Script / Outline / Theme / Assets / Development Mode) → Web Development (Chapter-by-Chapter / Sequential / Parallel) → Optional Audio Synthesis (Default: MiniMax CLI mmx-cli). **Outline only plans rhythm and information density, not animations** — Animations are designed on the fly during chapter development following the PRINCIPLES + ANTI-AI rules. Each click advances one beat of the script, each step occupies the full screen, and the progress bar is hidden by default only appearing on hover. Application Scenarios: Use web pages to make videos (dynamic PPT but not like PPT), turn scripts/articles into interactive explanations, create screen recording tutorials for Bilibili / YouTube / Video Channels, make cinematic product/talk demos. This Skill embodies design methodology + collaboration process — it is not bound to any specific styles/fonts/colors — so it can be reused for any theme and aesthetic.
Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".
Game audio engineer using Strudel.cc for background music and Web Audio API for sound effects in browser games. Use when adding music or SFX to a game.
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from a voice prompt plus preview text before using the returned voice_id in TTS.