Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices and built-in long-text chunking), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来" ("read this passage aloud"), "做个配音" ("make a voice-over"), "合成语音" ("synthesize speech"), or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider: pick one from the EXTEND.md defaults or the available env keys.
npx skill4agent add iamzhihuix/happy-claude-skills happy-audio-gen

# Shortest path: OpenAI default voice
bun scripts/main.ts --text "Hello, world" --out ./hello.mp3
# Chinese, MiniMax
bun scripts/main.ts --provider minimax --text "大家好" --voice male-qn-qingse --out ./hello.mp3
# Long-form, Bailian (auto-splits by sentence)
bun scripts/main.ts --provider bailian --textfiles ./script.md --out ./narration.mp3

EXTEND.md locations:
./.happy-skills/happy-audio-gen/EXTEND.md
$XDG_CONFIG_HOME/happy-skills/happy-audio-gen/EXTEND.md
~/.happy-skills/happy-audio-gen/EXTEND.md

Setup:
bun scripts/main.ts --setup (see references/config/first-time-setup.md)
npx -y bun

Provider selection:
--provider <id>
default_provider
openai > elevenlabs > bailian > minimax > siliconflow > playht
Provider ids: openai, elevenlabs, bailian, minimax, siliconflow, playht

Options:
--text
--textfiles
--out <path> (.mp3, .wav, .ogg, .flac)
--voice <id> (references/voices.md)
--rate 0.5..2.0
--instruction "..." (openai, siliconflow)
--language <code> (en, zh, ja)

bun scripts/main.ts \
--provider openai \
--model gpt-4o-mini-tts \
--voice alloy \
--text "..." \
--out ./out.mp3

{ "success": true, "provider": "openai", "model": "gpt-4o-mini-tts", "voice": "alloy", "output": "/abs/out.mp3", "size_bytes": 76032, "format": "mp3" }

happy-audio-gen

Errors:
[openai] OpenAI TTS 400 / invalid voice: alloy, ash, coral, echo, fable, onyx, nova, sage, shimmer
[minimax] ... 2049 invalid api key: MINIMAX_BASE_URL=https://api.minimaxi.com/v1
[bailian] ... 400 DataInspectionFailed
[elevenlabs] 401

References:
references/providers.md
references/voices.md
references/error_codes.md
references/config/first-time-setup.md
references/config/extend-schema.md
assets/EXTEND.template.md
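
The provider priority order given earlier (openai > elevenlabs > bailian > minimax > siliconflow > playht) can be illustrated with a small sketch. This is not the skill's actual code: the function name `resolveProvider`, the environment variable names in `ENV_KEY`, and the precedence of `--provider` over `default_provider` over key detection are all assumptions made for illustration.

```typescript
// Priority order exactly as listed in this document.
const PROVIDER_PRIORITY = [
  "openai",
  "elevenlabs",
  "bailian",
  "minimax",
  "siliconflow",
  "playht",
] as const;

type Provider = (typeof PROVIDER_PRIORITY)[number];

// ASSUMPTION: hypothetical env var names; the skill's real names are not
// stated in this document and may differ.
const ENV_KEY: Record<Provider, string> = {
  openai: "OPENAI_API_KEY",
  elevenlabs: "ELEVENLABS_API_KEY",
  bailian: "DASHSCOPE_API_KEY",
  minimax: "MINIMAX_API_KEY",
  siliconflow: "SILICONFLOW_API_KEY",
  playht: "PLAYHT_API_KEY",
};

// Narrow an arbitrary string to a known provider id.
function isProvider(id: string | undefined): id is Provider {
  return id !== undefined && (PROVIDER_PRIORITY as readonly string[]).includes(id);
}

// ASSUMPTION: an explicit --provider wins, then default_provider from
// EXTEND.md, then the first provider in priority order with a key set.
function resolveProvider(
  cliProvider: string | undefined,
  defaultProvider: string | undefined,
  env: Record<string, string | undefined>,
): Provider | undefined {
  if (cliProvider !== undefined) {
    if (!isProvider(cliProvider)) {
      throw new Error(`unknown provider: ${cliProvider}`);
    }
    return cliProvider;
  }
  if (isProvider(defaultProvider)) return defaultProvider;
  return PROVIDER_PRIORITY.find((p) => Boolean(env[ENV_KEY[p]]));
}
```

In a Bun or Node script this could be called as `resolveProvider(undefined, undefined, process.env)`; an `undefined` result means no provider could be chosen, at which point running the setup command would be the natural next step.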