text-to-speech
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseText-to-Speech
文本转语音
Convert text to natural speech via inference.sh CLI.
通过inference.sh CLI将文本转换为自然语音。
Quick Start
快速开始
bash
undefinedbash
undefinedInstall CLI
安装CLI
curl -fsSL https://cli.inference.sh | sh && infsh login
curl -fsSL https://cli.inference.sh | sh && infsh login
Generate speech
生成语音
infsh app run infsh/kokoro-tts --input '{"text": "Hello, welcome to our product demo."}'
undefinedinfsh app run infsh/kokoro-tts --input '{"text": "Hello, welcome to our product demo."}'
undefinedAvailable Models
可用模型
| Model | App ID | Best For |
|---|---|---|
| DIA TTS | | Conversational, expressive |
| Kokoro TTS | | Fast, natural |
| Chatterbox | | General purpose |
| Higgs Audio | | Emotional control |
| VibeVoice | | Podcasts, long-form |
| 模型 | 应用ID | 最佳适用场景 |
|---|---|---|
| DIA TTS | | 对话式、富有表现力 |
| Kokoro TTS | | 快速、自然 |
| Chatterbox | | 通用场景 |
| Higgs Audio | | 情绪控制 |
| VibeVoice | | 播客、长内容 |
Browse All Audio Apps
浏览所有音频应用
bash
infsh app list --category audiobash
infsh app list --category audioExamples
示例
Basic Text-to-Speech
基础文本转语音
bash
infsh app run infsh/kokoro-tts --input '{"text": "Welcome to our tutorial."}'bash
infsh app run infsh/kokoro-tts --input '{"text": "Welcome to our tutorial."}'Conversational TTS with DIA
使用DIA实现对话式TTS
bash
infsh app sample infsh/dia-tts --save input.jsonbash
infsh app sample infsh/dia-tts --save input.jsonEdit input.json:
编辑input.json:
{
{
"text": "Hey! How are you doing today? I'm really excited to share this with you.",
"text": "Hey! How are you doing today? I'm really excited to share this with you.",
"voice": "conversational"
"voice": "conversational"
}
}
infsh app run infsh/dia-tts --input input.json
undefinedinfsh app run infsh/dia-tts --input input.json
undefinedLong-form Audio (Podcasts)
长音频生成(播客)
bash
infsh app sample infsh/vibevoice --save input.jsonbash
infsh app sample infsh/vibevoice --save input.jsonEdit input.json with your podcast script
编辑input.json,填入你的播客脚本
infsh app run infsh/vibevoice --input input.json
undefinedinfsh app run infsh/vibevoice --input input.json
undefinedExpressive Speech with Higgs
使用Higgs生成富有表现力的语音
bash
infsh app sample infsh/higgs-audio --save input.jsonbash
infsh app sample infsh/higgs-audio --save input.json{
{
"text": "This is absolutely incredible!",
"text": "This is absolutely incredible!",
"emotion": "excited"
"emotion": "excited"
}
}
infsh app run infsh/higgs-audio --input input.json
undefinedinfsh app run infsh/higgs-audio --input input.json
undefinedUse Cases
适用场景
- Voiceovers: Product demos, explainer videos
- Audiobooks: Convert text to spoken word
- Podcasts: Generate podcast episodes
- Accessibility: Make content accessible
- IVR: Phone system voice prompts
- Video Narration: Add narration to videos
- 配音:产品演示、解说视频
- 有声书:将文本转换为有声内容
- 播客:生成播客剧集
- 无障碍服务:让内容更易获取
- IVR:电话系统语音提示
- 视频旁白:为视频添加旁白
Combine with Video
与视频结合使用
Generate speech, then create a talking head video:
bash
undefined生成语音后,创建虚拟人说话视频:
bash
undefined1. Generate speech
1. 生成语音
infsh app run infsh/kokoro-tts --input '{"text": "Your script here"}' > speech.json
infsh app run infsh/kokoro-tts --input '{"text": "Your script here"}' > speech.json
2. Use the audio URL with OmniHuman for avatar video
2. 使用音频URL结合OmniHuman生成虚拟人视频
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
undefinedinfsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
undefinedRelated Skills
相关技能
bash
undefinedbash
undefinedFull platform skill (all 150+ apps)
全平台技能(包含150+应用)
npx skills add inference-sh/skills@inference-sh
npx skills add inference-sh/skills@inference-sh
AI avatars (combine TTS with talking heads)
AI虚拟人(结合TTS与说话头像)
npx skills add inference-sh/skills@ai-avatar-video
npx skills add inference-sh/skills@ai-avatar-video
AI music generation
AI音乐生成
npx skills add inference-sh/skills@ai-music-generation
npx skills add inference-sh/skills@ai-music-generation
Speech-to-text (transcription)
语音转文本(转录)
npx skills add inference-sh/skills@speech-to-text
npx skills add inference-sh/skills@speech-to-text
Video generation
AI视频生成
npx skills add inference-sh/skills@ai-video-generation
Browse all apps: `infsh app list`npx skills add inference-sh/skills@ai-video-generation
浏览所有应用:`infsh app list`Documentation
文档
- Running Apps - How to run apps via CLI
- Audio Transcription Example - Audio processing workflows
- Apps Overview - Understanding the app ecosystem