elevenlabs-tts

ElevenLabs Text-to-Speech

Premium text-to-speech with 22+ voices via inference.sh CLI.

Quick Start

Requires the inference.sh CLI (`infsh`); see the install instructions. Then log in:

infsh login

Generate speech with ElevenLabs

infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

Available Models

| Model | ID | Best For | Latency |
|-------|----|----------|---------|
| Multilingual v2 | `eleven_multilingual_v2` | Highest quality, 32 languages | ~250ms |
| Turbo v2.5 | `eleven_turbo_v2_5` | Balance of speed & quality | ~150ms |
| Flash v2.5 | `eleven_flash_v2_5` | Ultra-low latency | ~75ms |
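
The latency column can drive model selection from a script. The following is a minimal sketch, assuming the approximate figures in the table hold for your workload; `pick_model` is a hypothetical helper, not part of `infsh`.

```bash
# Hypothetical helper: map a latency budget (in ms) to a model ID,
# using the approximate latencies listed in the table (~75 / ~150 / ~250 ms).
pick_model() (
  budget_ms=$1
  if [ "$budget_ms" -ge 250 ]; then
    echo "eleven_multilingual_v2"   # highest quality
  elif [ "$budget_ms" -ge 150 ]; then
    echo "eleven_turbo_v2_5"        # balance of speed and quality
  else
    echo "eleven_flash_v2_5"        # lowest latency listed
  fi
)

pick_model 200   # prints eleven_turbo_v2_5
```

The printed ID can then be passed as the `model` field of the `--input` JSON.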

Voice Library

Female Voices

| Voice | Style |
|-------|-------|
| `aria` | American, conversational |
| `alice` | British, confident |
| `bella` | American, warm |
| `jessica` | American, expressive |
| `laura` | American, professional |
| `lily` | British, soft |
| `sarah` | American, friendly |

Male Voices

| Voice | Style |
|-------|-------|
| `george` | British, authoritative |
| `adam` | American, deep |
| `bill` | American, mature |
| `brian` | American, conversational |
| `callum` | Transatlantic, intense |
| `charlie` | Australian, natural |
| `chris` | American, casual |
| `daniel` | British, commanding |
| `eric` | American, friendly |
| `harry` | American, young |
| `liam` | American, articulate |
| `matilda` | American, warm |
| `river` | American, confident |
| `roger` | American, authoritative |
| `will` | American, bright |

Examples

Basic Speech

infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'
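
When the text comes from a variable, quotes inside it can break the hand-written JSON. The sketch below builds the same `text`/`voice` payload with basic escaping. `json_escape` and `tts_payload` are hypothetical helpers, and the escaping covers only backslashes and double quotes, not newlines or other control characters.

```bash
# Hypothetical helper: escape backslashes and double quotes for use in a JSON string.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Hypothetical helper: build the --input payload from a text and a voice name.
tts_payload() {
  printf '{"text": "%s", "voice": "%s"}' "$(json_escape "$1")" "$(json_escape "$2")"
}

text="It's a \"demo\""
tts_payload "$text" george
# prints {"text": "It's a \"demo\"", "voice": "george"}
```

The result can be passed straight to the CLI, for example `infsh app run elevenlabs/tts --input "$(tts_payload "$text" george)"`.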

Choose a Model

Highest quality

infsh app run elevenlabs/tts --input '{ "text": "This is our premium multilingual model with the best quality.", "voice": "aria", "model": "eleven_multilingual_v2" }'

Ultra-fast for real-time applications

infsh app run elevenlabs/tts --input '{ "text": "Flash model for low-latency applications.", "voice": "brian", "model": "eleven_flash_v2_5" }'

Voice Tuning

infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'

| Parameter | Range | Effect |
|-----------|-------|--------|
| `stability` | 0-1 | Higher = more consistent, lower = more expressive |
| `similarity_boost` | 0-1 | Higher = closer to original voice character |
| `style` | 0-1 | Higher = more style exaggeration |
| `use_speaker_boost` | true/false | Enhances speaker clarity |
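
Since the three numeric parameters share the 0-1 range, a script can check them before calling the CLI. This is a minimal sketch; `in_unit_range` and `check_tuning` are hypothetical helpers, and how the service itself treats out-of-range values is not covered here.

```bash
# Hypothetical helper: succeed only if the argument is a plain decimal in [0, 1].
in_unit_range() {
  awk -v v="$1" 'BEGIN { exit !(v ~ /^[0-9]*\.?[0-9]+$/ && v + 0 >= 0 && v + 0 <= 1) }'
}

# Hypothetical helper: validate stability, similarity_boost and style in that order.
check_tuning() {
  for pair in "stability=$1" "similarity_boost=$2" "style=$3"; do
    if ! in_unit_range "${pair#*=}"; then
      echo "${pair%%=*} must be between 0 and 1, got '${pair#*=}'" >&2
      return 1
    fi
  done
}

check_tuning 0.3 0.9 0.4 && echo "tuning values OK"
```

Running the check first keeps a typo such as `stability: 3` from reaching the request.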

Output Formats

High-quality MP3

infsh app run elevenlabs/tts --input '{ "text": "High quality audio output.", "voice": "daniel", "output_format": "mp3_44100_192" }'

| Format | Description |
|--------|-------------|
| `mp3_44100_128` | MP3 at 44.1kHz, 128kbps (default) |
| `mp3_44100_192` | MP3 at 44.1kHz, 192kbps |
| `pcm_16000` | Raw PCM at 16kHz |
| `pcm_22050` | Raw PCM at 22.05kHz |
| `pcm_24000` | Raw PCM at 24kHz |
| `pcm_44100` | Raw PCM at 44.1kHz |
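
The format identifiers follow a `codec_samplerate[_bitrate]` pattern, as the table shows. The sketch below splits an identifier into those parts; `describe_format` is a hypothetical helper and only handles the naming pattern seen in this table.

```bash
# Hypothetical helper: split an identifier such as "mp3_44100_192" on underscores
# and print its parts. Runs in a subshell so the IFS change stays local.
describe_format() (
  IFS=_
  set -- $1                 # $1=codec, $2=sample rate, $3=bitrate (MP3 only)
  codec=$1 rate=$2 kbps=${3:-}
  if [ -n "$kbps" ]; then
    echo "$codec at $rate Hz, $kbps kbps"
  else
    echo "$codec at $rate Hz"
  fi
)

describe_format mp3_44100_192   # prints mp3 at 44100 Hz, 192 kbps
describe_format pcm_16000       # prints pcm at 16000 Hz
```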

Multilingual

ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

Spanish

infsh app run elevenlabs/tts --input '{ "text": "Hola, bienvenidos a nuestra presentación.", "voice": "aria", "model": "eleven_multilingual_v2" }'

French

infsh app run elevenlabs/tts --input '{ "text": "Bonjour, bienvenue à notre démonstration.", "voice": "alice", "model": "eleven_multilingual_v2" }'

Voice + Video Workflow


1. Generate voiceover

infsh app run elevenlabs/tts --input '{ "text": "Introducing the future of AI-powered content creation.", "voice": "george" }' > voiceover.json

2. Create talking head video

infsh app run bytedance/omnihuman-1-5 --input '{ "image_url": "https://portrait.jpg", "audio_url": "<audio-url-from-step-1>" }'
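
The two steps can be chained by pulling the audio URL out of `voiceover.json`. The source does not show the output schema, so this sketch assumes only that the file contains the audio URL as a plain string and that it is the first URL in the file. `first_url` is a hypothetical helper.

```bash
# Hypothetical helper: print the first http(s) URL found in a file.
# Assumption: the first URL in the saved output is the generated audio.
first_url() {
  grep -o 'https\{0,1\}://[^"[:space:]]*' "$1" | head -n 1
}

# Only runs when step 1 has already written voiceover.json.
if [ -f voiceover.json ]; then
  audio_url=$(first_url voiceover.json)
  if [ -z "$audio_url" ]; then
    echo "no URL found in voiceover.json" >&2
    exit 1
  fi
  infsh app run bytedance/omnihuman-1-5 --input \
    "{\"image_url\": \"https://portrait.jpg\", \"audio_url\": \"$audio_url\"}"
fi
```

If the output lists several URLs, inspect `voiceover.json` and adjust the extraction to the field that actually holds the audio.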

Use Cases

  • Voiceovers: Product demos, explainer videos, commercials
  • Audiobooks: Long-form narration with consistent voices
  • Podcasts: AI hosts with natural delivery
  • E-learning: Course narration in multiple languages
  • Accessibility: High-quality screen reader content
  • IVR: Professional phone system messages
  • Video Narration: Documentary and social media content

Related Skills


ElevenLabs multi-speaker dialogue

npx skills add inference-sh/skills@elevenlabs-dialogue

ElevenLabs voice changer

npx skills add inference-sh/skills@elevenlabs-voice-changer

ElevenLabs sound effects

npx skills add inference-sh/skills@elevenlabs-sound-effects

All TTS models (Kokoro, DIA, Chatterbox, and more)

npx skills add inference-sh/skills@text-to-speech

Full platform skill (all 150+ apps)

npx skills add inference-sh/skills@infsh-cli

Browse all audio apps: `infsh app list --category audio`