narration-scripting
Narration Scripting
Comprehensive guide to writing narration scripts optimized for video production, TTS synthesis, and audience engagement.
Overview
- Writing scene-by-scene narration for demo videos
- Timing synchronization between visuals and voice
- Pacing narration for optimal comprehension
- CTA scripting that converts viewers
- TTS-optimized script formatting
- Multi-format narration (horizontal, vertical, square)
Core Principle
Narration = Visual Support + Comprehension Timing + Emotional Arc
Narration should enhance visuals, not compete with them. Words must land precisely when viewers need context, and pacing must match cognitive load.
Timing Fundamentals
Frame to Milliseconds Conversion
Frame Rate    1 Frame    15 Frames    30 Frames    60 Frames
─────────────────────────────────────────────────────────────
24 fps        41.67ms    625ms        1250ms       2500ms
30 fps        33.33ms    500ms        1000ms       2000ms
60 fps        16.67ms    250ms        500ms        1000ms
Common Timing Shortcuts:
├── 30fps: Frame# x 33.33 = milliseconds
├── 24fps: Frame# x 41.67 = milliseconds
└── 60fps: Frame# / 60 x 1000 = milliseconds
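The conversions in this table can be sketched as a few small helpers. This is a minimal illustration, not part of the original skill: the function names `frames_to_ms`, `ms_to_frame`, and `format_timestamp` are hypothetical. Dividing by the frame rate rather than multiplying by a rounded constant avoids drift on long timelines (frame 900 at 30 fps is 30000 ms, while 900 x 33.33 gives 29997 ms).

```python
def frames_to_ms(frame: int, fps: float) -> float:
    """Return the time in milliseconds at which `frame` starts."""
    return frame / fps * 1000.0


def ms_to_frame(ms: float, fps: float) -> int:
    """Return the nearest frame index for a millisecond timestamp."""
    return round(ms * fps / 1000.0)


def format_timestamp(ms: float) -> str:
    """Format milliseconds as M:SS.mmm, the style used in script blocks."""
    total_ms = round(ms)
    minutes, rest = divmod(total_ms, 60_000)
    seconds, millis = divmod(rest, 1000)
    return f"{minutes}:{seconds:02d}.{millis:03d}"


if __name__ == "__main__":
    # Frame 75 of a 30 fps timeline, printed as a script timestamp.
    print(format_timestamp(frames_to_ms(75, 30)))
```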
Sync Point Types
Type          Symbol    Usage                  Precision
──────────────────────────────────────────────────────────────
Hard Sync     [!]       Word lands on action   +/- 2 frames
Soft Sync     [~]       Word near action       +/- 10 frames
Window Sync   [...]     Word during scene      Flexible
Lead Sync     [>]       Word before action     100-300ms early
Lag Sync      [<]       Word after action      100-500ms late
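One way to apply these precision bands is a small tolerance check run against a recorded take. The sketch below is an assumption about how the table could be enforced: `sync_ok` and its string keys (`"hard"`, `"soft"`, and so on) are hypothetical names, and the default of 30 fps is chosen only for illustration.

```python
def sync_ok(kind: str, word_ms: float, action_ms: float, fps: float = 30.0) -> bool:
    """Check whether a word's landing time meets the sync type's precision."""
    offset_ms = word_ms - action_ms      # negative means the word is early
    frame_ms = 1000.0 / fps              # duration of one frame
    if kind == "hard":                   # +/- 2 frames
        return abs(offset_ms) <= 2 * frame_ms
    if kind == "soft":                   # +/- 10 frames
        return abs(offset_ms) <= 10 * frame_ms
    if kind == "lead":                   # 100-300 ms before the action
        return 100 <= -offset_ms <= 300
    if kind == "lag":                    # 100-500 ms after the action
        return 100 <= offset_ms <= 500
    if kind == "window":                 # flexible: anywhere in the scene
        return True
    raise ValueError(f"unknown sync type: {kind}")


if __name__ == "__main__":
    # A word landing 50 ms late passes a hard sync at 30 fps but not at 60 fps.
    print(sync_ok("hard", 1050, 1000, fps=30), sync_ok("hard", 1050, 1000, fps=60))
```

Because hard and soft sync are defined in frames, the same millisecond offset can pass at one frame rate and fail at another, which is why the frame rate is a parameter.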
Words Per Minute (WPM) Guidelines
Comprehension-Based Pacing
Content Type       WPM Range    Pause Frequency      Use Case
─────────────────────────────────────────────────────────────────
Technical Demo     120-140      Every 8-10 words     Complex UI, code
Tutorial           130-150      Every 10-12 words    Step-by-step
Product Feature    140-160      Every 12-15 words    Marketing, benefits
Quick Overview     150-170      Every 15-20 words    Intro sequences
High Energy        170-190      Minimal pauses       TikTok, Reels
Documentary        110-130      Natural pauses       Storytelling
Platform-Specific WPM
Platform          WPM Range    Why
────────────────────────────────────────────────────
TikTok            160-180      Fast scroll, hook fast
Reels             150-170      Slightly slower aesthetic
YouTube Shorts    140-160      More value-focused
YouTube Long      130-150      Comprehension over speed
LinkedIn          120-140      Professional, clear
Twitter/X         150-170      Quick engagement
Calculating Script Length
Formula: (Video Duration in seconds) x (WPM / 60) = Word Count
Examples:
├── 15s video @ 150 WPM = 37 words (37.5, rounded down)
├── 30s video @ 140 WPM = 70 words
├── 60s video @ 130 WPM = 130 words
├── 5m video @ 140 WPM = 700 words
└── 10m video @ 135 WPM = 1350 words
Include pause time:
Effective words = Total words - (pause_count x 1.5)
Each planned pause costs roughly 1.5 words of speaking time.
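The formula and the pause adjustment can be combined into one helper. This is a minimal sketch under stated assumptions: `word_budget` is a hypothetical name, the 1.5 words-per-pause figure comes from the formula above, and fractional results are rounded down to match the 15-second example.

```python
def word_budget(duration_s: float, wpm: float, pause_count: int = 0,
                words_per_pause: float = 1.5) -> int:
    """Return how many words fit in `duration_s` at `wpm`, minus pause cost."""
    total_words = duration_s * wpm / 60.0
    effective = total_words - pause_count * words_per_pause
    return max(0, int(effective))        # round down; never negative


if __name__ == "__main__":
    print(word_budget(15, 150))                  # 15 s promo, no pauses
    print(word_budget(60, 130, pause_count=4))   # 60 s tutorial, four pauses
```

Rounding down is the conservative choice here: a script that is one word short leaves room for a breath, while one that is a word over forces the TTS rate up.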
Script Format Standard
Basic Script Block
Scene: [Scene Name]
Duration: [start] - [end] (total seconds)
Visual: [What's on screen]
Narration:
[!0:00.000] "First word lands exactly here."
[~0:02.500] "This phrase starts around this mark."
[...0:05-0:08] "This section plays during this window."
[>0:10.000] "This leads INTO the next action."
[<0:12.500] "This follows the completed action."
Pauses:
- [0:04.000] 300ms breath pause
- [0:08.500] 500ms dramatic pause
Notes:
- Emphasis on "exactly" and "action"
- Tone: Confident, clear
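Narration lines in this format are regular enough to parse mechanically, which helps when feeding cues to an editor or a TTS pipeline. The parser below is a sketch, not a specification of the format: the `Cue` dataclass, `parse_cue`, and the regular expression are all assumptions derived from the example lines in this block.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Marker symbol -> sync type, following the sync point table.
SYNC_KINDS = {"!": "hard", "~": "soft", "...": "window", ">": "lead", "<": "lag"}

CUE_RE = re.compile(
    r'^\[(?P<sym>\.\.\.|[!~<>])'                      # sync symbol
    r'(?P<start>\d+:\d{2}(?:\.\d{1,3})?)'             # start, M:SS or M:SS.mmm
    r'(?:-(?P<end>\d+:\d{2}(?:\.\d{1,3})?))?\]\s*'    # optional window end
    r'"(?P<text>[^"]*)"'                              # quoted narration
)


@dataclass
class Cue:
    kind: str
    start_ms: int
    end_ms: Optional[int]
    text: str


def to_ms(stamp: str) -> int:
    """Convert an M:SS(.mmm) timestamp to milliseconds."""
    minutes, seconds = stamp.split(":")
    return round((int(minutes) * 60 + float(seconds)) * 1000)


def parse_cue(line: str) -> Optional[Cue]:
    """Parse one narration line; return None if it is not a quoted cue."""
    match = CUE_RE.match(line.strip())
    if match is None:
        return None
    end = match.group("end")
    return Cue(
        kind=SYNC_KINDS[match.group("sym")],
        start_ms=to_ms(match.group("start")),
        end_ms=to_ms(end) if end else None,
        text=match.group("text"),
    )


if __name__ == "__main__":
    print(parse_cue('[...0:05-0:08] "This section plays during this window."'))
```

Pause entries and unquoted cues (for example a window reserved for music) return `None` in this sketch, so they would need their own handling.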
Extended Format with TTS Markers
Scene: Product Feature Demo
Duration: 0:15.000 - 0:30.000 (15s)
Visual: Screen recording of feature in action
Narration (TTS-Optimized):
[!0:15.000] "Watch how simple this is." {rate:0.9}
[~0:17.500] "Just click... {pause:200ms} and drag." {rate:1.0}
[!0:20.000] "The AI handles the rest." {emphasis:high}
[...0:22-0:26] "No configuration needed. No learning curve."
[>0:27.500] "Ready to try it yourself?" {tone:inviting}
TTS Parameters:
- Voice: Professional, warm (e.g., OpenAI "nova", Gemini "Kore")
- Base rate: 1.0x
- Pitch: Neutral
Sync Points:
- 0:15.000 [HARD] Word "Watch" on button hover
- 0:17.500 [SOFT] "click" during click animation
- 0:20.000 [HARD] "AI" on result appearing
Scene-by-Scene Templates
Demo Video Template (30s)
Scene 1: Hook
Duration: 0:00 - 0:03 (3s)
Visual: Problem statement or pain point visual
Narration:
[!0:00.000] "Tired of {problem}?"
[~0:01.500] "There's a better way."
WPM: 160 (8 words / 3s)
Tone: Empathetic, intriguing
Scene 2: Solution Intro
Duration: 0:03 - 0:08 (5s)
Visual: Product/tool name reveal, interface preview
Narration:
[!0:03.000] "Meet {ProductName}."
[~0:04.500] "The {category} that actually works."
[...0:06-0:08] "Let me show you."
WPM: ~144 (12 words / 5s)
Tone: Confident, friendly
Scene 3: Demo Action
Duration: 0:08 - 0:20 (12s)
Visual: Screen recording of key feature
Narration:
[!0:08.000] "Here's how it works."
[~0:09.500] "Step one: {action}."
[!0:12.000] "Step two: {action}."
[~0:15.000] "And just like that..."
[!0:17.000] "{Result in one sentence}."
[...0:18-0:20] Pause for visual impact
WPM: 130 (26 words / 12s)
Tone: Clear, instructional
Scene 4: CTA
Duration: 0:20 - 0:30 (10s)
Visual: CTA screen with link/QR code
Narration:
[!0:20.000] "Ready to {benefit}?"
[~0:22.000] "Try {ProductName} free today."
[!0:25.000] "Link in bio." OR "Click below."
[...0:27-0:30] {Music swell, no narration}
WPM: 120 (15 words / 7.5s active narration)
Tone: Inviting, urgent but not pushy
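The WPM figure on each scene is the word count divided by the active narration time, so it can be checked once placeholders are filled in. The helpers below are a rough sketch: `count_words` and `scene_wpm` are hypothetical names, and counting each `{placeholder}` as a single word is an assumption that will undercount multi-word product names.

```python
import re


def count_words(narration: str) -> int:
    """Count spoken words, treating each {placeholder} as one word."""
    return len(re.findall(r"\{[^}]*\}|[A-Za-z0-9']+", narration))


def scene_wpm(word_count: int, active_seconds: float) -> float:
    """Return the speaking rate for a scene in words per minute."""
    return word_count * 60.0 / active_seconds


if __name__ == "__main__":
    line = "There's a better way."
    print(count_words(line), scene_wpm(26, 12))
```

For Scene 4 the divisor is the 7.5 seconds of active narration rather than the full 10-second scene, since the closing music swell carries no words.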
Tutorial Template (60s)
See: references/script-templates.md
Promo Template (15s)
See: references/script-templates.md
CTA Scripting Patterns
CTA Formula Framework
Pattern          Script Template                            Use Case
─────────────────────────────────────────────────────────────────────────
Direct Ask       "Try {product} free today."                Conversion-focused
Benefit-First    "Start {benefit}ing now."                  Value-focused
Scarcity         "Join {number} others before {time}."      Urgency
Social Proof     "{Number} developers already use this."    Trust-building
Next Step        "Here's what to do next..."                Educational
Question CTA     "Ready to {transformation}?"               Engagement
CTA Timing Rules
Video Length    CTA Start      CTA Duration    Approach
────────────────────────────────────────────────────────────
<15s            Last 3s        2-3s            Direct, single CTA
15-30s          Last 5s        3-5s            Benefit + action
30-60s          Last 8-10s     6-8s            Setup + CTA + reinforce
60-120s         Last 12-15s    8-12s           Recap + CTA + social proof
>2min           Last 20-30s    15-20s          Summary + CTA + next content
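These rules translate directly into a lookup that returns where the CTA should begin for a given video length. The sketch below makes one assumption the table leaves open: a duration that sits exactly on a shared boundary (15, 30, or 60 seconds) is placed in the longer bracket, which matches the 30-second demo template starting its CTA at 0:20. The name `cta_window` is hypothetical.

```python
from typing import Tuple


def cta_window(duration_s: float) -> Tuple[float, float, str]:
    """Return (earliest_start_s, latest_start_s, approach) for the CTA."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    # lead = (longest, shortest) distance from the end at which the CTA starts
    if duration_s < 15:
        lead, approach = (3, 3), "Direct, single CTA"
    elif duration_s < 30:
        lead, approach = (5, 5), "Benefit + action"
    elif duration_s < 60:
        lead, approach = (10, 8), "Setup + CTA + reinforce"
    elif duration_s <= 120:
        lead, approach = (15, 12), "Recap + CTA + social proof"
    else:
        lead, approach = (30, 20), "Summary + CTA + next content"
    earliest = max(0.0, float(duration_s - lead[0]))
    latest = max(0.0, float(duration_s - lead[1]))
    return earliest, latest, approach


if __name__ == "__main__":
    print(cta_window(30))
```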
Platform-Specific CTAs
Platform          CTA Script Pattern                        Notes
────────────────────────────────────────────────────────────────────
TikTok            "Follow for more {topic}."                Simple, immediate
                  "Link in bio."
Reels             "Save this for later."                    Encourages saves
                  "Share with someone who needs this."
YouTube Shorts    "Subscribe for more {topic}."             Channel growth
                  "Full tutorial linked above."
YouTube Long      "Like and subscribe."                     Engagement boost
                  "Watch this next: [card]"                 Session time
LinkedIn          "What's your experience with this?"       Comment engagement
                  "DM me for the template."                 Lead generation
TTS Optimization
Script Formatting for TTS
**DO:**
- Use contractions: "It's" not "It is" (more natural)
- Write phonetically for tricky words: "GIF" → "gif" or "jif"
- Include punctuation for pauses: commas, periods, ellipses
- Mark emphasis with *asterisks* or {emphasis:word}
- Specify pronunciation: "Read" {rhymes with "red"}
**DON'T:**
- Use ALL CAPS (TTS may read the word as an acronym)
- Include URLs verbatim (spell them out or skip them)
- Use special characters such as &, @, # (write them out as words instead)
- Write large numbers as digits, e.g. "1,234,567" (spell them out)
TTS Markup Examples
**Plain text:**
"Click the button and wait for results."
**TTS-optimized:**
"Click the button... {pause:300ms} and wait for results."
**With emphasis:**
"Click the *button*... {pause:300ms} and wait for *results*."
**With rate control:**
"{rate:0.9}Watch carefully.{rate:1.1} This happens fast."
**SSML format (advanced):**
<speak>
Click the <emphasis level="moderate">button</emphasis>
<break time="300ms"/>
and wait for <prosody rate="slow">results</prosody>.
</speak>
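The inline markup and the SSML form describe the same intent, so one can be generated from the other. The converter below is a minimal sketch covering only `{pause:Nms}` and `*emphasis*`; `to_ssml` is a hypothetical helper, the `moderate` emphasis level is an arbitrary default, and which SSML tags a given TTS provider honors should be checked against that provider's documentation.

```python
import re
from xml.sax.saxutils import escape


def to_ssml(script: str) -> str:
    """Convert {pause:Nms} and *emphasis* inline markup to an SSML string."""
    ssml = escape(script)  # escape &, <, > before inserting any tags
    # {pause:300ms} -> <break time="300ms"/>, absorbing surrounding spaces
    ssml = re.sub(r"\s*\{pause:(\d+)ms\}\s*", r'<break time="\1ms"/> ', ssml)
    # *word* -> <emphasis level="moderate">word</emphasis>
    ssml = re.sub(r"\*([^*]+)\*", r'<emphasis level="moderate">\1</emphasis>', ssml)
    return f"<speak>{ssml.strip()}</speak>"


if __name__ == "__main__":
    print(to_ssml("Click the *button*... {pause:300ms} and wait for *results*."))
```

Escaping first matters: a narration line containing `&` or `<` would otherwise produce malformed SSML once the tags are added.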
Voice Selection Guidelines
Content Type      Recommended Voices                    Characteristics
────────────────────────────────────────────────────────────────────────
Technical Demo    OpenAI: "onyx", Gemini: "Charon"      Clear, authoritative
Tutorial          OpenAI: "nova", Gemini: "Kore"        Warm, patient
Marketing         OpenAI: "alloy", Gemini: "Puck"       Energetic, engaging
Corporate         OpenAI: "echo", Gemini: "Fenrir"      Professional, calm
Storytelling      OpenAI: "fable", Gemini: "Aoede"      Expressive, melodic
Sync Point Workflow
Pre-Production Sync Planning
1. Create rough edit with temporary VO or no audio
2. Mark key visual moments (timestamps)
3. Write narration to hit those marks
4. Calculate total word count vs. duration
5. Adjust pacing or cut visuals to match
Visual Moments to Mark:
├── Actions (clicks, transitions, animations)
├── Reveals (new screens, results, data)
├── Emphasis (key features, benefits)
├── Transitions (scene changes)
└── CTA appearance
Post-Production Sync Adjustment
Problem                 Solution
───────────────────────────────────────────────────────────────
Narration too long      Cut words; don't speed up the TTS
Narration too short     Add pauses or elaboration
Hard sync misaligned    Adjust video timing or rewrite the phrase
Multiple sync misses    Re-evaluate the scene structure
TTS sounds rushed       Lower WPM, add breath pauses
TTS sounds slow         Trim pauses, tighten phrases
Quick Reference: Narration Checklist
Pre-Write:
[ ] Video duration confirmed
[ ] Target WPM selected
[ ] Key sync points identified
[ ] CTA placement decided
[ ] Voice/tone defined
During Write:
[ ] Each scene has timing markers
[ ] Pauses marked for breath/emphasis
[ ] Technical terms phonetically noted
[ ] Contractions used naturally
[ ] Emphasis words identified
Post-Write:
[ ] Word count within target
[ ] All sync points achievable
[ ] TTS test recorded
[ ] Timing validated against video
[ ] CTA clear and actionable
Related Skills
- video-pacing: Rhythm and timing patterns for video editing
- audio-language-models: TTS providers and voice synthesis
- demo-producer: Full demo video production workflow
- hook-formulas: Attention-grabbing opening patterns
- copywriting-patterns: Persuasive writing techniques
References
- Script Templates - Full templates for demo, tutorial, promo videos
- Timing Markers - Detailed sync point specification
- Pacing Guidelines - WPM targets and comprehension research
Capability Details
script-writing
Keywords: narration, script, voiceover, VO, dialogue, copy
Solves:
- How do I write narration for a demo video?
- Script templates for video production
- Writing voice-over copy
timing-sync
Keywords: timing, sync, synchronization, markers, cue points
Solves:
- How do I sync narration to video?
- Timing markers for TTS
- Frame-accurate voice alignment
pacing
Keywords: WPM, words per minute, pacing, speed, comprehension
Solves:
- How fast should narration be?
- Calculating script length for video duration
- Platform-specific pacing
cta-scripting
Keywords: CTA, call to action, conversion, engagement
Solves:
- How do I write a video CTA?
- CTA patterns for different platforms
- Closing scripts that convert
tts-optimization
Keywords: TTS, text-to-speech, voice synthesis, SSML
Solves:
- How do I format scripts for TTS?
- Making AI voices sound natural
- TTS markup and emphasis