narration-scripting

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Narration Scripting

旁白脚本创作

Comprehensive guide to writing narration scripts optimized for video production, TTS synthesis, and audience engagement.
本指南全面介绍如何撰写适配视频制作、TTS合成和观众参与度优化的旁白脚本。

Overview

概述

  • Writing scene-by-scene narration for demo videos
  • Timing synchronization between visuals and voice
  • Pacing narration for optimal comprehension
  • CTA scripting that converts viewers
  • TTS-optimized script formatting
  • Multi-format narration (horizontal, vertical, square)
  • 为演示视频撰写逐场景旁白
  • 视觉内容与旁白的时间同步
  • 优化旁白节奏以提升理解度
  • 撰写能转化观众的CTA脚本
  • 适配TTS的脚本格式
  • 多格式旁白(横屏、竖屏、方形)

Core Principle

核心原则

Narration = Visual Support + Comprehension Timing + Emotional Arc
Narration should enhance visuals, not compete with them. Words must land precisely when viewers need context, and pacing must match cognitive load.
旁白 = 视觉支撑 + 理解节奏 + 情感弧线
旁白应强化视觉内容,而非与之竞争。文字需在观众需要上下文的精准时刻出现,节奏需匹配认知负荷。

Timing Fundamentals

时间基础

Frame to Milliseconds Conversion

帧到毫秒的转换

Frame Rate    1 Frame    15 Frames    30 Frames    60 Frames
─────────────────────────────────────────────────────────────
24 fps        41.67ms    625ms        1250ms       2500ms
30 fps        33.33ms    500ms        1000ms       2000ms
60 fps        16.67ms    250ms        500ms        1000ms

Common Timing Shortcuts:
├── 30fps: Frame# x 33.33 = milliseconds
├── 24fps: Frame# x 41.67 = milliseconds
└── 60fps: Frame# / 60 x 1000 = milliseconds
Frame Rate    1 Frame    15 Frames    30 Frames    60 Frames
─────────────────────────────────────────────────────────────
24 fps        41.67ms    625ms        1250ms       2500ms
30 fps        33.33ms    500ms        1000ms       2000ms
60 fps        16.67ms    250ms        500ms        1000ms

Common Timing Shortcuts:
├── 30fps: Frame# x 33.33 = milliseconds
├── 24fps: Frame# x 41.67 = milliseconds
└── 60fps: Frame# / 60 x 1000 = milliseconds

Sync Point Types

同步点类型

Type            Symbol    Usage                   Precision
──────────────────────────────────────────────────────────────
Hard Sync       [!]       Word lands on action    +/- 2 frames
Soft Sync       [~]       Word near action        +/- 10 frames
Window Sync     [...]     Word during scene       Flexible
Lead Sync       [>]       Word before action      100-300ms early
Lag Sync        [<]       Word after action       100-500ms late
Type            Symbol    Usage                   Precision
──────────────────────────────────────────────────────────────
Hard Sync       [!]       Word lands on action    +/- 2 frames
Soft Sync       [~]       Word near action        +/- 10 frames
Window Sync     [...]     Word during scene       Flexible
Lead Sync       [>]       Word before action      100-300ms early
Lag Sync        [<]       Word after action       100-500ms late

Words Per Minute (WPM) Guidelines

每分钟字数(WPM)指南

Comprehension-Based Pacing

基于理解度的节奏控制

Content Type          WPM Range    Pause Frequency    Use Case
─────────────────────────────────────────────────────────────────
Technical Demo        120-140      Every 8-10 words   Complex UI, code
Tutorial              130-150      Every 10-12 words  Step-by-step
Product Feature       140-160      Every 12-15 words  Marketing, benefits
Quick Overview        150-170      Every 15-20 words  Intro sequences
High Energy           170-190      Minimal pauses     TikTok, Reels
Documentary           110-130      Natural pauses     Storytelling
Content Type          WPM Range    Pause Frequency    Use Case
─────────────────────────────────────────────────────────────────
Technical Demo        120-140      Every 8-10 words   Complex UI, code
Tutorial              130-150      Every 10-12 words  Step-by-step
Product Feature       140-160      Every 12-15 words  Marketing, benefits
Quick Overview        150-170      Every 15-20 words  Intro sequences
High Energy           170-190      Minimal pauses     TikTok, Reels
Documentary           110-130      Natural pauses     Storytelling

Platform-Specific WPM

平台专属WPM建议

Platform       WPM Range    Why
────────────────────────────────────────────────────
TikTok         160-180      Fast scroll, hook fast
Reels          150-170      Slightly slower aesthetic
YouTube Shorts 140-160      More value-focused
YouTube Long   130-150      Comprehension over speed
LinkedIn       120-140      Professional, clear
Twitter/X      150-170      Quick engagement
Platform       WPM Range    Why
────────────────────────────────────────────────────
TikTok         160-180      Fast scroll, hook fast
Reels          150-170      Slightly slower aesthetic
YouTube Shorts 140-160      More value-focused
YouTube Long   130-150      Comprehension over speed
LinkedIn       120-140      Professional, clear
Twitter/X      150-170      Quick engagement

Calculating Script Length

脚本长度计算

Formula: (Video Duration in seconds) x (WPM / 60) = Word Count

Examples:
├── 15s video @ 150 WPM = 37 words
├── 30s video @ 140 WPM = 70 words
├── 60s video @ 130 WPM = 130 words
├── 5m video @ 140 WPM = 700 words
└── 10m video @ 135 WPM = 1350 words

Include pause time:
Effective words = Total words - (pause_count x 1.5)
Formula: (Video Duration in seconds) x (WPM / 60) = Word Count

Examples:
├── 15s video @ 150 WPM = 37 words
├── 30s video @ 140 WPM = 70 words
├── 60s video @ 130 WPM = 130 words
├── 5m video @ 140 WPM = 700 words
└── 10m video @ 135 WPM = 1350 words

Include pause time:
Effective words = Total words - (pause_count x 1.5)

Script Format Standard

脚本格式标准

Basic Script Block

基础脚本模块

markdown
undefined
markdown
undefined

Scene: [Scene Name]

场景:[场景名称]

Duration: [start] - [end] (total seconds) Visual: [What's on screen]

Narration: [!0:00.000] "First word lands exactly here." [~0:02.500] "This phrase starts around this mark." [...0:05-0:08] "This section plays during this window." [>0:10.000] "This leads INTO the next action." [<0:12.500] "This follows the completed action."
Pauses:
  • [0:04.000] 300ms breath pause
  • [0:08.500] 500ms dramatic pause
Notes:
  • Emphasis on "exactly" and "action"
  • Tone: Confident, clear
undefined
时长: [开始时间] - [结束时间](总秒数) 视觉内容: [屏幕显示内容]

旁白: [!0:00.000] "第一个词精准对应此处。" [~0:02.500] "这句话大约在此时开始。" [...0:05-0:08] "这段旁白在此时间段内播放。" [>0:10.000] "这句话提前引出下一个动作。" [<0:12.500] "这句话在动作完成后出现。"
停顿:
  • [0:04.000] 300ms呼吸停顿
  • [0:08.500] 500ms戏剧性停顿
备注:
  • 强调“精准”和“动作”
  • 语气:自信、清晰
undefined

Extended Format with TTS Markers

带TTS标记的扩展格式

markdown
undefined
markdown
undefined

Scene: Product Feature Demo

场景:产品功能演示

Duration: 0:15.000 - 0:30.000 (15s) Visual: Screen recording of feature in action

Narration (TTS-Optimized): [!0:15.000] "Watch how simple this is." {rate:0.9} [~0:17.500] "Just click... {pause:200ms} and drag." {rate:1.0} [!0:20.000] "The AI handles the rest." {emphasis:high} [...0:22-0:26] "No configuration needed. No learning curve." [>0:27.500] "Ready to try it yourself?" {tone:inviting}
TTS Parameters:
  • Voice: Professional, warm (e.g., OpenAI "nova", Gemini "Kore")
  • Base rate: 1.0x
  • Pitch: Neutral
Sync Points:
  • 0:15.000 [HARD] Word "Watch" on button hover
  • 0:17.500 [SOFT] "click" during click animation
  • 0:20.000 [HARD] "AI" on result appearing
undefined
时长: 0:15.000 - 0:30.000(15秒) 视觉内容: 功能实操的屏幕录制

旁白(适配TTS): [!0:15.000] "看看这有多简单。" {rate:0.9} [~0:17.500] "只需点击... {pause:200ms} 然后拖动。" {rate:1.0} [!0:20.000] "剩下的交给AI处理。" {emphasis:high} [...0:22-0:26] "无需配置,无需学习曲线。" [>0:27.500] "准备好亲自尝试了吗?" {tone:inviting}
TTS参数:
  • 语音:专业、亲切(例如OpenAI "nova"、Gemini "Kore")
  • 基础语速:1.0x
  • 音调:中性
同步点:
  • 0:15.000 [硬同步] 单词“看看”对应按钮悬停时刻
  • 0:17.500 [软同步] “点击”对应点击动画时刻
  • 0:20.000 [硬同步] “AI”对应结果出现时刻
undefined

Scene-by-Scene Templates

逐场景模板

Demo Video Template (30s)

30秒演示视频模板

markdown
undefined
markdown
undefined

Scene 1: Hook

场景1:钩子

Duration: 0:00 - 0:03 (3s) Visual: Problem statement or pain point visual

Narration: [!0:00.000] "Tired of {problem}?" [~0:01.500] "There's a better way."
WPM: 160 (8 words / 3s) Tone: Empathetic, intriguing

时长: 0:00 - 0:03(3秒) 视觉内容: 问题陈述或痛点展示

旁白: [!0:00.000] "受够了{问题}?" [~0:01.500] "有更好的解决方案。"
WPM: 160(3秒共8个词) 语气: 共情、引人好奇

Scene 2: Solution Intro

场景2:解决方案介绍

Duration: 0:03 - 0:08 (5s) Visual: Product/tool name reveal, interface preview

Narration: [!0:03.000] "Meet {ProductName}." [~0:04.500] "The {category} that actually works." [...0:06-0:08] "Let me show you."
WPM: 140 (12 words / 5s) Tone: Confident, friendly

时长: 0:03 - 0:08(5秒) 视觉内容: 产品/工具名称揭晓、界面预览

旁白: [!0:03.000] "来看看{产品名称}。" [~0:04.500] "这款{品类}真的好用。" [...0:06-0:08] "我来演示给你看。"
WPM: 140(5秒共12个词) 语气: 自信、友好

Scene 3: Demo Action

场景3:演示操作

Duration: 0:08 - 0:20 (12s) Visual: Screen recording of key feature

Narration: [!0:08.000] "Here's how it works." [~0:09.500] "Step one: {action}." [!0:12.000] "Step two: {action}." [~0:15.000] "And just like that..." [!0:17.000] "{Result in one sentence}." [...0:18-0:20] Pause for visual impact
WPM: 130 (26 words / 12s) Tone: Clear, instructional

时长: 0:08 - 0:20(12秒) 视觉内容: 核心功能的屏幕录制

旁白: [!0:08.000] "操作方法如下。" [~0:09.500] "第一步:{操作}。" [!0:12.000] "第二步:{操作}。" [~0:15.000] "就这样..." [!0:17.000] "{一句话总结结果}。" [...0:18-0:20] 停顿以突出视觉效果
WPM: 130(12秒共26个词) 语气: 清晰、指导性

Scene 4: CTA

场景4:CTA

Duration: 0:20 - 0:30 (10s) Visual: CTA screen with link/QR code

Narration: [!0:20.000] "Ready to {benefit}?" [~0:22.000] "Try {ProductName} free today." [!0:25.000] "Link in bio." OR "Click below." [...0:27-0:30] {Music swell, no narration}
WPM: 120 (15 words / 7.5s active narration) Tone: Inviting, urgent but not pushy
undefined
时长: 0:20 - 0:30(10秒) 视觉内容: 带链接/二维码的CTA画面

旁白: [!0:20.000] "准备好{享受益处}了吗?" [~0:22.000] "立即免费试用{产品名称}。" [!0:25.000] "链接在简介中。" 或 "点击下方链接。" [...0:27-0:30] {音乐渐强,无旁白}
WPM: 120(7.5秒有效旁白共15个词) 语气: 有吸引力、紧迫但不过分
undefined

Tutorial Template (60s)

60秒教程模板

See:
references/script-templates.md
详见:
references/script-templates.md

Promo Template (15s)

15秒推广模板

See:
references/script-templates.md
详见:
references/script-templates.md

CTA Scripting Patterns

CTA脚本模板

CTA Formula Framework

CTA公式框架

Pattern              Script Template                          Use Case
─────────────────────────────────────────────────────────────────────────
Direct Ask           "Try {product} free today."             Conversion-focused
Benefit-First        "Start {benefit}ing now."               Value-focused
Scarcity             "Join {number} others before {time}."   Urgency
Social Proof         "{Number} developers already use this." Trust-building
Next Step            "Here's what to do next..."             Educational
Question CTA         "Ready to {transformation}?"            Engagement
模板类型              脚本模板                          适用场景
─────────────────────────────────────────────────────────────────────────
直接请求           "立即免费试用{产品}。"             转化导向
利益前置        "立即开始{享受益处}。"               价值导向
稀缺性             "{时间}前加入,已有{人数}人参与。"   紧迫感
社交证明         "已有{数量}名开发者使用本工具。"   建立信任
下一步指引            "接下来要做的是..."             教育类
提问式CTA         "准备好{实现转变}了吗?"            提升参与度

CTA Timing Rules

CTA时间规则

Video Length    CTA Start       CTA Duration    Approach
────────────────────────────────────────────────────────────
<15s            Last 3s         2-3s            Direct, single CTA
15-30s          Last 5s         3-5s            Benefit + action
30-60s          Last 8-10s      6-8s            Setup + CTA + reinforce
60-120s         Last 12-15s     8-12s           Recap + CTA + social proof
>2min           Last 20-30s     15-20s          Summary + CTA + next content
视频时长    CTA开始时间       CTA持续时长    策略
────────────────────────────────────────────────────────────
<15秒            最后3秒         2-3秒            直接、单次CTA
15-30秒          最后5秒         3-5秒            利益+行动
30-60秒          最后8-10秒      6-8秒            铺垫+CTA+强化
60-120秒         最后12-15秒     8-12秒           回顾+CTA+社交证明
>2分钟           最后20-30秒     15-20秒          总结+CTA+后续内容预告

Platform-Specific CTAs

平台专属CTA

Platform        CTA Script Pattern                    Notes
────────────────────────────────────────────────────────────────────
TikTok          "Follow for more {topic}."            Simple, immediate
                "Link in bio."
Reels           "Save this for later."                Encourages saves
                "Share with someone who needs this."
YouTube Shorts  "Subscribe for more {topic}."         Channel growth
                "Full tutorial linked above."
YouTube Long    "Like and subscribe."                 Engagement boost
                "Watch this next: [card]"             Session time
LinkedIn        "What's your experience with this?"   Comment engagement
                "DM me for the template."             Lead generation
平台        CTA脚本模板                    备注
────────────────────────────────────────────────────────────────────
TikTok          "关注获取更多{主题}内容。"            简洁、即时
                "链接在简介中。"
Reels           "收藏以备后用。"                鼓励收藏
                "分享给需要的人。"
YouTube Shorts  "订阅获取更多{主题}内容。"         频道增长
                "完整教程链接在上方。"
YouTube长视频    "点赞并订阅。"                 提升参与度
                "接下来观看:[卡片]"             延长观看时长
LinkedIn        "你有过相关经历吗?"   提升评论互动
                "私信我获取模板。"             线索生成

TTS Optimization

TTS优化

Script Formatting for TTS

适配TTS的脚本格式

markdown
**DO:**
- Use contractions: "It's" not "It is" (more natural)
- Write phonetically for tricky words: "GIF" → "gif" or "jif"
- Include punctuation for pauses: commas, periods, ellipses
- Mark emphasis with *asterisks* or {emphasis:word}
- Specify pronunciation: "Read" {rhymes with "red"}

**DON'T:**
- Use ALL CAPS (TTS reads as acronym)
- Include URLs verbatim (spell out or skip)
- Use special characters: &, @, # (except as words)
- Write numbers as digits for large numbers: "1,234,567"
markdown
**建议:**
- 使用缩写形式:用“It's”而非“It is”(更自然)
- 生僻词标注音标:“GIF” → “gif”或“jif”
- 用标点标记停顿:逗号、句号、省略号
-*星号*或{emphasis:word}标记强调
- 指定发音:“Read” {rhymes with "red"}

**避免:**
- 使用全大写(TTS会读作首字母缩写)
- 直接写入URL(拼写出来或省略)
- 使用特殊字符:&、@、#(除非作为单词)
- 大数字用阿拉伯数字:“1,234,567”

TTS Markup Examples

TTS标记示例

markdown
**Plain text:**
"Click the button and wait for results."

**TTS-optimized:**
"Click the button... {pause:300ms} and wait for results."

**With emphasis:**
"Click the *button*... {pause:300ms} and wait for *results*."

**With rate control:**
"{rate:0.9}Watch carefully.{rate:1.1} This happens fast."

**SSML format (advanced):**
<speak>
  Click the <emphasis level="moderate">button</emphasis>
  <break time="300ms"/>
  and wait for <prosody rate="slow">results</prosody>.
</speak>
markdown
**纯文本:**
“点击按钮,等待结果。”

**适配TTS的版本:**
“点击按钮... {pause:300ms} 等待结果。”

**带强调的版本:**
“点击*按钮*... {pause:300ms} 等待*结果*。”

**带语速控制的版本:**
“{rate:0.9}仔细看。{rate:1.1} 动作很快。”

**SSML格式(进阶):**
<speak>
  Click the <emphasis level="moderate">button</emphasis>
  <break time="300ms"/>
  and wait for <prosody rate="slow">results</prosody>.
</speak>

Voice Selection Guidelines

语音选择指南

Content Type        Recommended Voices              Characteristics
────────────────────────────────────────────────────────────────────────
Technical Demo      OpenAI: "onyx", Gemini: "Charon"  Clear, authoritative
Tutorial            OpenAI: "nova", Gemini: "Kore"    Warm, patient
Marketing           OpenAI: "alloy", Gemini: "Puck"   Energetic, engaging
Corporate           OpenAI: "echo", Gemini: "Fenrir"  Professional, calm
Storytelling        OpenAI: "fable", Gemini: "Aoede"  Expressive, melodic
内容类型        推荐语音              特点
────────────────────────────────────────────────────────────────────────
技术演示      OpenAI: "onyx", Gemini: "Charon"  清晰、权威
教程            OpenAI: "nova", Gemini: "Kore"    亲切、耐心
营销            OpenAI: "alloy", Gemini: "Puck"   有活力、吸引人
企业宣传        OpenAI: "echo", Gemini: "Fenrir"  专业、沉稳
叙事类        OpenAI: "fable", Gemini: "Aoede"  富有表现力、悦耳

Sync Point Workflow

同步点工作流

Pre-Production Sync Planning

前期制作同步规划

1. Create rough edit with temporary VO or no audio
2. Mark key visual moments (timestamps)
3. Write narration to hit those marks
4. Calculate total word count vs. duration
5. Adjust pacing or cut visuals to match

Visual Moments to Mark:
├── Actions (clicks, transitions, animations)
├── Reveals (new screens, results, data)
├── Emphasis (key features, benefits)
├── Transitions (scene changes)
└── CTA appearance
1. 创建带临时旁白或无音频的粗剪版本
2. 标记关键视觉时刻(时间戳)
3. 撰写旁白以匹配这些标记点
4. 计算总字数与时长的匹配度
5. 调整节奏或剪辑视觉内容以适配

需标记的视觉时刻:
├── 动作(点击、转场、动画)
├── 揭晓(新界面、结果、数据)
├── 重点(核心功能、优势)
├── 转场(场景切换)
└── CTA出现时刻

Post-Production Sync Adjustment

后期制作同步调整

Problem                     Solution
───────────────────────────────────────────────────────────────
Narration too long          Cut words, not speed up TTS
Narration too short         Add pauses, elaboration
Hard sync misaligned        Adjust video timing or rewrite phrase
Multiple sync misses        Re-evaluate structure
TTS sounds rushed           Lower WPM, add breath pauses
TTS sounds slow             Trim pauses, tighten phrases
问题                     解决方案
───────────────────────────────────────────────────────────────
旁白过长          删减文字,而非加速TTS
旁白过短          添加停顿、补充内容
硬同步错位        调整视频时间或重写语句
多处同步失误        重新评估结构
TTS听起来仓促        降低WPM,添加呼吸停顿
TTS听起来缓慢        删减停顿、精简语句

Quick Reference: Narration Checklist

快速参考:旁白脚本检查清单

Pre-Write:
[ ] Video duration confirmed
[ ] Target WPM selected
[ ] Key sync points identified
[ ] CTA placement decided
[ ] Voice/tone defined

During Write:
[ ] Each scene has timing markers
[ ] Pauses marked for breath/emphasis
[ ] Technical terms phonetically noted
[ ] Contractions used naturally
[ ] Emphasis words identified

Post-Write:
[ ] Word count within target
[ ] All sync points achievable
[ ] TTS test recorded
[ ] Timing validated against video
[ ] CTA clear and actionable
撰写前:
[ ] 已确认视频时长
[ ] 已选定目标WPM
[ ] 已确定关键同步点
[ ] 已决定CTA位置
[ ] 已定义语音/语气

撰写中:
[ ] 每个场景都有时间标记
[ ] 已标记呼吸/强调停顿
[ ] 专业术语已标注音标
[ ] 自然使用缩写形式
[ ] 已确定需强调的词汇

撰写后:
[ ] 字数在目标范围内
[ ] 所有同步点均可实现
[ ] 已录制TTS测试版
[ ] 已对照视频验证时间
[ ] CTA清晰且可执行

Related Skills

相关技能

  • video-pacing
    : Rhythm and timing patterns for video editing
  • audio-language-models
    : TTS providers and voice synthesis
  • demo-producer
    : Full demo video production workflow
  • hook-formulas
    : Attention-grabbing opening patterns
  • copywriting-patterns
    : Persuasive writing techniques
  • video-pacing
    : 视频剪辑的节奏与时间模式
  • audio-language-models
    : TTS提供商与语音合成
  • demo-producer
    : 完整演示视频制作流程
  • hook-formulas
    : 吸引注意力的开场模板
  • copywriting-patterns
    : 说服性写作技巧

References

参考资料

  • Script Templates - Full templates for demo, tutorial, promo videos
  • Timing Markers - Detailed sync point specification
  • Pacing Guidelines - WPM targets and comprehension research
  • 脚本模板 - 演示、教程、推广视频的完整模板
  • 时间标记 - 同步点详细说明
  • 节奏指南 - WPM目标与理解度研究

Capability Details

能力详情

script-writing

script-writing

Keywords: narration, script, voiceover, VO, dialogue, copy Solves:
  • How do I write narration for a demo video?
  • Script templates for video production
  • Writing voice-over copy
关键词: narration, script, voiceover, VO, dialogue, copy 解决问题:
  • 如何为演示视频撰写旁白?
  • 视频制作的脚本模板
  • 撰写配音文案

timing-sync

timing-sync

Keywords: timing, sync, synchronization, markers, cue points Solves:
  • How do I sync narration to video?
  • Timing markers for TTS
  • Frame-accurate voice alignment
关键词: timing, sync, synchronization, markers, cue points 解决问题:
  • 如何实现旁白与视频的同步?
  • TTS的时间标记
  • 帧级精准的语音对齐

pacing

pacing

Keywords: WPM, words per minute, pacing, speed, comprehension Solves:
  • How fast should narration be?
  • Calculating script length for video duration
  • Platform-specific pacing
关键词: WPM, words per minute, pacing, speed, comprehension 解决问题:
  • 旁白语速应该多快?
  • 根据视频时长计算脚本长度
  • 平台专属节奏建议

cta-scripting

cta-scripting

Keywords: CTA, call to action, conversion, engagement Solves:
  • How do I write a video CTA?
  • CTA patterns for different platforms
  • Closing scripts that convert
关键词: CTA, call to action, conversion, engagement 解决问题:
  • 如何为视频撰写CTA?
  • 不同平台的CTA模板
  • 能转化的收尾脚本

tts-optimization

tts-optimization

Keywords: TTS, text-to-speech, voice synthesis, SSML Solves:
  • How do I format scripts for TTS?
  • Making AI voices sound natural
  • TTS markup and emphasis
关键词: TTS, text-to-speech, voice synthesis, SSML 解决问题:
  • 如何为TTS格式化脚本?
  • 让AI语音听起来更自然
  • TTS标记与强调