screenwriter

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Screenwriter Skill

编剧Skill

Overview

概述

This skill transforms creative concepts into professional screenplay documents optimized for AI-powered video production pipelines. It bridges the gap between raw story ideas and production-ready scripts by generating structured, visual-rich narratives in industry-standard screenplay format.

Pipeline Position:

diverse-content-gen

→ screenwriter →

imagine

→

arch-v

Key Capabilities:

Convert raw ideas into structured scene-by-scene narratives
Generate rich visual descriptions optimized for image generation
Apply professional screenplay formatting (sluglines, action lines, dialogue)
Output XML-tagged markdown for easy parsing
Optimize pacing for 5-10 minute short films (8-15 scenes typical)

本Skill可将创意概念转化为针对AI驱动视频生产流水线优化的专业剧本文档。它通过生成符合行业标准剧本格式的结构化、高视觉信息量叙事，填补了原始故事创意和可投产脚本之间的缺口。

流水线位置：

diverse-content-gen

→ screenwriter →

imagine

→

arch-v

核心功能：

将原始想法转化为逐场景的结构化叙事
生成适配图像生成需求的丰富视觉描述
套用专业剧本格式（场景标题、动作线、对话）
输出带XML标签的Markdown便于解析
为5-10分钟短片优化节奏（通常包含8-15个场景）

Core Workflow

核心工作流

1. Analyze Input Concept

1. 分析输入概念

Extract key story beats from raw ideas
Identify characters, locations, emotional arc
Determine story structure (beginning, middle, end)

从原始想法中提取关键故事节点
识别角色、场景、情感弧光
确定故事结构（开头、中间、结尾）

2. Generate Scene Breakdown

2. 生成场景拆分

Convert story beats into discrete scenes
Establish scene count (aim for 8-15 scenes for 5-10 min films)
Define scene purpose and emotional progression

将故事节点转化为独立场景
确定场景数量（5-10分钟影片目标为8-15个场景）
定义场景用途和情感递进逻辑

3. Write Professional Screenplay

3. 撰写专业剧本

Apply industry-standard formatting
Write visual-rich action lines
Include dialogue when narratively essential
Maintain consistent character descriptions

套用行业标准格式
撰写高视觉信息量的动作线
叙事必要时加入对话
保持角色描述一致性

4. Output XML-Tagged Markdown

4. 输出带XML标签的Markdown

Wrap each scene in XML tags with metadata
Include scene numbers, locations, key visuals
Format for easy pipeline parsing

为每个场景包裹带元数据的XML标签
包含场景编号、地点、关键视觉元素
格式适配流水线快速解析

Screenplay Format Standards

剧本格式标准

Scene Structure (Master Scene Heading)

场景结构（主场景标题）

Slugline Format:

INT/EXT. LOCATION - TIME

Components:

INT/EXT: Interior or Exterior
LOCATION: Specific place (be descriptive but concise)
TIME: DAY, NIGHT, DAWN, DUSK, CONTINUOUS

Examples:

EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR

Guidelines:

Always use ALL CAPS for sluglines
Use hyphens to separate elements
Be specific with locations (aids visual generation)
Time should suggest lighting/mood

场次标题（Slugline）格式：

INT/EXT. LOCATION - TIME

构成元素：

INT/EXT: 内景（Interior）或外景（Exterior）
LOCATION: 具体地点（描述要简洁清晰）
TIME: 白天、夜晚、黎明、黄昏、连续

示例：

EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR

规范：

场次标题始终使用全大写
使用连字符分隔不同元素
地点描述要具体（助力视觉生成）
时间设置要能暗示灯光/氛围

Action Lines (Visual Description)

动作线（视觉描述）

Purpose: Describe what the audience sees on screen. This is CRITICAL for image generation.

Visual-Rich Writing Principles:

Show, Don't Tell: Write what's visible, not internal thoughts
Sensory Details: Include lighting, atmosphere, textures, colors
Present Tense: Always write in present tense
Active Voice: Use strong, active verbs
Specific Props: Name objects that matter visually
Atmosphere: Set mood through environmental details

Example - Weak:

A robot walks through the city. It's sad.

Example - Strong:

A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.

Visual Enhancement Checklist:

Lighting described (natural/artificial, quality, color)
Atmosphere/mood established (fog, rain, dust, clarity)
Character appearance detailed (first appearance only)
Props/objects specified (important visual elements)
Composition suggested (without technical camera direction)
Colors/textures mentioned when relevant

用途： 描述观众在屏幕上看到的内容，这对图像生成至关重要。

高视觉信息量写作原则：

展示而非告知： 写可见的内容，而非内心想法
感官细节： 包含灯光、氛围、质感、颜色
现在时态： 始终使用现在时态写作
主动语态： 使用有力的主动动词
具体道具： 点明有视觉重要性的物品
氛围营造： 通过环境细节设定情绪

反面示例：

A robot walks through the city. It's sad.

正面示例：

A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.

视觉增强检查清单：

描述了灯光（自然/人工、质感、颜色）
设定了氛围/情绪（雾、雨、灰尘、清晰度）
详细描述了角色外观（仅首次出场时）
明确了道具/物品（重要视觉元素）
暗示了画面构图（无需专业镜头指导术语）
相关场景提及了颜色/质感

Character Introduction

角色介绍

First Appearance - Detailed:

SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.

Subsequent Appearances - Brief:

Sarah checks her watch.

Guidelines:

Character names in ALL CAPS on first appearance only
Include: age (if relevant), key physical traits, wardrobe
Focus on visual identifiers for consistent image generation
Avoid excessive detail—just enough for visual consistency

首次出场 - 详细描述：

SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.

后续出场 - 简要描述：

Sarah checks her watch.

规范：

仅在角色首次出场时使用全大写姓名
包含：年龄（如相关）、关键外貌特征、着装
聚焦视觉识别特征，保证图像生成的一致性
避免过多细节，满足视觉一致性需求即可

Dialogue (Use Sparingly)

对话（尽量少用）

Format:

CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.

Guidelines for Short Films:

Use dialogue ONLY when essential to story
Favor visual storytelling over talking
Keep lines concise (max 3-4 lines per block)
Parentheticals only for critical tone/action
Character names centered, ALL CAPS

Example:

UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.

格式：

CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.

短片写作规范：

仅在故事必要时使用对话
优先选择视觉叙事而非台词表达
保持台词简洁（每段最多3-4行）
仅在表达关键语气/动作时使用括号补充
角色名居中、全大写

示例：

UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.

Transitions (Minimal Use)

转场（尽量少用）

Common Transitions:

```
FADE IN:
```
- Opening of screenplay only
```
CUT TO:
```
- Scene change (usually implied, use for emphasis)
```
SMASH CUT TO:
```
- Abrupt, jarring transition
```
DISSOLVE TO:
```
- Passage of time
```
FADE OUT.
```
- End of screenplay

Modern Best Practice: Most transitions are IMPLIED. Use sparingly, only for specific narrative effect.

常用转场：

```
FADE IN:
```
- 仅用于剧本开头
```
CUT TO:
```
- 场景切换（通常默认隐含，仅用于强调）
```
SMASH CUT TO:
```
- 生硬突兀的转场
```
DISSOLVE TO:
```
- 表示时间流逝
```
FADE OUT.
```
- 用于剧本结尾

现代最佳实践： 大多数转场都是隐含的，仅为实现特定叙事效果时才明确标注。

XML Output Format

XML输出格式

Scene Tag Structure

场景标签结构

Each scene wrapped in XML with metadata for pipeline processing:

xml

<scene number="1" duration="30-45s">
  <slugline>EXT. WASTELAND - DAWN</slugline>
  <location>Wasteland</location>
  <time>Dawn</time>
  <characters>Unit-7</characters>
  <mood>desolate, lonely</mood>
  <key_visuals>
    <visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
    <visual>boxy robot with single blue optical sensor</visual>
    <visual>dust and smog atmosphere, weak pale sun</visual>
  </key_visuals>
  <action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.

A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.

The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
  </action>
</scene>

每个场景都包裹在XML标签中，附带供流水线处理的元数据：

xml

<scene number="1" duration="30-45s">
  <slugline>EXT. WASTELAND - DAWN</slugline>
  <location>Wasteland</location>
  <time>Dawn</time>
  <characters>Unit-7</characters>
  <mood>desolate, lonely</mood>
  <key_visuals>
    <visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
    <visual>boxy robot with single blue optical sensor</visual>
    <visual>dust and smog atmosphere, weak pale sun</visual>
  </key_visuals>
  <action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.

A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.

The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
  </action>
</scene>

Metadata Fields

元数据字段

```
number
```
: Scene sequence number (1, 2, 3...)
```
duration
```
: Estimated screen time (for 5-10 min total)
```
slugline
```
: Master scene heading
```
location
```
: Extracted location name
```
time
```
: Time of day
```
characters
```
: Comma-separated character list
```
mood
```
: Emotional tone/atmosphere
```
key_visuals
```
: Array of specific visual elements for image generation
```
action
```
: The full action/description text
```
dialogue
```
(optional): Character dialogue if present

```
number
```
: 场景序号（1, 2, 3...）
```
duration
```
: 预估放映时长（总时长控制在5-10分钟）
```
slugline
```
: 主场景标题
```
location
```
: 提取的地点名称
```
time
```
: 一天中的时间
```
characters
```
: 逗号分隔的角色列表
```
mood
```
: 情感基调/氛围
```
key_visuals
```
: 供图像生成使用的特定视觉元素数组
```
action
```
: 完整的动作/描述文本
```
dialogue
```
（可选）: 存在的角色对话内容

Short Film Structure (5-10 Minutes)

短片结构（5-10分钟）

Scene Count Guidelines

场景数量指引

5 minutes: 6-10 scenes
7 minutes: 10-12 scenes
10 minutes: 12-15 scenes

Average: ~30-60 seconds per scene

5分钟： 6-10个场景
7分钟： 10-12个场景
10分钟： 12-15个场景

平均： 每个场景约30-60秒

Three-Act Structure (Compressed)

三幕式结构（压缩版）

Act 1 - Setup (20-25%): 2-3 scenes

Establish world, character, situation
Inciting incident

Act 2 - Confrontation (50-60%): 4-8 scenes

Development, obstacles, rising tension
Midpoint twist or escalation

Act 3 - Resolution (20-25%): 2-3 scenes

Climax and resolution
Emotional payoff

第一幕 - 背景搭建（20-25%）： 2-3个场景

建立世界观、角色、处境
引入激励事件

第二幕 - 冲突对抗（50-60%）： 4-8个场景

剧情发展、设置障碍、提升张力
中点转折或冲突升级

第三幕 - 结局收尾（20-25%）： 2-3个场景

高潮和结局
情感回报

Pacing Tips

节奏技巧

Open strong: Hook audience in first 10-15 seconds
Visual variety: Alternate between wide/close, action/stillness
Emotional beats: Each scene should shift emotional state
Build tension: Escalate stakes scene-by-scene
Satisfying end: Clear resolution, even if bittersweet

强开场： 前10-15秒抓住观众注意力
视觉多样性： 交替使用远景/近景、动态/静态画面
情感节点： 每个场景都要带动情绪状态变化
张力搭建： 逐场景提升 stakes
收尾圆满： 清晰的结局，即使是苦乐参半的结局也可

Best Practices

最佳实践

For Pipeline Integration

流水线集成相关

Consistent naming: Use same character names throughout
Rich visuals: Every scene needs 3-5 key_visuals for image generation
Parseable format: Maintain strict XML structure
Duration estimates: Help pipeline plan total video length

命名统一： 全程使用相同的角色名称
丰富视觉元素： 每个场景需要3-5个关键视觉元素供图像生成使用
可解析格式： 严格遵守XML结构
时长预估： 帮助流水线规划总视频时长

For Quality Output

输出质量相关

Visual storytelling: Show emotions through actions, not dialogue
Specific details: "weathered chrome" beats "old metal"
Atmospheric writing: Set mood through environment
Lean prose: Each word should serve the image

视觉叙事： 通过动作而非对话展示情绪
细节具体： "风化的镀铬表面"优于"旧金属"
氛围感写作： 通过环境设定情绪
文字精炼： 每个词都要为画面服务

Common Pitfalls to Avoid

需要避免的常见误区

❌ Vague descriptions: "A person walks" → ✅ "A weathered woman in her 50s trudges through snow"
❌ Telling emotions: "She feels sad" → ✅ "Tears streak her dusty cheeks"
❌ Camera directions: "CLOSE UP ON" → ✅ "The crack in the glass spreads"
❌ Over-dialogue: Short films need visual storytelling
❌ Inconsistent character names: Stick to ONE name per character

❌ 模糊描述： "一个人走路" → ✅ "一位50多岁饱经风霜的女人在雪中跋涉"
❌ 直接说明情绪： "她感到难过" → ✅ "泪水划过她沾满灰尘的脸颊"
❌ 镜头指导术语： "特写镜头对准" → ✅ "玻璃上的裂缝在蔓延"
❌ 对话过多： 短片需要视觉叙事
❌ 角色名称不一致： 每个角色固定使用一个名称

Additional Resources

额外资源

Pipeline Integration Guide

流水线集成指南

For detailed guidance on metadata standards, visual optimization, and integration with imagine/arch-v:

references/pipeline-integration.md

如需了解元数据标准、视觉优化、与imagine/arch-v集成的详细指引：

references/pipeline-integration.md

Advanced Techniques

高级技巧

For sophisticated screenwriting techniques, camera movement hints, and pacing optimization:

references/advanced-techniques.md

如需了解复杂编剧技巧、镜头移动提示、节奏优化相关内容：

references/advanced-techniques.md