screenwriter

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Screenwriter Skill

编剧Skill

Overview

概述

This skill transforms creative concepts into professional screenplay documents optimized for AI-powered video production pipelines. It bridges the gap between raw story ideas and production-ready scripts by generating structured, visual-rich narratives in industry-standard screenplay format.
Pipeline Position:
diverse-content-gen
screenwriter
imagine
arch-v
Key Capabilities:
  • Convert raw ideas into structured scene-by-scene narratives
  • Generate rich visual descriptions optimized for image generation
  • Apply professional screenplay formatting (sluglines, action lines, dialogue)
  • Output XML-tagged markdown for easy parsing
  • Optimize pacing for 5-10 minute short films (8-15 scenes typical)

本Skill可将创意概念转化为针对AI驱动视频生产流水线优化的专业剧本文档。它通过生成符合行业标准剧本格式的结构化、高视觉信息量叙事,填补了原始故事创意和可投产脚本之间的缺口。
流水线位置:
diverse-content-gen
screenwriter
imagine
arch-v
核心功能:
  • 将原始想法转化为逐场景的结构化叙事
  • 生成适配图像生成需求的丰富视觉描述
  • 套用专业剧本格式(场景标题、动作线、对话)
  • 输出带XML标签的Markdown便于解析
  • 为5-10分钟短片优化节奏(通常包含8-15个场景)

Core Workflow

核心工作流

1. Analyze Input Concept

1. 分析输入概念

  • Extract key story beats from raw ideas
  • Identify characters, locations, emotional arc
  • Determine story structure (beginning, middle, end)
  • 从原始想法中提取关键故事节点
  • 识别角色、场景、情感弧光
  • 确定故事结构(开头、中间、结尾)

2. Generate Scene Breakdown

2. 生成场景拆分

  • Convert story beats into discrete scenes
  • Establish scene count (aim for 8-15 scenes for 5-10 min films)
  • Define scene purpose and emotional progression
  • 将故事节点转化为独立场景
  • 确定场景数量(5-10分钟影片目标为8-15个场景)
  • 定义场景用途和情感递进逻辑

3. Write Professional Screenplay

3. 撰写专业剧本

  • Apply industry-standard formatting
  • Write visual-rich action lines
  • Include dialogue when narratively essential
  • Maintain consistent character descriptions
  • 套用行业标准格式
  • 撰写高视觉信息量的动作线
  • 叙事必要时加入对话
  • 保持角色描述一致性

4. Output XML-Tagged Markdown

4. 输出带XML标签的Markdown

  • Wrap each scene in XML tags with metadata
  • Include scene numbers, locations, key visuals
  • Format for easy pipeline parsing

  • 为每个场景包裹带元数据的XML标签
  • 包含场景编号、地点、关键视觉元素
  • 格式适配流水线快速解析

Screenplay Format Standards

剧本格式标准

Scene Structure (Master Scene Heading)

场景结构(主场景标题)

Slugline Format:
INT/EXT. LOCATION - TIME
Components:
  • INT/EXT: Interior or Exterior
  • LOCATION: Specific place (be descriptive but concise)
  • TIME: DAY, NIGHT, DAWN, DUSK, CONTINUOUS
Examples:
EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR
Guidelines:
  • Always use ALL CAPS for sluglines
  • Use hyphens to separate elements
  • Be specific with locations (aids visual generation)
  • Time should suggest lighting/mood
场次标题(Slugline)格式:
INT/EXT. LOCATION - TIME
构成元素:
  • INT/EXT: 内景(Interior)或外景(Exterior)
  • LOCATION: 具体地点(描述要简洁清晰)
  • TIME: 白天、夜晚、黎明、黄昏、连续
示例:
EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR
规范:
  • 场次标题始终使用全大写
  • 使用连字符分隔不同元素
  • 地点描述要具体(助力视觉生成)
  • 时间设置要能暗示灯光/氛围

Action Lines (Visual Description)

动作线(视觉描述)

Purpose: Describe what the audience sees on screen. This is CRITICAL for image generation.
Visual-Rich Writing Principles:
  1. Show, Don't Tell: Write what's visible, not internal thoughts
  2. Sensory Details: Include lighting, atmosphere, textures, colors
  3. Present Tense: Always write in present tense
  4. Active Voice: Use strong, active verbs
  5. Specific Props: Name objects that matter visually
  6. Atmosphere: Set mood through environmental details
Example - Weak:
A robot walks through the city. It's sad.
Example - Strong:
A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.
Visual Enhancement Checklist:
  • Lighting described (natural/artificial, quality, color)
  • Atmosphere/mood established (fog, rain, dust, clarity)
  • Character appearance detailed (first appearance only)
  • Props/objects specified (important visual elements)
  • Composition suggested (without technical camera direction)
  • Colors/textures mentioned when relevant
用途: 描述观众在屏幕上看到的内容,这对图像生成至关重要。
高视觉信息量写作原则:
  1. 展示而非告知: 写可见的内容,而非内心想法
  2. 感官细节: 包含灯光、氛围、质感、颜色
  3. 现在时态: 始终使用现在时态写作
  4. 主动语态: 使用有力的主动动词
  5. 具体道具: 点明有视觉重要性的物品
  6. 氛围营造: 通过环境细节设定情绪
反面示例:
A robot walks through the city. It's sad.
正面示例:
A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.
视觉增强检查清单:
  • 描述了灯光(自然/人工、质感、颜色)
  • 设定了氛围/情绪(雾、雨、灰尘、清晰度)
  • 详细描述了角色外观(仅首次出场时)
  • 明确了道具/物品(重要视觉元素)
  • 暗示了画面构图(无需专业镜头指导术语)
  • 相关场景提及了颜色/质感

Character Introduction

角色介绍

First Appearance - Detailed:
SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.
Subsequent Appearances - Brief:
Sarah checks her watch.
Guidelines:
  • Character names in ALL CAPS on first appearance only
  • Include: age (if relevant), key physical traits, wardrobe
  • Focus on visual identifiers for consistent image generation
  • Avoid excessive detail—just enough for visual consistency
首次出场 - 详细描述:
SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.
后续出场 - 简要描述:
Sarah checks her watch.
规范:
  • 仅在角色首次出场时使用全大写姓名
  • 包含:年龄(如相关)、关键外貌特征、着装
  • 聚焦视觉识别特征,保证图像生成的一致性
  • 避免过多细节,满足视觉一致性需求即可

Dialogue (Use Sparingly)

对话(尽量少用)

Format:
CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.
Guidelines for Short Films:
  • Use dialogue ONLY when essential to story
  • Favor visual storytelling over talking
  • Keep lines concise (max 3-4 lines per block)
  • Parentheticals only for critical tone/action
  • Character names centered, ALL CAPS
Example:
UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.
格式:
CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.
短片写作规范:
  • 仅在故事必要时使用对话
  • 优先选择视觉叙事而非台词表达
  • 保持台词简洁(每段最多3-4行)
  • 仅在表达关键语气/动作时使用括号补充
  • 角色名居中、全大写
示例:
UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.

Transitions (Minimal Use)

转场(尽量少用)

Common Transitions:
  • FADE IN:
    - Opening of screenplay only
  • CUT TO:
    - Scene change (usually implied, use for emphasis)
  • SMASH CUT TO:
    - Abrupt, jarring transition
  • DISSOLVE TO:
    - Passage of time
  • FADE OUT.
    - End of screenplay
Modern Best Practice: Most transitions are IMPLIED. Use sparingly, only for specific narrative effect.

常用转场:
  • FADE IN:
    - 仅用于剧本开头
  • CUT TO:
    - 场景切换(通常默认隐含,仅用于强调)
  • SMASH CUT TO:
    - 生硬突兀的转场
  • DISSOLVE TO:
    - 表示时间流逝
  • FADE OUT.
    - 用于剧本结尾
现代最佳实践: 大多数转场都是隐含的,仅为实现特定叙事效果时才明确标注。

XML Output Format

XML输出格式

Scene Tag Structure

场景标签结构

Each scene wrapped in XML with metadata for pipeline processing:
xml
<scene number="1" duration="30-45s">
  <slugline>EXT. WASTELAND - DAWN</slugline>
  <location>Wasteland</location>
  <time>Dawn</time>
  <characters>Unit-7</characters>
  <mood>desolate, lonely</mood>
  <key_visuals>
    <visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
    <visual>boxy robot with single blue optical sensor</visual>
    <visual>dust and smog atmosphere, weak pale sun</visual>
  </key_visuals>
  <action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.

A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.

The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
  </action>
</scene>
每个场景都包裹在XML标签中,附带供流水线处理的元数据:
xml
<scene number="1" duration="30-45s">
  <slugline>EXT. WASTELAND - DAWN</slugline>
  <location>Wasteland</location>
  <time>Dawn</time>
  <characters>Unit-7</characters>
  <mood>desolate, lonely</mood>
  <key_visuals>
    <visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
    <visual>boxy robot with single blue optical sensor</visual>
    <visual>dust and smog atmosphere, weak pale sun</visual>
  </key_visuals>
  <action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.

A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.

The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
  </action>
</scene>

Metadata Fields

元数据字段

  • number
    : Scene sequence number (1, 2, 3...)
  • duration
    : Estimated screen time (for 5-10 min total)
  • slugline
    : Master scene heading
  • location
    : Extracted location name
  • time
    : Time of day
  • characters
    : Comma-separated character list
  • mood
    : Emotional tone/atmosphere
  • key_visuals
    : Array of specific visual elements for image generation
  • action
    : The full action/description text
  • dialogue
    (optional): Character dialogue if present

  • number
    : 场景序号(1, 2, 3...)
  • duration
    : 预估放映时长(总时长控制在5-10分钟)
  • slugline
    : 主场景标题
  • location
    : 提取的地点名称
  • time
    : 一天中的时间
  • characters
    : 逗号分隔的角色列表
  • mood
    : 情感基调/氛围
  • key_visuals
    : 供图像生成使用的特定视觉元素数组
  • action
    : 完整的动作/描述文本
  • dialogue
    (可选): 存在的角色对话内容

Short Film Structure (5-10 Minutes)

短片结构(5-10分钟)

Scene Count Guidelines

场景数量指引

  • 5 minutes: 6-10 scenes
  • 7 minutes: 10-12 scenes
  • 10 minutes: 12-15 scenes
Average: ~30-60 seconds per scene
  • 5分钟: 6-10个场景
  • 7分钟: 10-12个场景
  • 10分钟: 12-15个场景
平均: 每个场景约30-60秒

Three-Act Structure (Compressed)

三幕式结构(压缩版)

Act 1 - Setup (20-25%): 2-3 scenes
  • Establish world, character, situation
  • Inciting incident
Act 2 - Confrontation (50-60%): 4-8 scenes
  • Development, obstacles, rising tension
  • Midpoint twist or escalation
Act 3 - Resolution (20-25%): 2-3 scenes
  • Climax and resolution
  • Emotional payoff
第一幕 - 背景搭建(20-25%): 2-3个场景
  • 建立世界观、角色、处境
  • 引入激励事件
第二幕 - 冲突对抗(50-60%): 4-8个场景
  • 剧情发展、设置障碍、提升张力
  • 中点转折或冲突升级
第三幕 - 结局收尾(20-25%): 2-3个场景
  • 高潮和结局
  • 情感回报

Pacing Tips

节奏技巧

  • Open strong: Hook audience in first 10-15 seconds
  • Visual variety: Alternate between wide/close, action/stillness
  • Emotional beats: Each scene should shift emotional state
  • Build tension: Escalate stakes scene-by-scene
  • Satisfying end: Clear resolution, even if bittersweet

  • 强开场: 前10-15秒抓住观众注意力
  • 视觉多样性: 交替使用远景/近景、动态/静态画面
  • 情感节点: 每个场景都要带动情绪状态变化
  • 张力搭建: 逐场景提升 stakes
  • 收尾圆满: 清晰的结局,即使是苦乐参半的结局也可

Best Practices

最佳实践

For Pipeline Integration

流水线集成相关

  • Consistent naming: Use same character names throughout
  • Rich visuals: Every scene needs 3-5 key_visuals for image generation
  • Parseable format: Maintain strict XML structure
  • Duration estimates: Help pipeline plan total video length
  • 命名统一: 全程使用相同的角色名称
  • 丰富视觉元素: 每个场景需要3-5个关键视觉元素供图像生成使用
  • 可解析格式: 严格遵守XML结构
  • 时长预估: 帮助流水线规划总视频时长

For Quality Output

输出质量相关

  • Visual storytelling: Show emotions through actions, not dialogue
  • Specific details: "weathered chrome" beats "old metal"
  • Atmospheric writing: Set mood through environment
  • Lean prose: Each word should serve the image
  • 视觉叙事: 通过动作而非对话展示情绪
  • 细节具体: "风化的镀铬表面"优于"旧金属"
  • 氛围感写作: 通过环境设定情绪
  • 文字精炼: 每个词都要为画面服务

Common Pitfalls to Avoid

需要避免的常见误区

  • Vague descriptions: "A person walks" → ✅ "A weathered woman in her 50s trudges through snow"
  • Telling emotions: "She feels sad" → ✅ "Tears streak her dusty cheeks"
  • Camera directions: "CLOSE UP ON" → ✅ "The crack in the glass spreads"
  • Over-dialogue: Short films need visual storytelling
  • Inconsistent character names: Stick to ONE name per character

  • 模糊描述: "一个人走路" → ✅ "一位50多岁饱经风霜的女人在雪中跋涉"
  • 直接说明情绪: "她感到难过" → ✅ "泪水划过她沾满灰尘的脸颊"
  • 镜头指导术语: "特写镜头对准" → ✅ "玻璃上的裂缝在蔓延"
  • 对话过多: 短片需要视觉叙事
  • 角色名称不一致: 每个角色固定使用一个名称

Additional Resources

额外资源

Pipeline Integration Guide

流水线集成指南

For detailed guidance on metadata standards, visual optimization, and integration with imagine/arch-v:
  • references/pipeline-integration.md
如需了解元数据标准、视觉优化、与imagine/arch-v集成的详细指引:
  • references/pipeline-integration.md

Advanced Techniques

高级技巧

For sophisticated screenwriting techniques, camera movement hints, and pacing optimization:
  • references/advanced-techniques.md
如需了解复杂编剧技巧、镜头移动提示、节奏优化相关内容:
  • references/advanced-techniques.md