screenwriter
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseScreenwriter Skill
编剧Skill
Overview
概述
This skill transforms creative concepts into professional screenplay documents optimized for AI-powered video production pipelines. It bridges the gap between raw story ideas and production-ready scripts by generating structured, visual-rich narratives in industry-standard screenplay format.
Pipeline Position: → screenwriter → →
diverse-content-genimaginearch-vKey Capabilities:
- Convert raw ideas into structured scene-by-scene narratives
- Generate rich visual descriptions optimized for image generation
- Apply professional screenplay formatting (sluglines, action lines, dialogue)
- Output XML-tagged markdown for easy parsing
- Optimize pacing for 5-10 minute short films (8-15 scenes typical)
本Skill可将创意概念转化为针对AI驱动视频生产流水线优化的专业剧本文档。它通过生成符合行业标准剧本格式的结构化、高视觉信息量叙事,填补了原始故事创意和可投产脚本之间的缺口。
流水线位置: → screenwriter → →
diverse-content-genimaginearch-v核心功能:
- 将原始想法转化为逐场景的结构化叙事
- 生成适配图像生成需求的丰富视觉描述
- 套用专业剧本格式(场景标题、动作线、对话)
- 输出带XML标签的Markdown便于解析
- 为5-10分钟短片优化节奏(通常包含8-15个场景)
Core Workflow
核心工作流
1. Analyze Input Concept
1. 分析输入概念
- Extract key story beats from raw ideas
- Identify characters, locations, emotional arc
- Determine story structure (beginning, middle, end)
- 从原始想法中提取关键故事节点
- 识别角色、场景、情感弧光
- 确定故事结构(开头、中间、结尾)
2. Generate Scene Breakdown
2. 生成场景拆分
- Convert story beats into discrete scenes
- Establish scene count (aim for 8-15 scenes for 5-10 min films)
- Define scene purpose and emotional progression
- 将故事节点转化为独立场景
- 确定场景数量(5-10分钟影片目标为8-15个场景)
- 定义场景用途和情感递进逻辑
3. Write Professional Screenplay
3. 撰写专业剧本
- Apply industry-standard formatting
- Write visual-rich action lines
- Include dialogue when narratively essential
- Maintain consistent character descriptions
- 套用行业标准格式
- 撰写高视觉信息量的动作线
- 叙事必要时加入对话
- 保持角色描述一致性
4. Output XML-Tagged Markdown
4. 输出带XML标签的Markdown
- Wrap each scene in XML tags with metadata
- Include scene numbers, locations, key visuals
- Format for easy pipeline parsing
- 为每个场景包裹带元数据的XML标签
- 包含场景编号、地点、关键视觉元素
- 格式适配流水线快速解析
Screenplay Format Standards
剧本格式标准
Scene Structure (Master Scene Heading)
场景结构(主场景标题)
Slugline Format:
INT/EXT. LOCATION - TIMEComponents:
- INT/EXT: Interior or Exterior
- LOCATION: Specific place (be descriptive but concise)
- TIME: DAY, NIGHT, DAWN, DUSK, CONTINUOUS
Examples:
EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOURGuidelines:
- Always use ALL CAPS for sluglines
- Use hyphens to separate elements
- Be specific with locations (aids visual generation)
- Time should suggest lighting/mood
场次标题(Slugline)格式:
INT/EXT. LOCATION - TIME构成元素:
- INT/EXT: 内景(Interior)或外景(Exterior)
- LOCATION: 具体地点(描述要简洁清晰)
- TIME: 白天、夜晚、黎明、黄昏、连续
示例:
EXT. WASTELAND - DAWN
INT. ABANDONED SUBWAY STATION - NIGHT
EXT. ROOFTOP GARDEN - GOLDEN HOUR规范:
- 场次标题始终使用全大写
- 使用连字符分隔不同元素
- 地点描述要具体(助力视觉生成)
- 时间设置要能暗示灯光/氛围
Action Lines (Visual Description)
动作线(视觉描述)
Purpose: Describe what the audience sees on screen. This is CRITICAL for image generation.
Visual-Rich Writing Principles:
- Show, Don't Tell: Write what's visible, not internal thoughts
- Sensory Details: Include lighting, atmosphere, textures, colors
- Present Tense: Always write in present tense
- Active Voice: Use strong, active verbs
- Specific Props: Name objects that matter visually
- Atmosphere: Set mood through environmental details
Example - Weak:
A robot walks through the city. It's sad.Example - Strong:
A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.Visual Enhancement Checklist:
- Lighting described (natural/artificial, quality, color)
- Atmosphere/mood established (fog, rain, dust, clarity)
- Character appearance detailed (first appearance only)
- Props/objects specified (important visual elements)
- Composition suggested (without technical camera direction)
- Colors/textures mentioned when relevant
用途: 描述观众在屏幕上看到的内容,这对图像生成至关重要。
高视觉信息量写作原则:
- 展示而非告知: 写可见的内容,而非内心想法
- 感官细节: 包含灯光、氛围、质感、颜色
- 现在时态: 始终使用现在时态写作
- 主动语态: 使用有力的主动动词
- 具体道具: 点明有视觉重要性的物品
- 氛围营造: 通过环境细节设定情绪
反面示例:
A robot walks through the city. It's sad.正面示例:
A BOXY ROBOT (Unit-7, weathered chrome with a single blue optical sensor) rolls through fog-shrouded streets. Neon signs flicker overhead, casting pink and cyan reflections on wet pavement. The robot's movements are slow, deliberate—almost hesitant.视觉增强检查清单:
- 描述了灯光(自然/人工、质感、颜色)
- 设定了氛围/情绪(雾、雨、灰尘、清晰度)
- 详细描述了角色外观(仅首次出场时)
- 明确了道具/物品(重要视觉元素)
- 暗示了画面构图(无需专业镜头指导术语)
- 相关场景提及了颜色/质感
Character Introduction
角色介绍
First Appearance - Detailed:
SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.Subsequent Appearances - Brief:
Sarah checks her watch.Guidelines:
- Character names in ALL CAPS on first appearance only
- Include: age (if relevant), key physical traits, wardrobe
- Focus on visual identifiers for consistent image generation
- Avoid excessive detail—just enough for visual consistency
首次出场 - 详细描述:
SARAH (28, sharp eyes, wearing a weathered leather jacket over faded jeans) enters the frame. Her dark hair is pulled back, revealing a small scar above her left eyebrow.后续出场 - 简要描述:
Sarah checks her watch.规范:
- 仅在角色首次出场时使用全大写姓名
- 包含:年龄(如相关)、关键外貌特征、着装
- 聚焦视觉识别特征,保证图像生成的一致性
- 避免过多细节,满足视觉一致性需求即可
Dialogue (Use Sparingly)
对话(尽量少用)
Format:
CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.Guidelines for Short Films:
- Use dialogue ONLY when essential to story
- Favor visual storytelling over talking
- Keep lines concise (max 3-4 lines per block)
- Parentheticals only for critical tone/action
- Character names centered, ALL CAPS
Example:
UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.格式:
CHARACTER NAME
(parenthetical - optional)
Dialogue goes here.短片写作规范:
- 仅在故事必要时使用对话
- 优先选择视觉叙事而非台词表达
- 保持台词简洁(每段最多3-4行)
- 仅在表达关键语气/动作时使用括号补充
- 角色名居中、全大写
示例:
UNIT-7 (robotic voice, soft)
Organic life form detected.
Probability of survival: low.Transitions (Minimal Use)
转场(尽量少用)
Common Transitions:
- - Opening of screenplay only
FADE IN: - - Scene change (usually implied, use for emphasis)
CUT TO: - - Abrupt, jarring transition
SMASH CUT TO: - - Passage of time
DISSOLVE TO: - - End of screenplay
FADE OUT.
Modern Best Practice: Most transitions are IMPLIED. Use sparingly, only for specific narrative effect.
常用转场:
- - 仅用于剧本开头
FADE IN: - - 场景切换(通常默认隐含,仅用于强调)
CUT TO: - - 生硬突兀的转场
SMASH CUT TO: - - 表示时间流逝
DISSOLVE TO: - - 用于剧本结尾
FADE OUT.
现代最佳实践: 大多数转场都是隐含的,仅为实现特定叙事效果时才明确标注。
XML Output Format
XML输出格式
Scene Tag Structure
场景标签结构
Each scene wrapped in XML with metadata for pipeline processing:
xml
<scene number="1" duration="30-45s">
<slugline>EXT. WASTELAND - DAWN</slugline>
<location>Wasteland</location>
<time>Dawn</time>
<characters>Unit-7</characters>
<mood>desolate, lonely</mood>
<key_visuals>
<visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
<visual>boxy robot with single blue optical sensor</visual>
<visual>dust and smog atmosphere, weak pale sun</visual>
</key_visuals>
<action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.
A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.
The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
</action>
</scene>每个场景都包裹在XML标签中,附带供流水线处理的元数据:
xml
<scene number="1" duration="30-45s">
<slugline>EXT. WASTELAND - DAWN</slugline>
<location>Wasteland</location>
<time>Dawn</time>
<characters>Unit-7</characters>
<mood>desolate, lonely</mood>
<key_visuals>
<visual>post-apocalyptic wasteland with ruined skyscrapers</visual>
<visual>boxy robot with single blue optical sensor</visual>
<visual>dust and smog atmosphere, weak pale sun</visual>
</key_visuals>
<action>
Gray dust covers everything. Skeletal remains of skyscrapers pierce the horizon. The sun, pale and weak, struggles through thick smog.
A ROBOT (Unit-7, boxy frame with single blue optical sensor) rolls across cracked asphalt. Its treads leave marks in the dust—the only sign of life.
The robot stops at a pile of rubble, extending a mechanical arm to sort through debris. Methodical. Purposeful. Lonely.
</action>
</scene>Metadata Fields
元数据字段
- : Scene sequence number (1, 2, 3...)
number - : Estimated screen time (for 5-10 min total)
duration - : Master scene heading
slugline - : Extracted location name
location - : Time of day
time - : Comma-separated character list
characters - : Emotional tone/atmosphere
mood - : Array of specific visual elements for image generation
key_visuals - : The full action/description text
action - (optional): Character dialogue if present
dialogue
- : 场景序号(1, 2, 3...)
number - : 预估放映时长(总时长控制在5-10分钟)
duration - : 主场景标题
slugline - : 提取的地点名称
location - : 一天中的时间
time - : 逗号分隔的角色列表
characters - : 情感基调/氛围
mood - : 供图像生成使用的特定视觉元素数组
key_visuals - : 完整的动作/描述文本
action - (可选): 存在的角色对话内容
dialogue
Short Film Structure (5-10 Minutes)
短片结构(5-10分钟)
Scene Count Guidelines
场景数量指引
- 5 minutes: 6-10 scenes
- 7 minutes: 10-12 scenes
- 10 minutes: 12-15 scenes
Average: ~30-60 seconds per scene
- 5分钟: 6-10个场景
- 7分钟: 10-12个场景
- 10分钟: 12-15个场景
平均: 每个场景约30-60秒
Three-Act Structure (Compressed)
三幕式结构(压缩版)
Act 1 - Setup (20-25%): 2-3 scenes
- Establish world, character, situation
- Inciting incident
Act 2 - Confrontation (50-60%): 4-8 scenes
- Development, obstacles, rising tension
- Midpoint twist or escalation
Act 3 - Resolution (20-25%): 2-3 scenes
- Climax and resolution
- Emotional payoff
第一幕 - 背景搭建(20-25%): 2-3个场景
- 建立世界观、角色、处境
- 引入激励事件
第二幕 - 冲突对抗(50-60%): 4-8个场景
- 剧情发展、设置障碍、提升张力
- 中点转折或冲突升级
第三幕 - 结局收尾(20-25%): 2-3个场景
- 高潮和结局
- 情感回报
Pacing Tips
节奏技巧
- Open strong: Hook audience in first 10-15 seconds
- Visual variety: Alternate between wide/close, action/stillness
- Emotional beats: Each scene should shift emotional state
- Build tension: Escalate stakes scene-by-scene
- Satisfying end: Clear resolution, even if bittersweet
- 强开场: 前10-15秒抓住观众注意力
- 视觉多样性: 交替使用远景/近景、动态/静态画面
- 情感节点: 每个场景都要带动情绪状态变化
- 张力搭建: 逐场景提升 stakes
- 收尾圆满: 清晰的结局,即使是苦乐参半的结局也可
Best Practices
最佳实践
For Pipeline Integration
流水线集成相关
- Consistent naming: Use same character names throughout
- Rich visuals: Every scene needs 3-5 key_visuals for image generation
- Parseable format: Maintain strict XML structure
- Duration estimates: Help pipeline plan total video length
- 命名统一: 全程使用相同的角色名称
- 丰富视觉元素: 每个场景需要3-5个关键视觉元素供图像生成使用
- 可解析格式: 严格遵守XML结构
- 时长预估: 帮助流水线规划总视频时长
For Quality Output
输出质量相关
- Visual storytelling: Show emotions through actions, not dialogue
- Specific details: "weathered chrome" beats "old metal"
- Atmospheric writing: Set mood through environment
- Lean prose: Each word should serve the image
- 视觉叙事: 通过动作而非对话展示情绪
- 细节具体: "风化的镀铬表面"优于"旧金属"
- 氛围感写作: 通过环境设定情绪
- 文字精炼: 每个词都要为画面服务
Common Pitfalls to Avoid
需要避免的常见误区
- ❌ Vague descriptions: "A person walks" → ✅ "A weathered woman in her 50s trudges through snow"
- ❌ Telling emotions: "She feels sad" → ✅ "Tears streak her dusty cheeks"
- ❌ Camera directions: "CLOSE UP ON" → ✅ "The crack in the glass spreads"
- ❌ Over-dialogue: Short films need visual storytelling
- ❌ Inconsistent character names: Stick to ONE name per character
- ❌ 模糊描述: "一个人走路" → ✅ "一位50多岁饱经风霜的女人在雪中跋涉"
- ❌ 直接说明情绪: "她感到难过" → ✅ "泪水划过她沾满灰尘的脸颊"
- ❌ 镜头指导术语: "特写镜头对准" → ✅ "玻璃上的裂缝在蔓延"
- ❌ 对话过多: 短片需要视觉叙事
- ❌ 角色名称不一致: 每个角色固定使用一个名称
Additional Resources
额外资源
Pipeline Integration Guide
流水线集成指南
For detailed guidance on metadata standards, visual optimization, and integration with imagine/arch-v:
- references/pipeline-integration.md
如需了解元数据标准、视觉优化、与imagine/arch-v集成的详细指引:
- references/pipeline-integration.md
Advanced Techniques
高级技巧
For sophisticated screenwriting techniques, camera movement hints, and pacing optimization:
- references/advanced-techniques.md
如需了解复杂编剧技巧、镜头移动提示、节奏优化相关内容:
- references/advanced-techniques.md