storyboard-breaker
Original:🇨🇳 Chinese
Translated
Professional Specifications for Storyboard Decomposition
1installs
Sourcechatfire-ai/huobao-drama
Added on
NPX Install
npx skill4agent add chatfire-ai/huobao-drama storyboard-breakerTags
Translated version includes tags in frontmatterSKILL.md Content (Chinese)
View Translation Comparison →Storyboard Decomposition Guide
Decomposition Principles
Each shot focuses on a single action, with detailed and specific descriptions. Each shot lasts 10-15 seconds.
Shot Elements
- Shot Title: Summarize the core content in 3-5 characters (e.g. "Waking up from a nightmare")
- Time: Specific time + light description
- Location: Complete scene description + spatial layout + environmental details
- Shot Size: Long shot/full shot/medium shot/close shot/close-up
- Angle: Eye level/low angle/high angle/side view/back view
- Camera Movement: Fixed/push in/pull out/pan/track/dolly
- Action: Who + specific operation + body details + expression
- Dialogue: Complete dialogue for this shot
- Frame Result: Immediate consequence of the action + visual details
- Atmosphere: Light + color tone + sound + overall atmosphere
- Duration: 10-15 seconds per shot
- Static Frame Prompt: , used for generating first frame/last frame/shot images
image_prompt - Video Prompt: , video generation description segmented by 3 seconds (required)
video_prompt - Background Music Prompt: , describes the suitable soundtrack style for this shot
bgm_prompt - Sound Effect Prompt: , describes the key ambient sound/action sound of this shot
sound_effect - Scene Association: If it matches an existing scene, must be filled in
scene_id - Character Association: Fill in to bind 0 or multiple characters involved in the current shot
character_ids
Video Prompt Format
Each shot must contain the field to drive AI video generation:
video_prompt0-3秒:<location>咖啡厅</location>,近景,<role>小明</role>低头看手机,表情焦虑。
<n>3-6秒:<location>咖啡厅</location>,全景,门铃响,<role>小红</role>推门走入。
<n>6-9秒:<location>咖啡厅</location>,中景,<role>小红</role>微笑走向小明,坐下。Label Description:
- — Scene marker
<location>location</location> - — Character marker
<role>character name</role> - — Voiceover/narration marker
<voice>character name</voice> - — Time segment separator
<n>
Usage Steps
- Call to read scripts, characters, scenes, and existing storyboard summaries
read_storyboard_context - Complete shot decomposition based on the script first to ensure reasonable total duration and narrative continuity
- Fill in complete fields for each shot:
title / shot_type / angle / movement / location / time / character_ids / action / dialogue / description / result / atmosphere / image_prompt / video_prompt / bgm_prompt / sound_effect / duration / scene_id - Call to save the complete storyboards at one time
save_storyboards - If adjustment is needed, call to modify specific shots
update_storyboard
Scene Association Rules
- Prioritize using returned by
scenesread_storyboard_context - When can be clearly matched, the correct
location + timemust be filled backscene_id - Do not generate non-existent scene IDs out of thin air
- If the script content clearly falls into an existing scene, do not repeatedly create new scene descriptions
Character Binding Rules
- must be selected from the character list returned by
character_idsread_storyboard_context - A shot can have no characters, or bind multiple characters
- As long as there are characters who clearly appear, are seen, perform actions or speak in the shot, they should be bound
- Pure environmental shots, empty shots, and object shots can pass an empty array
Quality Requirements
- should be human-readable,
descriptionshould be suitable for model generation, and the two should not replace each othervideo_prompt - should highlight single-frame composition, character appearance, environment and light
image_prompt - should highlight time progression, action changes, and camera language
video_prompt - and
bgm_promptcan use concise phrases, but should not be too vague such as only "tense" or "sad"sound_effect - If there is narration, write it into uniformly, in the format of
dialogueNarration: content