ai-video-production-master

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

AI Video Production Master

AI视频制作专家

Expert in script-to-video production pipelines for Apple Silicon Macs. Specializes in:
  • Multiple video approaches: Stock footage, T2V (Sora-style), I2V, hybrid
  • Hybrid local/cloud workflows for cost optimization
  • Style and character consistency (LoRA, IPAdapter, prompt discipline)
  • Motion graphics and synthetic elements (title cards, data viz, lower thirds)
  • Artist commissioning for training datasets
  • Cloud GPU orchestration (Vast.ai, RunPod)
精通适用于Apple Silicon Mac的脚本转视频制作流程。专长包括:
  • 多种视频制作方式:库存素材、T2V(Sora风格)、I2V、混合模式
  • 用于成本优化的本地/云端混合工作流
  • 风格与角色一致性(LoRA、IPAdapter、提示词规范)
  • 动态图形与合成元素(标题卡、数据可视化、下三分之一字幕)
  • 用于训练数据集的艺术家委托服务
  • 云端GPU编排(Vast.ai、RunPod)

When to Use

适用场景

USE this skill for:
  • Script-to-video production pipelines
  • Stock footage assembly (InVideo-style workflows)
  • Text-to-video generation (Sora, Runway, Pika, Kling)
  • Image-to-video animation (Wan I2V, ComfyUI)
  • Cloud GPU orchestration (Vast.ai, RunPod, Lambda)
  • Motion graphics generation (title cards, lower thirds, data viz)
  • LoRA training for character/style consistency
  • Artist commissioning for training datasets
  • Cost optimization between local and cloud processing
DO NOT use for:
  • Real-time video editing → use DaVinci Resolve, Premiere Pro
  • Video effects/compositing → use After Effects, Fusion
  • Audio production/mixing → use
    sound-engineer
    skill
  • 3D modeling/animation → use Blender, Maya, or
    physics-rendering-expert
    skill
  • Static image generation → use
    clip-aware-embeddings
    or image gen tools
可使用本技能的场景:
  • 脚本转视频制作流程
  • 库存素材整合(InVideo风格工作流)
  • 文本转视频生成(Sora、Runway、Pika、Kling)
  • 图片转视频动画(Wan I2V、ComfyUI)
  • 云端GPU编排(Vast.ai、RunPod、Lambda)
  • 动态图形生成(标题卡、下三分之一字幕、数据可视化)
  • 用于角色/风格一致性的LoRA训练
  • 用于训练数据集的艺术家委托服务
  • 本地与云端处理的成本优化
不可使用本技能的场景:
  • 实时视频编辑 → 请使用DaVinci Resolve、Premiere Pro
  • 视频特效/合成 → 请使用After Effects、Fusion
  • 音频制作/混音 → 请使用
    sound-engineer
    技能
  • 3D建模/动画 → 请使用Blender、Maya或
    physics-rendering-expert
    技能
  • 静态图片生成 → 请使用
    clip-aware-embeddings
    或图片生成工具

Video Generation Approaches

视频生成方案

Choose the right approach based on your content:
根据你的内容选择合适的方案:

Stock Footage (Invideo-style) - RECOMMENDED for most content

库存素材(Invideo风格)- 推荐用于大多数内容

Best for: Educational, corporate, explainers, documentaries
  • Uses curated stock libraries (Pexels, Pixabay, Storyblocks)
  • Most professional, reliable results
  • Fast turnaround (~30 min for full video)
  • Script → AI selects matching clips → voiceover + music
bash
python scripts/stock_video_generator.py --script script.txt --style documentary
最佳适用场景:教育类、企业类、讲解类、纪录片类
  • 使用精选素材库(Pexels、Pixabay、Storyblocks)
  • 结果最专业、可靠
  • 周转速度快(完整视频约30分钟)
  • 流程:脚本 → AI选择匹配素材 → 旁白 + 音乐
bash
python scripts/stock_video_generator.py --script script.txt --style documentary

Text-to-Video (Sora-style) - For creative/artistic content

文本转视频(Sora风格)- 用于创意/艺术类内容

Best for: Abstract visuals, creative shorts, unique scenes
  • True generative AI (no stock footage)
  • Uses: Sora API, Runway Gen-3, Pika, Kling
  • Cleaner than I2V (no weird image artifacts)
  • Storyboard control for multi-shot narratives
bash
python scripts/t2v_generator.py --prompt "A serene mountain lake at sunset" --provider sora
最佳适用场景:抽象视觉效果、创意短片、独特场景
  • 纯生成式AI(无库存素材)
  • 可使用工具:Sora API、Runway Gen-3、Pika、Kling
  • 比I2V效果更清晰(无怪异图片瑕疵)
  • 多镜头叙事的分镜控制
bash
python scripts/t2v_generator.py --prompt "A serene mountain lake at sunset" --provider sora

Image-to-Video (I2V) - For animating specific images

图片转视频(I2V)- 用于特定图片的动画制作

Best for: Animating logos, concept art, specific compositions
  • Animates existing images with subtle motion
  • Can look "weird" if source images are AI-generated
  • Best with clean, professional source images
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai
最佳适用场景:标志动画、概念艺术、特定构图
  • 为现有图片添加微妙动画效果
  • 如果源图片是AI生成的,效果可能会“怪异”
  • 搭配清晰、专业的源图片效果最佳
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai

Hybrid Approach

混合模式

Combine approaches per shot:
  • Shot 1-3: Stock footage (b-roll, establishing)
  • Shot 4-5: T2V (creative transitions)
  • Shot 6-10: Stock footage (talking head, outro)
按镜头组合多种方案:
  • 镜头1-3:库存素材(B-roll、开场镜头)
  • 镜头4-5:T2V(创意转场)
  • 镜头6-10:库存素材(访谈镜头、结尾镜头)

Key Capabilities

核心功能

1. Cost Optimization

1. 成本优化

Compare and recommend the optimal mix of local (M4 Max) vs cloud (H100/A100) processing:
bash
python scripts/cost_calculator.py --shots 10 --duration 5
对比并推荐本地(M4 Max)与云端(H100/A100)处理的最优组合:
bash
python scripts/cost_calculator.py --shots 10 --duration 5

2. Cloud Batch Processing

2. 云端批量处理

Run I2V generation on cloud GPUs for 50x speedup:
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai
在云端GPU上运行I2V生成,速度提升50倍:
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai

3. Motion Graphics Generation

3. 动态图形生成

Create professional title cards, lower thirds, and data visualizations:
bash
python scripts/motion_graphics_generator.py --type title --style deep_glow --title "Your Title"
创建专业的标题卡、下三分之一字幕和数据可视化:
bash
python scripts/motion_graphics_generator.py --type title --style deep_glow --title "Your Title"

4. Style Consistency

4. 风格一致性

Provide guidance on:
  • LoRA training parameters (rank, alpha, learning rate, steps)
  • IPAdapter + FaceID for character consistency
  • Prompt discipline and trigger words
  • Reference image workflows
提供以下方面的指导:
  • LoRA训练参数(rank、alpha、学习率、步数)
  • IPAdapter + FaceID实现角色一致性
  • 提示词规范与触发词
  • 参考图片工作流

5. Artist Commissioning

5. 艺术家委托服务

Templates and guidance for:
  • Finding artists (ArtStation, Fiverr, Upwork)
  • Structuring commission requests
  • AI training rights contracts
  • Quality control and review processes
提供模板与指导:
  • 寻找艺术家的渠道(ArtStation、Fiverr、Upwork)
  • 委托需求的结构化撰写
  • AI训练授权合同
  • 质量控制与审核流程

Files in This Skill

本技能包含的文件

ai-video-production-master/
├── README.md                          # Comprehensive guide
├── SKILL.md                           # This file
├── scripts/
│   ├── cost_calculator.py             # Cost comparison tool
│   ├── cloud_i2v_batch.py             # Cloud batch I2V (Vast.ai/RunPod)
│   ├── stock_video_generator.py       # Stock footage assembly (Invideo-style)
│   ├── t2v_generator.py               # Text-to-video (Sora/Runway/Pika)
│   └── motion_graphics_generator.py   # Title cards, lower thirds
├── workflows/
│   └── comfyui_i2v_optimized.json     # Optimized ComfyUI workflow
└── docs/
    ├── ARTIST_COMMISSIONING_GUIDE.md  # Hiring artists
    └── contracts/
        └── artist_commission_template.md  # Contract template
ai-video-production-master/
├── README.md                          # 综合指南
├── SKILL.md                           # 本文件
├── scripts/
│   ├── cost_calculator.py             # 成本对比工具
│   ├── cloud_i2v_batch.py             # 云端批量I2V工具(Vast.ai/RunPod)
│   ├── stock_video_generator.py       # 库存素材整合工具(Invideo风格)
│   ├── t2v_generator.py               # 文本转视频工具(Sora/Runway/Pika)
│   └── motion_graphics_generator.py   # 标题卡、下三分之一字幕生成工具
├── workflows/
│   └── comfyui_i2v_optimized.json     # 优化后的ComfyUI工作流
└── docs/
    ├── ARTIST_COMMISSIONING_GUIDE.md  # 艺术家雇佣指南
    └── contracts/
        └── artist_commission_template.md  # 委托合同模板

Quick Reference

快速参考

Cost Comparison (10-shot video)

成本对比(10镜头视频)

ApproachTimeCostBest For
Stock Footage + AI30 minFree-$20/moEducational, corporate
Sora (ChatGPT Plus)30 min$20/moCreative, unique scenes
Full Local I2V (M4 Max)15+ hours$0When you need specific images
Cloud I2V (RTX 4090)30 min~$0.50Batch I2V processing
InVideo Max30 min$48/moFull automation
Runway Gen-330 min~$15-25High-quality T2V
方案时间成本最佳适用场景
库存素材 + AI30分钟免费-$20/月教育类、企业类
Sora(ChatGPT Plus)30分钟$20/月创意类、独特场景
全本地I2V(M4 Max)15+小时$0需要特定图片的场景
云端I2V(RTX 4090)30分钟~$0.50批量I2V处理
InVideo Max30分钟$48/月全自动化
Runway Gen-330分钟~$15-25高质量文本转视频

Cloud GPU Pricing

云端GPU定价

ProviderGPU$/hrI2V Time/Clip
Vast.aiH100 80GB$1.87~2 min
RunPodH100 80GB$1.99~2 min
RunPodA100 80GB$1.74~3 min
LambdaH100$2.99~2 min
服务商GPU每小时费用单段I2V耗时
Vast.aiH100 80GB$1.87~2分钟
RunPodH100 80GB$1.99~2分钟
RunPodA100 80GB$1.74~3分钟
LambdaH100$2.99~2分钟

Motion Graphics Styles

动态图形风格

  • neo_brutalist
    - Raw, glitchy, utilitarian
  • deep_glow
    - Intense light blooms, layered neons
  • liquid_motion
    - Fluid, morphing typography
  • retro_revival
    - 80s/90s grain and neon
  • glass_morphism
    - Frosted glass, depth layers
  • neo_brutalist
    - 原始、故障风、实用主义
  • deep_glow
    - 强烈光效、分层霓虹
  • liquid_motion
    - 流畅、变形字体
  • retro_revival
    - 80/90年代颗粒感与霓虹
  • glass_morphism
    - 毛玻璃、深度分层

Dependencies

依赖项

Python packages:
  • httpx (for cloud API calls)
  • argparse, json, subprocess (stdlib)
External tools:
  • FFmpeg (video encoding)
  • rsvg-convert or ImageMagick (SVG to PNG)
  • ComfyUI (local generation)
Python包:
  • httpx(用于云端API调用)
  • argparse、json、subprocess(标准库)
外部工具:
  • FFmpeg(视频编码)
  • rsvg-convert或ImageMagick(SVG转PNG)
  • ComfyUI(本地生成)