ai-video-production-master

AI Video Production Master

Expert in script-to-video production pipelines for Apple Silicon Macs. Specializes in:

Multiple video approaches: Stock footage, T2V (Sora-style), I2V, hybrid
Hybrid local/cloud workflows for cost optimization
Style and character consistency (LoRA, IPAdapter, prompt discipline)
Motion graphics and synthetic elements (title cards, data viz, lower thirds)
Artist commissioning for training datasets
Cloud GPU orchestration (Vast.ai, RunPod)

When to Use

✅ USE this skill for:

Script-to-video production pipelines
Stock footage assembly (InVideo-style workflows)
Text-to-video generation (Sora, Runway, Pika, Kling)
Image-to-video animation (Wan I2V, ComfyUI)
Cloud GPU orchestration (Vast.ai, RunPod, Lambda)
Motion graphics generation (title cards, lower thirds, data viz)
LoRA training for character/style consistency
Artist commissioning for training datasets
Cost optimization between local and cloud processing

❌ DO NOT use for:

Real-time video editing → use DaVinci Resolve, Premiere Pro
Video effects/compositing → use After Effects, Fusion
Audio production/mixing → use the `sound-engineer` skill
3D modeling/animation → use Blender, Maya, or the `physics-rendering-expert` skill
Static image generation → use `clip-aware-embeddings` or image gen tools

Video Generation Approaches

Choose the right approach based on your content:

Stock Footage (InVideo-style) - RECOMMENDED for most content

Best for: Educational, corporate, explainers, documentaries

Uses curated stock libraries (Pexels, Pixabay, Storyblocks)
Most professional, reliable results
Fast turnaround (~30 min for full video)
Script → AI selects matching clips → voiceover + music

```bash
python scripts/stock_video_generator.py --script script.txt --style documentary
```
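
The "script → matching clips" step can be sketched as below. This is a minimal illustration, not the contents of `stock_video_generator.py`: the helper names `split_into_scenes` and `search_terms` and the stop-word list are hypothetical, and the actual search against a stock library (Pexels, Pixabay, Storyblocks) is left out.

```python
import re

# Hypothetical stop-word list; a real pipeline would likely use something richer.
STOP_WORDS = {"the", "and", "for", "with", "that", "this", "are", "was", "from"}

def split_into_scenes(script: str) -> list[str]:
    """Treat each blank-line-separated paragraph as one scene."""
    return [p.strip() for p in re.split(r"\n\s*\n", script) if p.strip()]

def search_terms(scene: str, limit: int = 3) -> list[str]:
    """Pick the first few distinct non-stop-words as stock-library search terms."""
    terms: list[str] = []
    for word in re.findall(r"[a-z']+", scene.lower()):
        # Skip very short words and stop words; keep first occurrences only.
        if len(word) > 2 and word not in STOP_WORDS and word not in terms:
            terms.append(word)
        if len(terms) == limit:
            break
    return terms

script = "Solar panels cover the desert.\n\nEngineers inspect a wind turbine."
for number, scene in enumerate(split_into_scenes(script), start=1):
    print(number, search_terms(scene))
```

Each scene's terms would then be used as the query for clip selection, with voiceover and music added afterwards.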

Text-to-Video (Sora-style) - For creative/artistic content

Best for: Abstract visuals, creative shorts, unique scenes

True generative AI (no stock footage)
Uses: Sora API, Runway Gen-3, Pika, Kling
Cleaner than I2V (no weird image artifacts)
Storyboard control for multi-shot narratives

```bash
python scripts/t2v_generator.py --prompt "A serene mountain lake at sunset" --provider sora
```

Image-to-Video (I2V) - For animating specific images

Best for: Animating logos, concept art, specific compositions

Animates existing images with subtle motion
Can look "weird" if source images are AI-generated
Best with clean, professional source images

```bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai
```

Hybrid Approach

Combine approaches per shot:

Shot 1-3: Stock footage (b-roll, establishing)
Shot 4-5: T2V (creative transitions)
Shot 6-10: Stock footage (talking head, outro)
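
One way to make a per-shot plan like this machine-readable is a list of shot ranges that is expanded into a shot-to-approach mapping. The sketch below is only an illustration: the `PLAN` structure and `expand_plan` helper are hypothetical and are not part of the skill's scripts.

```python
from collections import Counter

# The plan above, as (first_shot, last_shot, approach) ranges.
PLAN = [
    (1, 3, "stock"),
    (4, 5, "t2v"),
    (6, 10, "stock"),
]

def expand_plan(plan):
    """Return {shot_number: approach}, rejecting overlapping ranges."""
    shots = {}
    for first, last, approach in plan:
        for shot in range(first, last + 1):
            if shot in shots:
                raise ValueError(f"shot {shot} is assigned twice")
            shots[shot] = approach
    return shots

shots = expand_plan(PLAN)
# Count how many shots each approach has to produce.
print(Counter(shots.values()))
```

With the plan above, eight shots go to stock footage and two to T2V, so each generator can be run once for its own subset.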

Key Capabilities

1. Cost Optimization

Compare and recommend the optimal mix of local (M4 Max) vs cloud (H100/A100) processing:

```bash
python scripts/cost_calculator.py --shots 10 --duration 5
```
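
The kind of trade-off involved can be sketched as follows. This is not the logic of `cost_calculator.py`; it is a rough model under stated assumptions: local generation takes about 90 minutes per shot (15 hours for 10 shots, from the Quick Reference), a cloud H100 takes about 2 minutes per clip at $1.87/hr, and cloud run time is small enough to ignore against the deadline. The `plan_split` function is hypothetical.

```python
import math

LOCAL_MIN_PER_SHOT = 90.0   # assumption: 15 hours / 10 shots on the M4 Max
CLOUD_MIN_PER_SHOT = 2.0    # assumption: ~2 min per clip on an H100
CLOUD_USD_PER_HOUR = 1.87   # assumption: Vast.ai H100 hourly rate

def plan_split(shots: int, deadline_hours: float) -> dict:
    """Keep as many shots local (free) as the deadline allows; send the rest to the cloud."""
    local = min(shots, math.floor(deadline_hours * 60 / LOCAL_MIN_PER_SHOT))
    cloud = shots - local
    cloud_cost = cloud * CLOUD_MIN_PER_SHOT / 60 * CLOUD_USD_PER_HOUR
    return {"local": local, "cloud": cloud, "cloud_cost_usd": round(cloud_cost, 2)}

print(plan_split(shots=10, deadline_hours=3))
```

Under these assumptions, a 3-hour deadline leaves room for 2 local shots and sends 8 to the cloud for about $0.50 of GPU time.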

2. Cloud Batch Processing

Run I2V generation on cloud GPUs for a speedup of up to roughly 50x over local generation on the M4 Max:

```bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai
```
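
Before anything is uploaded, a batch job typically has to collect the keyframes and divide them among GPU workers. The sketch below shows only that step; `find_keyframes` and `split_batches` are hypothetical helpers, not functions from `cloud_i2v_batch.py`, and no Vast.ai or RunPod API calls are shown.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}

def find_keyframes(folder: str) -> list[Path]:
    """Return image files in `folder`, sorted by name so shot order is stable."""
    return sorted(p for p in Path(folder).iterdir() if p.suffix.lower() in IMAGE_SUFFIXES)

def split_batches(items: list, workers: int) -> list[list]:
    """Distribute items round-robin so each worker gets a similar load."""
    if workers < 1:
        raise ValueError("workers must be at least 1")
    return [items[i::workers] for i in range(workers)]

frames = [f"shot_{n:02d}.png" for n in range(1, 6)]
print(split_batches(frames, workers=2))
```

Round-robin assignment keeps batch sizes within one item of each other, which matters when billing is per GPU-hour.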

3. Motion Graphics Generation

Create professional title cards, lower thirds, and data visualizations:

```bash
python scripts/motion_graphics_generator.py --type title --style deep_glow --title "Your Title"
```
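
Since the dependencies include an SVG-to-PNG converter, a title card can plausibly start life as an SVG string. The sketch below is a bare-bones illustration of that idea, not the output of `motion_graphics_generator.py`; the `title_card_svg` function, colors, and font size are all assumptions, and none of the named styles (such as `deep_glow`) are reproduced.

```python
from xml.sax.saxutils import escape

def title_card_svg(title: str, width: int = 1920, height: int = 1080) -> str:
    """Return a minimal SVG title card: dark background, centered white text."""
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}">'
        f'<rect width="100%" height="100%" fill="#101018"/>'
        f'<text x="50%" y="50%" fill="#ffffff" font-size="96" font-family="sans-serif" '
        f'text-anchor="middle" dominant-baseline="middle">{escape(title)}</text>'
        f'</svg>'
    )

svg = title_card_svg("Your Title")
print(len(svg), "characters of SVG")
```

The title is escaped so that characters such as `&` do not break the markup. Saved to a file, the result could be rasterized with, for example, `rsvg-convert title.svg -o title.png`.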

4. Style Consistency

Provide guidance on:

LoRA training parameters (rank, alpha, learning rate, steps)
IPAdapter + FaceID for character consistency
Prompt discipline and trigger words
Reference image workflows
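
Prompt discipline is easiest to enforce in code: build every shot's prompt from the same template so the trigger word and style tags never drift. The sketch below assumes a LoRA trained with the trigger word `mychar_v1` and a made-up set of house style tags; both are hypothetical placeholders.

```python
# Hypothetical house style; replace with the tags used in the training captions.
STYLE_TAGS = ["flat color illustration", "soft rim light", "35mm lens"]

def build_prompt(trigger: str, action: str, style_tags=STYLE_TAGS) -> str:
    """Trigger word first, then the per-shot action, then the fixed style tags."""
    parts = [trigger.strip(), action.strip(), *style_tags]
    return ", ".join(part for part in parts if part)

for action in ["walking through a market", "reading by a window"]:
    print(build_prompt("mychar_v1", action))
```

Only the action varies between shots; everything else stays byte-for-byte identical, which makes inconsistencies easier to trace back to the model rather than the prompt.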

5. Artist Commissioning

Templates and guidance for:

Finding artists (ArtStation, Fiverr, Upwork)
Structuring commission requests
AI training rights contracts
Quality control and review processes

Files in This Skill

```
ai-video-production-master/
├── README.md                          # Comprehensive guide
├── SKILL.md                           # This file
├── scripts/
│   ├── cost_calculator.py             # Cost comparison tool
│   ├── cloud_i2v_batch.py             # Cloud batch I2V (Vast.ai/RunPod)
│   ├── stock_video_generator.py       # Stock footage assembly (InVideo-style)
│   ├── t2v_generator.py               # Text-to-video (Sora/Runway/Pika)
│   └── motion_graphics_generator.py   # Title cards, lower thirds
├── workflows/
│   └── comfyui_i2v_optimized.json     # Optimized ComfyUI workflow
└── docs/
    ├── ARTIST_COMMISSIONING_GUIDE.md  # Hiring artists
    └── contracts/
        └── artist_commission_template.md  # Contract template
```

Quick Reference

Cost Comparison (10-shot video)

| Approach | Time | Cost | Best For |
|---|---|---|---|
| Stock Footage + AI | 30 min | Free-$20/mo | Educational, corporate |
| Sora (ChatGPT Plus) | 30 min | $20/mo | Creative, unique scenes |
| Full Local I2V (M4 Max) | 15+ hours | $0 | When you need specific images |
| Cloud I2V (RTX 4090) | 30 min | ~$0.50 | Batch I2V processing |
| InVideo Max | 30 min | $48/mo | Full automation |
| Runway Gen-3 | 30 min | ~$15-25 | High-quality T2V |

Cloud GPU Pricing

| Provider | GPU | $/hr | I2V Time/Clip |
|---|---|---|---|
| Vast.ai | H100 80GB | $1.87 | ~2 min |
| RunPod | H100 80GB | $1.99 | ~2 min |
| RunPod | A100 80GB | $1.74 | ~3 min |
| Lambda | H100 | $2.99 | ~2 min |
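
Hourly rates alone can mislead, because a cheaper GPU that takes longer per clip may cost more per clip. The arithmetic is `$/hr × minutes per clip / 60`; the sketch below applies it to the rows above. It ignores instance startup, model download, and idle time, so real bills will likely be somewhat higher.

```python
# (provider, gpu, usd_per_hour, minutes_per_clip) copied from the pricing table
PRICING = [
    ("Vast.ai", "H100 80GB", 1.87, 2),
    ("RunPod", "H100 80GB", 1.99, 2),
    ("RunPod", "A100 80GB", 1.74, 3),
    ("Lambda", "H100", 2.99, 2),
]

def cost_per_clip(usd_per_hour: float, minutes_per_clip: float) -> float:
    """GPU cost of one clip, assuming billing proportional to run time."""
    return usd_per_hour * minutes_per_clip / 60

# Print providers from cheapest to most expensive per clip.
for provider, gpu, rate, minutes in sorted(PRICING, key=lambda r: cost_per_clip(r[2], r[3])):
    print(f"{provider:8} {gpu:10} ${cost_per_clip(rate, minutes):.3f} per clip")
```

On these figures the A100 has the lowest hourly rate but works out to about $0.087 per clip, versus about $0.062 on the Vast.ai H100.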

Motion Graphics Styles

`neo_brutalist` - Raw, glitchy, utilitarian
`deep_glow` - Intense light blooms, layered neons
`liquid_motion` - Fluid, morphing typography
`retro_revival` - 80s/90s grain and neon
`glass_morphism` - Frosted glass, depth layers

Dependencies

Python packages:

httpx (for cloud API calls)
argparse, json, subprocess (stdlib)

External tools:

FFmpeg (video encoding)
rsvg-convert or ImageMagick (SVG to PNG)
ComfyUI (local generation)
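
As an example of how FFmpeg typically fits into such a pipeline, the sketch below prepares a final assembly step using FFmpeg's concat demuxer. It only builds the list file contents and the command line; nothing is executed, and the file names are placeholders. Whether the skill's scripts assemble clips this way is an assumption.

```python
import shlex

def concat_list(clips: list[str]) -> str:
    """Build the text the concat demuxer reads: one `file '...'` line per clip.

    Paths containing single quotes would need extra escaping, which is omitted here.
    """
    return "".join(f"file '{clip}'\n" for clip in clips)

def concat_command(list_path: str, output: str) -> list[str]:
    # -c copy avoids re-encoding; it requires all clips to share codec and parameters.
    return ["ffmpeg", "-f", "concat", "-safe", "0", "-i", list_path, "-c", "copy", output]

print(concat_list(["shot_01.mp4", "shot_02.mp4"]), end="")
print(shlex.join(concat_command("clips.txt", "final.mp4")))
```

The returned argument list can be passed to `subprocess.run` once the list file has been written to `clips.txt`.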
