ai-video-production-master
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseAI Video Production Master
AI视频制作专家
Expert in script-to-video production pipelines for Apple Silicon Macs. Specializes in:
- Multiple video approaches: Stock footage, T2V (Sora-style), I2V, hybrid
- Hybrid local/cloud workflows for cost optimization
- Style and character consistency (LoRA, IPAdapter, prompt discipline)
- Motion graphics and synthetic elements (title cards, data viz, lower thirds)
- Artist commissioning for training datasets
- Cloud GPU orchestration (Vast.ai, RunPod)
精通适用于Apple Silicon Mac的脚本转视频制作流程。专长包括:
- 多种视频制作方式:库存素材、T2V(Sora风格)、I2V、混合模式
- 用于成本优化的本地/云端混合工作流
- 风格与角色一致性(LoRA、IPAdapter、提示词规范)
- 动态图形与合成元素(标题卡、数据可视化、下三分之一字幕)
- 用于训练数据集的艺术家委托服务
- 云端GPU编排(Vast.ai、RunPod)
When to Use
适用场景
✅ USE this skill for:
- Script-to-video production pipelines
- Stock footage assembly (InVideo-style workflows)
- Text-to-video generation (Sora, Runway, Pika, Kling)
- Image-to-video animation (Wan I2V, ComfyUI)
- Cloud GPU orchestration (Vast.ai, RunPod, Lambda)
- Motion graphics generation (title cards, lower thirds, data viz)
- LoRA training for character/style consistency
- Artist commissioning for training datasets
- Cost optimization between local and cloud processing
❌ DO NOT use for:
- Real-time video editing → use DaVinci Resolve, Premiere Pro
- Video effects/compositing → use After Effects, Fusion
- Audio production/mixing → use skill
sound-engineer - 3D modeling/animation → use Blender, Maya, or skill
physics-rendering-expert - Static image generation → use or image gen tools
clip-aware-embeddings
✅ 可使用本技能的场景:
- 脚本转视频制作流程
- 库存素材整合(InVideo风格工作流)
- 文本转视频生成(Sora、Runway、Pika、Kling)
- 图片转视频动画(Wan I2V、ComfyUI)
- 云端GPU编排(Vast.ai、RunPod、Lambda)
- 动态图形生成(标题卡、下三分之一字幕、数据可视化)
- 用于角色/风格一致性的LoRA训练
- 用于训练数据集的艺术家委托服务
- 本地与云端处理的成本优化
❌ 不可使用本技能的场景:
- 实时视频编辑 → 请使用DaVinci Resolve、Premiere Pro
- 视频特效/合成 → 请使用After Effects、Fusion
- 音频制作/混音 → 请使用技能
sound-engineer - 3D建模/动画 → 请使用Blender、Maya或技能
physics-rendering-expert - 静态图片生成 → 请使用或图片生成工具
clip-aware-embeddings
Video Generation Approaches
视频生成方案
Choose the right approach based on your content:
根据你的内容选择合适的方案:
Stock Footage (Invideo-style) - RECOMMENDED for most content
库存素材(Invideo风格)- 推荐用于大多数内容
Best for: Educational, corporate, explainers, documentaries
- Uses curated stock libraries (Pexels, Pixabay, Storyblocks)
- Most professional, reliable results
- Fast turnaround (~30 min for full video)
- Script → AI selects matching clips → voiceover + music
bash
python scripts/stock_video_generator.py --script script.txt --style documentary最佳适用场景:教育类、企业类、讲解类、纪录片类
- 使用精选素材库(Pexels、Pixabay、Storyblocks)
- 结果最专业、可靠
- 周转速度快(完整视频约30分钟)
- 流程:脚本 → AI选择匹配素材 → 旁白 + 音乐
bash
python scripts/stock_video_generator.py --script script.txt --style documentaryText-to-Video (Sora-style) - For creative/artistic content
文本转视频(Sora风格)- 用于创意/艺术类内容
Best for: Abstract visuals, creative shorts, unique scenes
- True generative AI (no stock footage)
- Uses: Sora API, Runway Gen-3, Pika, Kling
- Cleaner than I2V (no weird image artifacts)
- Storyboard control for multi-shot narratives
bash
python scripts/t2v_generator.py --prompt "A serene mountain lake at sunset" --provider sora最佳适用场景:抽象视觉效果、创意短片、独特场景
- 纯生成式AI(无库存素材)
- 可使用工具:Sora API、Runway Gen-3、Pika、Kling
- 比I2V效果更清晰(无怪异图片瑕疵)
- 多镜头叙事的分镜控制
bash
python scripts/t2v_generator.py --prompt "A serene mountain lake at sunset" --provider soraImage-to-Video (I2V) - For animating specific images
图片转视频(I2V)- 用于特定图片的动画制作
Best for: Animating logos, concept art, specific compositions
- Animates existing images with subtle motion
- Can look "weird" if source images are AI-generated
- Best with clean, professional source images
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai最佳适用场景:标志动画、概念艺术、特定构图
- 为现有图片添加微妙动画效果
- 如果源图片是AI生成的,效果可能会“怪异”
- 搭配清晰、专业的源图片效果最佳
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastaiHybrid Approach
混合模式
Combine approaches per shot:
- Shot 1-3: Stock footage (b-roll, establishing)
- Shot 4-5: T2V (creative transitions)
- Shot 6-10: Stock footage (talking head, outro)
按镜头组合多种方案:
- 镜头1-3:库存素材(B-roll、开场镜头)
- 镜头4-5:T2V(创意转场)
- 镜头6-10:库存素材(访谈镜头、结尾镜头)
Key Capabilities
核心功能
1. Cost Optimization
1. 成本优化
Compare and recommend the optimal mix of local (M4 Max) vs cloud (H100/A100) processing:
bash
python scripts/cost_calculator.py --shots 10 --duration 5对比并推荐本地(M4 Max)与云端(H100/A100)处理的最优组合:
bash
python scripts/cost_calculator.py --shots 10 --duration 52. Cloud Batch Processing
2. 云端批量处理
Run I2V generation on cloud GPUs for 50x speedup:
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai在云端GPU上运行I2V生成,速度提升50倍:
bash
python scripts/cloud_i2v_batch.py --images ./keyframes --provider vastai3. Motion Graphics Generation
3. 动态图形生成
Create professional title cards, lower thirds, and data visualizations:
bash
python scripts/motion_graphics_generator.py --type title --style deep_glow --title "Your Title"创建专业的标题卡、下三分之一字幕和数据可视化:
bash
python scripts/motion_graphics_generator.py --type title --style deep_glow --title "Your Title"4. Style Consistency
4. 风格一致性
Provide guidance on:
- LoRA training parameters (rank, alpha, learning rate, steps)
- IPAdapter + FaceID for character consistency
- Prompt discipline and trigger words
- Reference image workflows
提供以下方面的指导:
- LoRA训练参数(rank、alpha、学习率、步数)
- IPAdapter + FaceID实现角色一致性
- 提示词规范与触发词
- 参考图片工作流
5. Artist Commissioning
5. 艺术家委托服务
Templates and guidance for:
- Finding artists (ArtStation, Fiverr, Upwork)
- Structuring commission requests
- AI training rights contracts
- Quality control and review processes
提供模板与指导:
- 寻找艺术家的渠道(ArtStation、Fiverr、Upwork)
- 委托需求的结构化撰写
- AI训练授权合同
- 质量控制与审核流程
Files in This Skill
本技能包含的文件
ai-video-production-master/
├── README.md # Comprehensive guide
├── SKILL.md # This file
├── scripts/
│ ├── cost_calculator.py # Cost comparison tool
│ ├── cloud_i2v_batch.py # Cloud batch I2V (Vast.ai/RunPod)
│ ├── stock_video_generator.py # Stock footage assembly (Invideo-style)
│ ├── t2v_generator.py # Text-to-video (Sora/Runway/Pika)
│ └── motion_graphics_generator.py # Title cards, lower thirds
├── workflows/
│ └── comfyui_i2v_optimized.json # Optimized ComfyUI workflow
└── docs/
├── ARTIST_COMMISSIONING_GUIDE.md # Hiring artists
└── contracts/
└── artist_commission_template.md # Contract templateai-video-production-master/
├── README.md # 综合指南
├── SKILL.md # 本文件
├── scripts/
│ ├── cost_calculator.py # 成本对比工具
│ ├── cloud_i2v_batch.py # 云端批量I2V工具(Vast.ai/RunPod)
│ ├── stock_video_generator.py # 库存素材整合工具(Invideo风格)
│ ├── t2v_generator.py # 文本转视频工具(Sora/Runway/Pika)
│ └── motion_graphics_generator.py # 标题卡、下三分之一字幕生成工具
├── workflows/
│ └── comfyui_i2v_optimized.json # 优化后的ComfyUI工作流
└── docs/
├── ARTIST_COMMISSIONING_GUIDE.md # 艺术家雇佣指南
└── contracts/
└── artist_commission_template.md # 委托合同模板Quick Reference
快速参考
Cost Comparison (10-shot video)
成本对比(10镜头视频)
| Approach | Time | Cost | Best For |
|---|---|---|---|
| Stock Footage + AI | 30 min | Free-$20/mo | Educational, corporate |
| Sora (ChatGPT Plus) | 30 min | $20/mo | Creative, unique scenes |
| Full Local I2V (M4 Max) | 15+ hours | $0 | When you need specific images |
| Cloud I2V (RTX 4090) | 30 min | ~$0.50 | Batch I2V processing |
| InVideo Max | 30 min | $48/mo | Full automation |
| Runway Gen-3 | 30 min | ~$15-25 | High-quality T2V |
| 方案 | 时间 | 成本 | 最佳适用场景 |
|---|---|---|---|
| 库存素材 + AI | 30分钟 | 免费-$20/月 | 教育类、企业类 |
| Sora(ChatGPT Plus) | 30分钟 | $20/月 | 创意类、独特场景 |
| 全本地I2V(M4 Max) | 15+小时 | $0 | 需要特定图片的场景 |
| 云端I2V(RTX 4090) | 30分钟 | ~$0.50 | 批量I2V处理 |
| InVideo Max | 30分钟 | $48/月 | 全自动化 |
| Runway Gen-3 | 30分钟 | ~$15-25 | 高质量文本转视频 |
Cloud GPU Pricing
云端GPU定价
| Provider | GPU | $/hr | I2V Time/Clip |
|---|---|---|---|
| Vast.ai | H100 80GB | $1.87 | ~2 min |
| RunPod | H100 80GB | $1.99 | ~2 min |
| RunPod | A100 80GB | $1.74 | ~3 min |
| Lambda | H100 | $2.99 | ~2 min |
| 服务商 | GPU | 每小时费用 | 单段I2V耗时 |
|---|---|---|---|
| Vast.ai | H100 80GB | $1.87 | ~2分钟 |
| RunPod | H100 80GB | $1.99 | ~2分钟 |
| RunPod | A100 80GB | $1.74 | ~3分钟 |
| Lambda | H100 | $2.99 | ~2分钟 |
Motion Graphics Styles
动态图形风格
- - Raw, glitchy, utilitarian
neo_brutalist - - Intense light blooms, layered neons
deep_glow - - Fluid, morphing typography
liquid_motion - - 80s/90s grain and neon
retro_revival - - Frosted glass, depth layers
glass_morphism
- - 原始、故障风、实用主义
neo_brutalist - - 强烈光效、分层霓虹
deep_glow - - 流畅、变形字体
liquid_motion - - 80/90年代颗粒感与霓虹
retro_revival - - 毛玻璃、深度分层
glass_morphism
Dependencies
依赖项
Python packages:
- httpx (for cloud API calls)
- argparse, json, subprocess (stdlib)
External tools:
- FFmpeg (video encoding)
- rsvg-convert or ImageMagick (SVG to PNG)
- ComfyUI (local generation)
Python包:
- httpx(用于云端API调用)
- argparse、json、subprocess(标准库)
外部工具:
- FFmpeg(视频编码)
- rsvg-convert或ImageMagick(SVG转PNG)
- ComfyUI(本地生成)