Loading...
Loading...
Compare original and translation side by side
| # | Question | Why it matters | When to ask |
|---|---|---|---|
| 1 | What's this video for? "Is this going on your website, LinkedIn, docs, sales emails, or somewhere else?" | Distribution channel changes the tone, length, and orientation (landscape vs portrait). | Always — unless the user already specified. |
| 2 | Who's watching? "Developers? Marketing people? Founders? General audience?" | Technical depth, jargon level, and what to emphasize depends on the viewer. | Only if not obvious from the source material. |
| 3 | What's the one takeaway? "If the viewer remembers one thing, what should it be?" | Forces clarity. Prevents the script from trying to cover everything. | Always in Full Producer mode. Skip in Quick Shot if the source material has one clear point. |
| 4 | Any specific visuals? "Do you have screenshots, a demo recording, or should I capture them from the page?" | Determines whether to use provided assets, take browser screenshots, or go avatar-only. | Always — even a "no, just grab them from the docs page" is useful. |
| 5 | What should it feel like? "Quick and punchy? Detailed walkthrough? Casual update?" | Sets the script tone and pacing. | Only if not obvious. A changelog is obviously a "casual update." A website feature page is obviously "polished." |
| 6 | Anything you definitely want included or excluded? "Any specific feature to highlight? Anything to avoid mentioning?" | Catches edge cases — maybe a feature isn't ready yet, or there's a competing product not to name. | Only in Full Producer mode. |
| 序号 | 问题 | 重要性 | 询问时机 |
|---|---|---|---|
| 1 | 视频用途?“视频将发布在官网、LinkedIn、文档、销售邮件还是其他平台?” | 分发渠道会影响视频语气、时长和画面方向(横屏/竖屏)。 | 除非用户已明确说明,否则必问。 |
| 2 | 受众是谁?“开发者?营销人员?创始人?普通用户?” | 技术深度、术语使用和重点内容需匹配受众。 | 仅当无法从素材推断时询问。 |
| 3 | 核心要点?“如果观众只能记住一件事,应该是什么?” | 确保脚本聚焦核心,避免内容冗余。 | 全流程制作模式下必问;快速拍摄模式下,若素材核心明确可跳过。 |
| 4 | 指定视觉素材?“你是否有截图、演示录像,还是需要我从页面截取?” | 决定使用用户提供的素材、浏览器截图还是仅用头像。 | 必问——即使得到“不用,直接从文档页面截取”的回答也很有用。 |
| 5 | 视频风格?“轻快简洁?详细演示?轻松更新?” | 设定脚本语气和节奏。 | 仅当无法推断时询问。更新日志显然是“轻松更新”风格,官网功能页则是“精致专业”风格。 |
| 6 | 必加/必删内容?“是否有特定功能需要突出?有没有需要避免提及的内容?” | 覆盖特殊情况——比如某项功能尚未就绪,或有竞品需规避。 | 仅在全流程制作模式下询问。 |
"I've read through this. Looks like a [changelog/docs/feature] video for [inferred audience]. Two quick things:
- Where is this going — docs page, LinkedIn, or something else?
- Should I grab screenshots from the page, or do you have specific ones?"
"Here's what I'm thinking based on the source material:
- Type: [changelog recap / docs walkthrough / feature explainer]
- Audience: [developers / marketers / general]
- Key takeaway: [one sentence summary]
- Tone: [casual / professional / energetic]
A few questions:
- Where will this video live? (website, LinkedIn, docs, email)
- Is that takeaway right, or should the focus be different?
- Do you have screenshots or should I capture them?
- Anything specific to include or avoid?"
- "What product or feature is this video about?"
- "Who's the audience?"
- "What's the one thing the viewer should take away?"
- "Where will this video be used?"
- "Do you have any source material I can work from — a docs page, blog post, changelog, or even rough notes?"
- "What tone — casual update, polished explainer, or something else?"
“我已阅读素材,这看起来是面向[推断受众]的[更新日志/文档/功能]视频。有两个小问题:
- 视频将发布在哪里——文档页面、LinkedIn还是其他平台?
- 需要我从页面截取截图,还是你有指定的截图?”
“根据素材,我的初步想法如下:
- 类型: [更新日志回顾/文档演示/功能讲解]
- 受众: [开发者/营销人员/普通用户]
- 核心要点: [一句话总结]
- 风格: [轻松/专业/活力]
几个问题:
- 视频将发布在何处?(官网、LinkedIn、文档、邮件)
- 这个核心要点是否准确,还是需要调整重点?
- 你有截图还是需要我截取?
- 是否有特定内容需要添加或删除?”
- “视频是关于哪个产品或功能的?”
- “受众是谁?”
- “观众需要记住的核心信息是什么?”
- “视频将用于何处?”
- “你有可使用的素材吗——文档页面、博客文章、更新日志,甚至粗略笔记?”
- “视频风格——轻松更新、精致讲解还是其他?”
| Discovery answer | Production decision |
|---|---|
| Distribution: LinkedIn | Portrait orientation (1080x1920), 60 sec max, punchy hook in first 3 seconds |
| Distribution: website/docs | Landscape (1920x1080), can be longer (up to 3 min), professional tone |
| Distribution: sales email | Landscape, 30-60 sec max, personalized hook, strong CTA |
| Distribution: internal/investors | Landscape, can be longer, data-heavy, less polished is fine |
| Audience: developers | Show code, use technical language, no marketing fluff |
| Audience: marketers | Show dashboards/results, use business impact language |
| Audience: founders | Keep it high-level, focus on outcomes not features |
| Tone: casual | Conversational script, contractions, "hey" openers |
| Tone: professional | Clean language, no slang, measured pacing |
| Tone: energetic | Shorter sentences, exclamation in hook, faster pacing |
| 调研答案 | 制作决策 |
|---|---|
| 分发渠道:LinkedIn | 竖屏(1080x1920),最长60秒,前3秒设置吸睛钩子 |
| 分发渠道:官网/文档 | 横屏(1920x1080),时长可延长至3分钟,专业风格 |
| 分发渠道:销售邮件 | 横屏,30-60秒,个性化钩子,明确行动号召(CTA) |
| 分发渠道:内部/投资者 | 横屏,时长可更长,数据导向,无需过度精致 |
| 受众:开发者 | 展示代码,使用技术术语,避免营销话术 |
| 受众:营销人员 | 展示仪表盘/结果,使用业务影响相关语言 |
| 受众:创始人 | 聚焦高层视角,重点关注成果而非功能细节 |
| 风格:轻松 | 对话式脚本,使用缩略语,以“嘿”等语气词开场 |
| 风格:专业 | 简洁语言,无俚语,节奏平稳 |
| 风格:活力 | 短句,钩子带感叹,节奏明快 |
AVATAR-CONFIG.mdAVATAR-CONFIG.mdAVATAR-CONFIG.mdAVATAR-CONFIG.md"Before we generate your first video, let's set up your avatar. This is a one-time thing — I'll save your choice for all future videos.How do you want to appear in your videos?
- Pick a stock avatar — I'll show you a few options from HeyGen's library
- Create from your photo — upload a headshot and I'll generate an avatar from it
- Create a digital twin — upload a 15-second video of yourself talking (best quality, looks like you)
- Generate from a description — describe the look you want and I'll generate it
Which option?"
AVATAR-CONFIG.mdAVATAR-CONFIG.md“在生成你的首个视频前,先设置头像。这是一次性操作——我会保存你的选择,后续所有视频都将沿用。你希望在视频中以何种形象呈现?
- 选择库存头像——我会展示HeyGen库中精选的几个选项
- 从照片生成——上传一张头像照片,我将为你生成专属头像
- 创建数字孪生——上传15秒的个人说话视频(质量最佳,形象与本人一致)
- 通过描述生成——描述你想要的形象,我将为你生成
选择哪个选项?”
GET https://api.heygen.com/v2/avatarsGET https://api.heygen.com/v2/avatarsPOST https://api.heygen.com/v3/avatarstype: "photo"POST https://api.heygen.com/v3/avatarstype: "photo""Record a 15-second video of yourself talking naturally — look at the camera, speak clearly, good lighting. This will create the most realistic avatar. HeyGen requires consent verification for digital twins."
POST https://api.heygen.com/v3/avatarstype: "digital_twin"“录制15秒的自然说话视频——看向镜头,清晰发言,光线良好。这将创建最逼真的头像。HeyGen要求进行同意验证以生成数字孪生。”
POST https://api.heygen.com/v3/avatarstype: "digital_twin"POST https://api.heygen.com/v3/avatarstype: "prompt"POST https://api.heygen.com/v3/avatarstype: "prompt""Now let's pick a voice. You can:
- Describe what you want — e.g., 'friendly male voice, warm and conversational' — and I'll generate a few options
- Browse the catalog — I'll show you voices filtered by language and gender
Which do you prefer?"
“现在选择语音。你可以:
- 描述需求——例如‘友好的男性语音,温暖且对话式’,我将生成几个选项
- 浏览目录——我会按语言和性别筛选展示语音
你更喜欢哪种方式?”
POST https://api.heygen.com/v3/voicespreview_audioPOST https://api.heygen.com/v3/voicespreview_audioGET https://api.heygen.com/v2/voicespreview_audioGET https://api.heygen.com/v2/voicespreview_audioAVATAR-CONFIG.mdundefinedAVATAR-CONFIG.mdundefined
After saving, confirm:
> "All set! I've saved your avatar config. From now on, all videos will use [avatar name] with [voice name]. You can update this anytime by editing `AVATAR-CONFIG.md` or asking me to change it."
Then proceed with the video production flow.
保存后,告知用户:
> “设置完成!我已保存你的头像配置。今后所有视频都将使用[头像名称]搭配[语音名称]。你可随时编辑`AVATAR-CONFIG.md`或要求我修改配置。”
随后进入视频制作流程。AVATAR-CONFIG.mdAVATAR-CONFIG.md| Preset Name | Background Color | Best For | Vibe |
|---|---|---|---|
| Clean Dark | | Technical content, developer audience | Professional, focused |
| Soft White | | Product updates, general audience | Clean, approachable |
| Warm Charcoal | | Feature explainers, demos | Modern, sleek |
| Deep Navy | | Investor updates, enterprise content | Authoritative, serious |
| Startup Teal | | Startup announcements, launches | Energetic, fresh |
| Subtle Gradient Dark | | Creative content, brand videos | Polished, distinctive |
| Warm Sand | | Onboarding, welcome videos | Friendly, inviting |
| Cool Gray | | FAQ, help center content | Neutral, informative |
| Bold Black | | Strong opinions, hot takes | Direct, dramatic |
| Forest | | Sustainability, growth content | Natural, grounded |
colorClean Dark| 预设名称 | 背景颜色 | 适用场景 | 氛围 |
|---|---|---|---|
| Clean Dark | | 技术内容、开发者受众 | 专业、专注 |
| Soft White | | 产品更新、普通受众 | 简洁、亲切 |
| Warm Charcoal | | 功能讲解、演示 | 现代、时尚 |
| Deep Navy | | 投资者更新、企业内容 | 权威、严肃 |
| Startup Teal | | 初创公司公告、产品发布 | 活力、新颖 |
| Subtle Gradient Dark | | 创意内容、品牌视频 | 精致、独特 |
| Warm Sand | | 入门引导、欢迎视频 | 友好、有吸引力 |
| Cool Gray | | FAQ、帮助中心内容 | 中立、信息丰富 |
| Bold Black | | 鲜明观点、热门话题 | 直接、有冲击力 |
| Forest | | 可持续发展、增长类内容 | 自然、沉稳 |
colorClean Dark| Output Type | Typical Duration | Scene Structure | Best For |
|---|---|---|---|
| Documentation walkthrough | 60-120 sec | Intro (full avatar) → code/UI sections (circle avatar over screenshots) → closing (full avatar) | Explaining how to use a feature, API, or tool |
| Changelog / product update | 45-90 sec | Hook (full avatar) → feature showcase (circle avatar over product screenshots) → closing (full avatar) | Weekly/biweekly "what we shipped" videos |
| Feature explainer | 60-150 sec | Problem (full avatar) → solution intro → demo walkthrough (circle avatar over screenshots) → why it matters → CTA (full avatar) | Product pages, sales enablement, launch announcements |
| FAQ / common question | 30-60 sec | Question (full avatar) → answer with visual (circle avatar over screenshot) → summary (full avatar) | Help center, embedded in docs |
| Onboarding welcome | 45-90 sec | Welcome (full avatar) → step-by-step setup (circle avatar over screenshots) → next steps (full avatar) | Post-signup onboarding flow |
| Investor update | 120-300 sec | Intro (full avatar) → metrics (circle avatar over charts/dashboards) → highlights → challenges → next month (full avatar) | Monthly investor communication |
| Sales outreach | 30-60 sec | Personal hook (full avatar) → relevant screenshot of their use case → CTA (full avatar) | Cold outreach, post-demo follow-up |
| 输出类型 | 典型时长 | 场景结构 | 适用场景 |
|---|---|---|---|
| 文档演示 | 60-120秒 | 开场(全屏头像)→ 代码/UI环节(圆形头像+截图)→ 结尾(全屏头像) | 讲解功能、API或工具的使用方法 |
| 更新日志/产品更新 | 45-90秒 | 钩子(全屏头像)→ 功能展示(圆形头像+产品截图)→ 结尾(全屏头像) | 每周/双周“新功能发布”视频 |
| 功能讲解 | 60-150秒 | 问题(全屏头像)→ 解决方案介绍 → 演示环节(圆形头像+截图)→ 价值说明 → CTA(全屏头像) | 产品页面、销售赋能、发布公告 |
| FAQ/常见问题 | 30-60秒 | 问题(全屏头像)→ 带视觉的解答(圆形头像+截图)→ 总结(全屏头像) | 帮助中心、嵌入文档 |
| 入门欢迎 | 45-90秒 | 欢迎(全屏头像)→ 分步设置(圆形头像+截图)→ 下一步(全屏头像) | 注册后入门流程 |
| 投资者更新 | 120-300秒 | 开场(全屏头像)→ 数据(圆形头像+图表/仪表盘)→ 亮点 → 挑战 → 下月计划(全屏头像) | 月度投资者沟通 |
| 销售拓展 | 30-60秒 | 个性化钩子(全屏头像)→ 用户场景相关截图 → CTA(全屏头像) | 陌生开发信、演示后跟进 |
| Input Type | What to provide | How the skill uses it |
|---|---|---|
| Text content | Blog post, changelog entry, release notes, documentation page, raw notes, transcript — pasted directly or as a file path | Extracts key messages, writes the script |
| URL | Link to a webpage (docs page, changelog, blog post) | Fetches and reads the content, takes screenshots of the page for backgrounds |
| Screenshots / images | File paths to PNG/JPG images to use as scene backgrounds | Used directly as backgrounds behind the circle avatar |
| Image URLs | Public URLs to images (e.g., from a CDN, S3, or docs page) | Downloaded, uploaded to HeyGen, used as backgrounds |
| GitHub PR link | URL to a GitHub pull request | Reads PR description, commit messages for additional context |
| Video file | File path to a screen recording or demo video (for Loom-to-polished workflow) | Used as video background behind circle avatar |
| 输入类型 | 提供内容 | Skill使用方式 |
|---|---|---|
| 文本内容 | 博客文章、更新日志、发布说明、文档页面、原始笔记、文字稿——直接粘贴或提供文件路径 | 提取核心信息,撰写脚本 |
| URL | 网页链接(文档页面、更新日志、博客文章) | 获取并读取内容,截取页面截图作为背景 |
| 截图/图片 | PNG/JPG图片的文件路径 | 直接用作圆形头像的背景 |
| 图片URL | 图片的公共URL(如CDN、S3或文档页面中的图片) | 下载后上传至HeyGen,用作背景 |
| GitHub PR链接 | GitHub拉取请求的URL | 读取PR描述、提交信息获取额外上下文 |
| 视频文件 | 屏幕录制或演示视频的文件路径(用于Loom转精致视频工作流) | 用作圆形头像的视频背景 |
| Asset Type | Supported Formats | Max Size | Recommended Resolution | Notes |
|---|---|---|---|---|
| Background images | PNG, JPG, JPEG, WebP | 50 MB | 1920x1080 (matches video output) | Images smaller than 1920x1080 will be scaled up with |
| Background videos | MP4, MOV, WebM | 100 MB | 1920x1080 | Play styles: |
| Avatar photo (for photo avatars) | PNG, JPG | 50 MB | Under 2K resolution | Only needed if creating a custom photo avatar |
| 素材类型 | 支持格式 | 最大大小 | 推荐分辨率 | 说明 |
|---|---|---|---|---|
| 背景图片 | PNG、JPG、JPEG、WebP | 50 MB | 1920x1080(匹配视频输出) | 小于1920x1080的图片将通过 |
| 背景视频 | MP4、MOV、WebM | 100 MB | 1920x1080 | 播放样式: |
| 头像照片(用于照片生成头像) | PNG、JPG | 50 MB | 低于2K分辨率 | 仅创建自定义照片头像时需要 |
| Option | Values | Default | Notes |
|---|---|---|---|
| Avatar | Stock avatar name or custom avatar ID | From | User can specify any avatar from their HeyGen account |
| Voice | Stock voice name or custom voice ID | From | User can specify any voice from their HeyGen account |
| Avatar model | | | Avatar IV has better lip sync and natural movement. Avatar III is cheaper (~6x) but more robotic. |
| Visual style | Preset name from the style table | | Sets the background for intro/outro scenes |
| Resolution | | | 4K increases generation time and cost |
| Orientation | | | Portrait (1080x1920) for social-first vertical video |
| Target duration | Any duration in seconds | Auto (based on script length) | Approximate — actual duration depends on TTS pacing |
| 选项 | 取值 | 默认值 | 说明 |
|---|---|---|---|
| Avatar | 库存头像名称或自定义头像ID | 来自 | 用户可指定HeyGen账户中的任意头像 |
| Voice | 库存语音名称或自定义语音ID | 来自 | 用户可指定HeyGen账户中的任意语音 |
| Avatar model | | | Avatar IV的唇形同步和自然动作效果更好;Avatar III成本更低(约为1/6)但更机械。 |
| Visual style | 风格预设表中的名称 | | 设置开场/结尾场景的背景 |
| Resolution | | | 4K分辨率会增加生成时间和成本 |
| Orientation | | | 竖屏(1080x1920)适用于社交平台优先的垂直视频 |
| Target duration | 任意秒数时长 | 自动(基于脚本长度) | 近似值——实际时长取决于文本转语音的语速 |
| Property | Value |
|---|---|
| Format | MP4 |
| Resolution | 1920x1080 (default), 1280x720, or 3840x2160 |
| Frame rate | 25 fps |
| Max scenes | 50 per video |
| Max duration | 30 minutes |
| Max script length | 5,000 characters per scene |
| Delivery | Signed URL (expires in 7 days) + local download |
| Additional outputs | Thumbnail (JPG), GIF preview, SRT subtitles (if captions enabled) |
| 属性 | 取值 |
|---|---|
| 格式 | MP4 |
| 分辨率 | 1920x1080(默认)、1280x720或3840x2160 |
| 帧率 | 25 fps |
| 最大场景数 | 每个视频50个 |
| 最大时长 | 30分钟 |
| 单场景脚本最大长度 | 5000字符 |
| 交付方式 | 签名URL(7天过期)+ 本地下载 |
| 额外输出 | 缩略图(JPG)、GIF预览、SRT字幕(若启用字幕) |
AVATAR-CONFIG.mdAVATAR-CONFIG.md| Source Type | What to extract | Script approach |
|---|---|---|
| Blog post | Core argument, key insights, proof points | Distill 2-3 most compelling points. Don't follow the blog structure — restructure for spoken delivery. Open with the hook, not the intro. |
| Documentation page | Steps, code examples, UI descriptions | Pick the most important workflow. Walk through it step by step. Show screenshots of each step. Keep it practical — "here is how you do this." |
| Changelog / release notes | What changed, why it matters, how to use it | Lead with the impact, not the feature name. "You can now do X" is better than "We shipped feature Y." Show the product UI. Always run changelog enrichment (Step 3b) before writing the script. |
| Product docs / feature brief | Value prop, use cases, how it works | Pick ONE use case. Show the problem-solution arc. Do not try to cover everything. |
| Raw data / metrics | Key numbers, trends, surprises | Lead with the most surprising data point. Build a "here is what this means" narrative. |
| Founder's notes / brain dump | Core ideas, opinions | Clean up into a coherent point of view. Preserve the voice and opinions. |
| Transcript / talk | Key segments, best quotes | Do not re-script from scratch. Pull the strongest 60-90 seconds and tighten. |
| Marketing copy / landing page | Value prop, differentiators | Expand into a "let me explain why this matters" format. Landing pages are compressed — video scripts need room to breathe. |
| 素材类型 | 提取内容 | 脚本撰写方式 |
|---|---|---|
| 博客文章 | 核心论点、关键见解、论据 | 提炼2-3个最具吸引力的要点。不要遵循博客结构——重新组织为口语化表达。以钩子开场,而非引言。 |
| 文档页面 | 步骤、代码示例、UI描述 | 挑选最重要的工作流,分步讲解。展示每个步骤的截图。注重实用性——“这是操作方法”。 |
| 更新日志/发布说明 | 变更内容、重要性、使用方法 | 以影响为切入点,而非功能名称。“你现在可以做X”比“我们发布了功能Y”更好。展示产品UI。撰写脚本前务必执行更新日志增强步骤(步骤3b)。 |
| 产品文档/功能简报 | 价值主张、使用场景、工作原理 | 挑选一个使用场景,展示问题-解决方案的脉络。不要试图覆盖所有内容。 |
| 原始数据/指标 | 关键数字、趋势、意外发现 | 以最令人惊讶的数据点开场,构建“这意味着什么”的叙事。 |
| 创始人笔记/思路草稿 | 核心想法、观点 | 整理为连贯的观点,保留原有的语气和见解。 |
| 文字稿/演讲内容 | 关键片段、最佳引语 | 不要从头重写脚本。选取最精彩的60-90秒内容并精简。 |
| 营销文案/着陆页 | 价值主张、差异化优势 | 扩展为“让我解释为什么这很重要”的形式。着陆页内容紧凑——视频脚本需要更宽松的表达空间。 |
"This looks like a changelog. Is there a GitHub repo behind these changes? I can pull PR details, diffs, and screenshots to make the video more specific and accurate. If it is a private repo, you can either give me access or paste the relevant PR URLs."
“这看起来是更新日志。这些变更背后是否有GitHub仓库?我可以提取PR详情、代码差异和截图,让视频更具体准确。如果是私有仓库,你可以授予我访问权限或粘贴相关PR链接。”
AVATAR-CONFIG.mdScene 1 (full avatar): "Here is how to [do X] in [product]. It takes about [N] steps and you will be done in [time]."
Scene 2-N (circle avatar over screenshots): Walk through each step. One step per scene. "First... Then... Now..."
Final scene (full avatar): "That is it. [Recap the outcome]. Check out the docs at [URL] for more."Scene 1 (full avatar): Hook with impact. "[Product] just shipped [feature]. Here is why it matters."
Scene 2 (circle avatar over product screenshot): What the feature does. Show the UI.
Scene 3 (circle avatar over detail screenshot): The interesting detail or power feature.
Scene 4 (full avatar): Why you should care + CTA.Scene 1 (full avatar): The problem. "If you have ever tried to [pain point], you know it is painful."
Scene 2 (full avatar or screenshot): The solution intro. "That is exactly what [feature] solves."
Scene 3-4 (circle avatar over screenshots): How it works. Walk through the UI.
Scene 5 (full avatar): Why it matters + CTA.Scene 1 (full avatar): The question. "One thing people ask a lot is: [question]?"
Scene 2 (circle avatar over relevant screenshot): The answer with visual context.
Scene 3 (full avatar): Summary + where to learn more.Production Plan — [Video Title]Summary: [N] scenes, estimated [X] seconds, [avatar model], [style preset]
Scene Layout Script Visual 1 Full avatar "Hook text here..." Clean Dark background (#1a1a2e) 2 Circle avatar "Feature explanation..." PR screenshot: [description] — [source URL or file] 3 Circle avatar "Detail walkthrough..." Browser screenshot: [page section description] 4 Full avatar "CTA text here..." Clean Dark background (#1a1a2e) Visual assets I will use:
- Scene 2: [thumbnail or description of the image, where it came from — PR #123, user-provided, browser screenshot of X page]
- Scene 3: [same detail]
Want me to adjust anything before I generate?
AVATAR-CONFIG.md场景1(全屏头像):“这是在[产品]中[完成X操作]的方法。只需[N]个步骤,[时长]即可完成。”
场景2-N(圆形头像+截图):分步讲解,每个步骤对应一个场景。“首先...然后...现在...”
最终场景(全屏头像):“操作完成。[总结成果]。如需更多信息,请查看文档[URL]。”场景1(全屏头像):以影响为钩子。“[产品]刚刚发布了[功能]。这对你来说很重要的原因如下。”
场景2(圆形头像+产品截图):功能介绍,展示UI。
场景3(圆形头像+细节截图):有趣的细节或高级功能。
场景4(全屏头像):重要性说明+CTA。场景1(全屏头像):提出问题。“如果你曾尝试[痛点],就知道有多麻烦。”
场景2(全屏头像或截图):介绍解决方案。“这正是[功能]要解决的问题。”
场景3-4(圆形头像+截图):工作原理,分步展示UI。
场景5(全屏头像):重要性说明+CTA。场景1(全屏头像):提出问题。“人们经常问的一个问题是:[问题]?”
场景2(圆形头像+相关截图):结合视觉背景解答问题。
场景3(全屏头像):总结+学习更多的渠道。制作方案 — [视频标题]摘要: [N]个场景,预计[X]秒,[头像模型],[风格预设]
场景 布局 脚本 视觉素材 1 全屏头像 “钩子文本...” Clean Dark背景 (#1a1a2e) 2 圆形头像 “功能讲解...” PR截图:[描述] — [来源URL或文件] 3 圆形头像 “细节演示...” 浏览器截图:[页面区域描述] 4 全屏头像 “CTA文本...” Clean Dark背景 (#1a1a2e) 将使用的视觉素材:
- 场景2:[图片缩略图或描述,来源——PR #123、用户提供、X页面浏览器截图]
- 场景3:[同上细节]
生成前需要调整什么内容吗?
这能让用户在生成前全面了解脚本和视觉素材。如果视觉素材有误或缺失,用户可立即指出,避免15分钟渲染后再修改。
**快速拍摄模式:** 跳过审批环节,立即生成。{
"type": "avatar",
"avatar_id": "<AVATAR_ID>",
"avatar_style": "normal",
"scale": 1.0,
"use_avatar_iv_model": true
}{
"type": "avatar",
"avatar_id": "<AVATAR_ID>",
"avatar_style": "circle",
"scale": 0.4,
"offset": {"x": 0.35, "y": 0.35},
"use_avatar_iv_model": true
}{"type": "color", "value": "#1a1a2e"}{"type": "image", "image_asset_id": "<ASSET_ID>", "fit": "cover"}{"type": "video", "video_asset_id": "<ASSET_ID>", "play_style": "fit_to_scene"}{"x": 0.3, "y": 0.4}scale: 0.3{
"type": "avatar",
"avatar_id": "<AVATAR_ID>",
"avatar_style": "normal",
"scale": 1.0,
"use_avatar_iv_model": true
}{
"type": "avatar",
"avatar_id": "<AVATAR_ID>",
"avatar_style": "circle",
"scale": 0.4,
"offset": {"x": 0.35, "y": 0.35},
"use_avatar_iv_model": true
}{"type": "color", "value": "#1a1a2e"}{"type": "image", "image_asset_id": "<ASSET_ID>", "fit": "cover"}{"type": "video", "video_asset_id": "<ASSET_ID>", "play_style": "fit_to_scene"}{"x": 0.3, "y": 0.4}scale: 0.3POST https://upload.heygen.com/v1/assetupload.heygen.comapi.heygen.comcurl -X POST "https://upload.heygen.com/v1/asset" \
-H "X-Api-Key: <HEYGEN_API_KEY>" \
-H "Content-Type: image/png" \
--data-binary @screenshot.pngidimage_asset_idPOST https://upload.heygen.com/v1/assetupload.heygen.comapi.heygen.comcurl -X POST "https://upload.heygen.com/v1/asset" \
-H "X-Api-Key: <HEYGEN_API_KEY>" \
-H "Content-Type: image/png" \
--data-binary @screenshot.pngidimage_asset_idPOST https://api.heygen.com/v2/video/generateX-Api-Key: <HEYGEN_API_KEY>
Content-Type: application/json{
"video_inputs": [
{
"character": { ... },
"voice": {
"type": "text",
"voice_id": "<VOICE_ID>",
"input_text": "<SCENE_SCRIPT>"
},
"background": { ... }
}
],
"dimension": {"width": 1920, "height": 1080}
}.envHEYGEN_API_KEYPOST https://api.heygen.com/v2/video/generateX-Api-Key: <HEYGEN_API_KEY>
Content-Type: application/json{
"video_inputs": [
{
"character": { ... },
"voice": {
"type": "text",
"voice_id": "<VOICE_ID>",
"input_text": "<SCENE_SCRIPT>"
},
"background": { ... }
}
],
"dimension": {"width": 1920, "height": 1080}
}.envHEYGEN_API_KEYvideo_idGET https://api.heygen.com/v1/video_status.get?video_id=<VIDEO_ID>completedvideo_urlVideo complete!
- Duration: [X] seconds
- Scenes: [N]
- Avatar model: [III or IV]
- Visual style: [preset name]
- File: [local path]
- Video URL: [signed URL — expires in 7 days]
- Estimated cost: $[X]
Want me to adjust anything and regenerate?video_idGET https://api.heygen.com/v1/video_status.get?video_id=<VIDEO_ID>completedvideo_url视频制作完成!
- 时长:[X]秒
- 场景数:[N]
- 头像模型:[III或IV]
- 视觉风格:[预设名称]
- 文件路径:[本地路径]
- 视频URL:[签名URL——7天过期]
- 预估成本:$[X]
需要调整内容并重新生成吗?video-log.jsonl{
"timestamp": "2026-04-16T10:30:00Z",
"video_id": "<heygen_video_id>",
"mode": "full_producer",
"output_type": "changelog",
"source_type": "changelog_entry",
"avatar_id": "<avatar_id>",
"avatar_model": "avatar_iv",
"voice_id": "<voice_id>",
"style_preset": "clean_dark",
"scenes": 5,
"duration_seconds": 93,
"generation_time_seconds": 510,
"resolution": "1920x1080",
"local_path": "/path/to/video.mp4",
"source_url": "https://posthog.com/changelog?id=2666"
}video-log.jsonl{
"timestamp": "2026-04-16T10:30:00Z",
"video_id": "<heygen_video_id>",
"mode": "full_producer",
"output_type": "changelog",
"source_type": "changelog_entry",
"avatar_id": "<avatar_id>",
"avatar_model": "avatar_iv",
"voice_id": "<voice_id>",
"style_preset": "clean_dark",
"scenes": 5,
"duration_seconds": 93,
"generation_time_seconds": 510,
"resolution": "1920x1080",
"local_path": "/path/to/video.mp4",
"source_url": "https://posthog.com/changelog?id=2666"
}| Avatar Model | Cost per second | 60-sec video | 90-sec video |
|---|---|---|---|
| Avatar III | ~$0.017/sec | ~$1.00 | ~$1.50 |
| Avatar IV (1080p) | ~$0.05/sec | ~$3.00 | ~$4.50 |
| Avatar IV (4K) | ~$0.067/sec | ~$4.00 | ~$6.00 |
| 头像模型 | 每秒成本 | 60秒视频成本 | 90秒视频成本 |
|---|---|---|---|
| Avatar III | ~$0.017/秒 | ~$1.00 | ~$1.50 |
| Avatar IV(1080p) | ~$0.05/秒 | ~$3.00 | ~$4.50 |
| Avatar IV(4K) | ~$0.067/秒 | ~$4.00 | ~$6.00 |
GET https://api.heygen.com/v2/avatarsupload.heygen.comapi.heygen.comGET https://api.heygen.com/v2/avatarsupload.heygen.comapi.heygen.comcurl -s "https://api.heygen.com/v2/avatars" -H "X-Api-Key: <HEYGEN_API_KEY>"curl -s "https://api.heygen.com/v2/voices" -H "X-Api-Key: <HEYGEN_API_KEY>"curl -X POST "https://api.heygen.com/v3/voices" \
-H "X-Api-Key: <HEYGEN_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"description": "friendly male voice, mid-30s, warm and conversational"}'Adrian_public_3_20240312f38a635bee7a4d1f9b0a654a31d050d2curl -s "https://api.heygen.com/v2/avatars" -H "X-Api-Key: <HEYGEN_API_KEY>"curl -s "https://api.heygen.com/v2/voices" -H "X-Api-Key: <HEYGEN_API_KEY>"curl -X POST "https://api.heygen.com/v3/voices" \
-H "X-Api-Key: <HEYGEN_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"description": "friendly male voice, mid-30s, warm and conversational"}'Adrian_public_3_20240312f38a635bee7a4d1f9b0a654a31d050d2