npx skills add agentspace-so/runcomfy-skills --skill image-to-video -g

| User intent | Model | Why |
|---|---|---|
| Animate a portrait — keep identity stable | HappyHorse 1.0 I2V | #1 on Artificial Analysis Arena (Elo 1392); strong facial fidelity |
| Product reveal / 360 / macro motion | HappyHorse 1.0 I2V | Geometry preservation + smooth camera moves |
| Native synchronized ambient audio in one pass | HappyHorse 1.0 I2V | In-pass audio synthesis |
| Animate and lip-sync to a custom voiceover track | Wan 2.7 + | Accepts your own MP3/WAV (3–30s, ≤15MB) and drives lip-sync to it |
| Multi-language dub variants (same image, different audio per call) | Wan 2.7 + | Same shot, swap |
| Multi-modal — image + reference video + reference audio together | Seedance 2.0 Pro | Up to 9 image refs, 3 video refs (2–15s each), 3 audio refs |
| Brand-consistent narrative with character ref + scene ref + voice ref | Seedance 2.0 Pro | Image holds identity, video holds scene, audio holds voice |
| Default if unspecified | HappyHorse 1.0 I2V | Best all-round quality + native audio |
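The routing in the table above could be captured in a small shell helper. This is only a sketch: the function name `pick_model` and the intent keywords are hypothetical and not part of the RunComfy CLI; the model IDs are the ones used elsewhere in this document.

```shell
#!/bin/sh
# Hypothetical helper: map a coarse intent keyword to a model ID from the table.
# The keywords (lipsync, dub, multimodal, brand, ...) are illustrative only.
pick_model() {
  case "$1" in
    lipsync|dub)
      # Custom voiceover track or per-language dub variants
      echo "wan-ai/wan-2-7/text-to-video" ;;
    multimodal|brand)
      # Image + reference video + reference audio together
      echo "bytedance/seedance-v2/pro" ;;
    *)
      # Portrait, product, ambient audio, and the "default if unspecified" row
      echo "happyhorse/happyhorse-1-0/image-to-video" ;;
  esac
}

# Example (needs the runcomfy CLI and a valid login, so it is left commented out):
#   runcomfy run "$(pick_model lipsync)" --input "$json" --output-dir "$out"
```

Anything the helper does not recognise falls through to HappyHorse 1.0 I2V, mirroring the default row of the table.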
npm i -g @runcomfy/cli
runcomfy login
RUNCOMFY_TOKEN=<token>

happyhorse/happyhorse-1-0/image-to-video

| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| `image_url` | string | yes | — | JPEG/JPG/PNG/WEBP. Min 300px. Aspect ratio 1:2.5–2.5:1. ≤10MB. |
| `prompt` | string | yes | — | ≤5000 non-CJK or ≤2500 CJK chars. Motion / camera / lighting description. |
| | enum | no | | |
| | int | no | 5 | 3–15 seconds. |
| | int | no | 0 | Reuse for variant comparisons. |
| | bool | no | true | Provider watermark toggle. |
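Some of the constraints in the table can be checked locally before a job is submitted. The sketch below covers only two of them: the accepted image extensions and the 3–15 second range. The function names `has_image_ext` and `in_range` are hypothetical helpers, and checking the extension is only a rough proxy for the actual file format; pixel size, aspect ratio, and file size are not checked here.

```shell
#!/bin/sh
# Returns 0 if the URL or path ends in one of the listed image extensions.
has_image_ext() {
  # Lowercase the name, then drop any query string before matching the suffix.
  name=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  name=${name%%\?*}
  case "$name" in
    *.jpeg|*.jpg|*.png|*.webp) return 0 ;;
    *) return 1 ;;
  esac
}

# Returns 0 if $1 is a whole number within the inclusive range [$2, $3].
in_range() {
  case "$1" in
    ''|*[!0-9]*) return 1 ;;   # reject empty or non-numeric input
  esac
  [ "$1" -ge "$2" ] && [ "$1" -le "$3" ]
}

# Example:
#   has_image_ext "$image_url" && in_range "$seconds" 3 15 || echo "invalid input" >&2
```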
runcomfy run happyhorse/happyhorse-1-0/image-to-video \
--input '{
"image_url": "https://.../portrait.jpg",
"prompt": "Gentle camera drift around the subject'\''s face, subtle breathing motion, identity-stable features, soft natural light."
}' \
--output-dir <absolute/path>

wan-ai/wan-2-7/text-to-video

Note on i2v with Wan 2.7: Wan 2.7's primary i2v animation is not exposed on a dedicated endpoint here. For pure i2v (an image animated by a motion prompt only), prefer HappyHorse I2V. Use Wan 2.7 specifically when the user has a custom audio track to lip-sync to a generated talking-head clip.
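Before routing a job to Wan 2.7, it may help to sanity-check the custom audio track against the stated limits (MP3/WAV, ≤15MB). The sketch below checks a local copy of the file before it is uploaded anywhere; `check_audio_file` is a hypothetical helper, "15MB" is read as 15 × 1024 × 1024 bytes (an assumption), and the 3–30 second length limit is not checked because that would need an audio probing tool.

```shell
#!/bin/sh
# Assumption: "15MB" means 15 * 1024 * 1024 bytes.
MAX_AUDIO_BYTES=$((15 * 1024 * 1024))

# check_audio_file <path>: returns 0 if the file exists, has an .mp3 or .wav
# extension, and is no larger than MAX_AUDIO_BYTES.
check_audio_file() {
  f=$1
  [ -f "$f" ] || { echo "not a file: $f" >&2; return 1; }
  lower=$(printf '%s' "$f" | tr '[:upper:]' '[:lower:]')
  case "$lower" in
    *.mp3|*.wav) ;;
    *) echo "unsupported extension: $f" >&2; return 1 ;;
  esac
  # Arithmetic expansion strips the padding some wc implementations print.
  size=$(($(wc -c < "$f")))
  [ "$size" -le "$MAX_AUDIO_BYTES" ] || { echo "too large: $size bytes" >&2; return 1; }
}

# Example:
#   check_audio_file ./voiceover-en.mp3 && echo "ok to upload"
```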
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| `prompt` | string | yes | — | Up to ~5000 chars. Describe the talking-head shot: framing, lighting, motion. |
| `audio_url` | string | yes (for lip-sync) | — | WAV/MP3, 3–30s, ≤15MB. Drives lip-sync. |
| | enum | no | | |
| | enum | no | | |
| `duration` | enum | no | | 2–15 (whole seconds). Match your audio length. |
| `negative_prompt` | string | no | — | Concrete issues to avoid (e.g. "no subtitles, no flicker"). |
| | int | no | — | Reproducibility. |
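For the multi-language dub case (same shot, a different audio track per call), one `--input` payload per language can be generated in a loop. This is a dry-run sketch that only prints the JSON: `print_dub_payload`, the language codes, and the `example.com` host are placeholders, and the helper does no JSON escaping, so it assumes neither argument contains a double quote or backslash.

```shell
#!/bin/sh
# print_dub_payload <prompt> <audio_url>: emit the --input JSON for one variant.
# No JSON escaping is done; arguments must not contain " or \ characters.
print_dub_payload() {
  printf '{"prompt": "%s", "audio_url": "%s"}\n' "$1" "$2"
}

PROMPT="Medium close-up of a spokesperson, locked tripod, soft key light."
BASE_URL="https://example.com/audio"   # placeholder host, not from the source

# One payload per language; only the audio track changes between calls.
for lang in en es ja; do
  print_dub_payload "$PROMPT" "$BASE_URL/voiceover-$lang.mp3"
done
```

Each printed line could then be passed as the `--input` argument to `runcomfy run wan-ai/wan-2-7/text-to-video`, with a separate `--output-dir` per language.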
runcomfy run wan-ai/wan-2-7/text-to-video \
--input '{
"prompt": "Medium close-up of a confident spokesperson in a softly-lit recording booth, leaning slightly toward the camera, locked tripod, shallow DOF, warm key light from camera-left.",
"audio_url": "https://.../voiceover-en.mp3",
"duration": 12,
"aspect_ratio": "9:16"
}' \
--output-dir <absolute/path>

`negative_prompt` example: "no subtitles, no flicker, no distorted hands"

bytedance/seedance-v2/pro

| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| `prompt` | string | yes | — | CN ≤500 chars OR EN ≤1000 words. |
| `image_url` | array | yes (for i2v) | | 0–9 images. The first is the primary subject. |
| `video_url` | array | no | | 0–3 reference clips (MP4/MOV), 2–15s each. |
| `audio_url` | array | no | | 0–3 reference audio files (WAV/MP3), 2–15s and <15MB each. |
| | enum | no | | |
| `duration` | int | no | 5 | 4–15 (whole seconds). |
| | enum | no | | |
| | bool | no | true | In-pass synchronized speech / SFX / music. |
| | int | no | — | Reproducibility. |
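The per-type reference limits in the table (up to 9 images, 3 videos, 3 audio files) are easy to exceed when the lists are assembled programmatically. A minimal guard might look like the following; `check_refs` is a hypothetical helper that only counts its arguments and does not inspect the files themselves.

```shell
#!/bin/sh
# check_refs <label> <max> [url ...]: fail if more than <max> URLs are given.
check_refs() {
  label=$1
  max=$2
  shift 2
  if [ "$#" -gt "$max" ]; then
    echo "$label: $# references given, limit is $max" >&2
    return 1
  fi
}

# Example, using the limits from the table:
#   check_refs image 9 "$img1" "$img2" &&
#   check_refs video 3 "$clip1" &&
#   check_refs audio 3 "$voice1"
```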
runcomfy run bytedance/seedance-v2/pro \
--input '{
"prompt": "Subject from image 1 walks through the café in video 1, voice tone matches audio 1. Medium close-up, slow push-in, warm light, gentle ambience.",
"image_url": ["https://.../subject.jpg"],
"video_url": ["https://.../cafe-locked-shot.mp4"],
"audio_url": ["https://.../voice-tone.mp3"],
"duration": 8
}' \
--output-dir <absolute/path>

`prompt` example that references inputs by index: "subject from image 1, lighting from video 1, voice from audio 1"

| code | meaning |
|---|---|
| 0 | success |
| 64 | bad CLI args |
| 65 | bad input JSON / schema mismatch |
| 69 | upstream 5xx |
| 75 | retryable: timeout / 429 |
| 77 | not signed in or token rejected |
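Since only exit code 75 is marked retryable in the table, a wrapper script can retry on that code alone and surface every other code unchanged. The sketch below is one way to do that; `run_with_retry` and the `RETRY_DELAY` variable are hypothetical names, and the fixed delay is a simplification (a real script might prefer exponential backoff).

```shell
#!/bin/sh
# run_with_retry <max_attempts> <command> [args ...]
# Re-runs the command only while it exits with 75 (timeout / 429).
# Any other exit code, including 0, is returned immediately.
run_with_retry() {
  max_attempts=$1
  shift
  attempt=1
  while :; do
    "$@"
    code=$?
    if [ "$code" -ne 75 ] || [ "$attempt" -ge "$max_attempts" ]; then
      return "$code"
    fi
    # Fixed delay between attempts; defaults to 5 seconds.
    sleep "${RETRY_DELAY:-5}"
    attempt=$((attempt + 1))
  done
}

# Example (needs the runcomfy CLI, so it is left commented out):
#   run_with_retry 3 runcomfy run "$model_id" --input "$json" --output-dir "$out"
```

Codes 64, 65, and 77 point at problems a retry will not fix (arguments, input JSON, sign-in), which is why the wrapper returns them straight away.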
runcomfy run <model_id>, .runcomfy.net, .runcomfy.com, --output-dir, Ctrl-C

runcomfy login, ~/.config/runcomfy/token.json, RUNCOMFY_TOKEN, --input, model-api.runcomfy.net, *.runcomfy.net, *.runcomfy.com