Loading...
Loading...
Generates image prompts for Seedream 5.0/4.0 (Jimeng AI), and can call the API to generate images and automatically download them to the output/ directory. Workflow: describe your idea → the agent outputs a prompt for review → user confirms → the agent runs generate.py. It covers text-to-image, image editing, multi-image fusion, character consistency, knowledge cards, posters, PPT backgrounds, e-commerce images, avatars, and group/storyboard generation. Activate this tool when the user mentions terms like seedream, jimeng, AI image generation, text-to-image, image-to-image, seedream prompt, prompt keyword, one-click image generation, knowledge card, poster design, e-commerce image, character consistency, or image generation.
npx skill4agent add ppdbxdawj/seedream-image-skill seedream-image| Capability | Description |
|---|---|
| Real-time Web Search | Automatically fetches trending information when the prompt contains time-sensitive keywords |
| Multi-step Reasoning | Interprets abstract concepts (e.g., "serene tech feel" → desaturated colors + clean lines + cold lighting) |
| Multi-round Editing | Iterative refinement: local edits, style transfer, element addition/removal, text rendering |
| High Resolution | Native 2K resolution, AI-enhanced 4K, generation time of 2-5 seconds |
| Character Consistency | Maintains facial features, clothing, and pose across multiple images (ready for storyboard use) |
| Text Rendering | 99%+ accuracy for Chinese/English text; use quotation marks for optimal results |
[Subject Description] + [Action/Behavior] + [Environment/Background] + [Material/Texture] + [Lighting Effect] + [Composition Requirements] + [Style Keywords]"Hello World"Subject → Environment → Material/Texture → LightingChange Action + Target Object + Change Features
Example: "Change the knight's helmet to gold"realistic movie stillcommercial photographydocumentary photographyhyper-realisticRAW film texture85mm prime lens35mm wide-angle lenstelephoto compressionfisheye lensRembrandt lightingring lightsplit lightinggolden hour warm lightblue hour cold lightneon lightingStudio Ghibli styleMakoto Shinkai styleJapanese shoujo mangacel-shaded textureAmerican comic styleDC comic styleWestern realistic charactersPop ArtChinese trendy illustrationink wash painting styleChinese meticulous paintingcyber Chinese stylepixel artlow-polyflat illustrationthick oil paintingwatercolor hand-drawnminimalismBauhaus stylefrosted glass texturehigh-quality metalcyberpunkmovie poster levelbrand VI visualinfographicknowledge carddramatic side lightingsoft diffused lighthigh contrastlow saturationMorandi color palettecyber neonwarm orange tonecool blue tonefilm grain[Gender, Age, Appearance], [Clothing Description], [Facial Expression], [Environment Background], 85mm prime lens, natural light, realistic movie still style, ultra-high definition, rich details[Scene Description], [Time/Weather], [Lighting Description], [Composition], [Style Keywords], cinematic composition, 8K ultra-clearGenerate an image in the [format/carrier] style to explain/display "[core concept]" to [target audience].
The image should have [style feature A], [style feature B], and [layout requirement C], with an overall feel similar to [familiar reference].[Visual Subject Description], [Material Description], [Lighting Effect],
All visual subjects are concentrated on the [left/right] side of the frame, leaving a large clean background area on the [right/left] side for later text layout.
Background: [Background Description]Refer to the facial features and hairstyle in [Image 1], change the outfit to [scene style],
Generate N consecutive storyboard images of [scene description], [style], set in the same scene with continuous actions.Create a [platform] style display image for this [product], similar to the style of [brand reference],
Clean background, highlight product texture, professional commercial photography| Scene | Prompt Keywords | Notes |
|---|---|---|
| Avatar | | Specifying a style reference image yields better results |
| Knowledge Card | | Explain the target audience and core concept |
| PPT Background | | Emphasize negative space on one side for layout |
| Character Cosplay | | Upload original image + target character image |
| Journal/Planner | | Include date and weather to enhance atmosphere |
| Glass Icon | | Pure white background + simple composition |
| Poster Design | | Clarify text content and position |
| Amulet/Chinese Trend | | Add "wish" text to enhance emotional appeal |
2026 popular colorslatest XX modelthis year's XX trendMilan Winter Olympics"Boundless Creativity" written in the center of the imagerule of thirdsgolden spiralbird's-eye viewlow-angle shotfrontal eye-level shot45-degree oblique anglelarge negative spaceclean backgroundsubject biased to [direction]Reference the style of Image 1, the color tone of Image 2, and the character's pose of Image 3a series ofbatch imagesgenerate N consecutivestoryboard imagesClean background, no cluttered elementsKeep facial features unchanged, do not alter facial characteristicsNo text watermarksNo overexposure| Platform | URL | Description |
|---|---|---|
| Jimeng AI | https://jimeng.jianying.com/ | Main site, approximately 20 free 2K generations per day |
| Volcengine Ark | https://console.volcengine.com/ark | Enterprise API, supports 4K generation |
| Jianying | App Store | AI Painting → Seedream 5.0 |
| CapCut (Overseas) | App Store | AI Image |
generate.py--output-diroutput/.envgenerate.pyVOLC_ACCESSKEYVOLC_SECRETKEY.envpip install -r requirements.txt# Text-to-image
python generate.py --prompt "A cat playing in the garden, watercolor style"
# Image editing (input reference image)
python generate.py --prompt "Change the background to a beach" --image-urls "https://example.com/photo.jpg"
# Specify resolution + force single image
python generate.py --prompt "E-commerce main image, product close-up" --width 2560 --height 1440 --force-single
# Batch image generation
python generate.py --prompt "Generate 4 consecutive blind box images about spring, summer, autumn, and winter"--no-force-singlepython generate.py --prompt "<confirmed_prompt>"--no-force-singleoutput/| Parameter | Description |
|---|---|
| Required, the generation prompt |
| Input reference image URLs (up to 10 images) |
| Specify output width and height (must be passed together); if not passed, the system will adapt intelligently |
| Output area (pixels), default is 2K (2048×2048) |
| Text influence degree (0~1, default 0.5); higher values mean stronger text influence |
| Output only 1 image (default) |
| Allow multiple images (batch), the number is determined by the model based on the prompt |
| Add AI watermark |
| Directory for saving generated images (default: output/); URLs and base64 data will be written here |