image-assistant
Original:🇨🇳 Chinese
Translated
1 scripts
Image Assistant - Convert article/module content into 16:9 infographic prompts with a unified style, minimal text, and high readability; first determine "how many images are needed + what each image covers", then condense copy and metaphors, and finally output directly copyable image generation prompts and iterate.
8installs
Added on
NPX Install
npx skill4agent add yunshu0909/yunshu_skillshub image-assistantTags
Translated version includes tags in frontmatterSKILL.md Content (Chinese)
View Translation Comparison →Image Assistant
Trigger Methods
Trigger when the user says something like:
- "Make a graphic for this content / create several supporting images?"
- "Give me two (or multiple) image generation prompts"
- "Too much text, make it more interesting and readable"
- "Package this process into a prompt template/skills"
- "/image " "/配图" "/出图"
Process Overview
| Stage | Name | Goal | Detailed File |
|---|---|---|---|
| 1 | Requirements Clarification (Spec/DoD) | First explore requirements: content/scenario/audience/text density; default size and style; produce a one-sentence restatement and requirement summary | |
| 2 | Image Planning (Chunking → List) | Chunk content → define image list (quantity / what each image covers / recommended template) | |
| 3 | Copy Finalization (Copy Spec) | Finalize "what text to put on the image" word by word (single source of truth) | |
| 4 | Prompt Packaging (Prompt Pack) | Package Copy Spec into copyable prompts; generate batch request packages and batch generate images after user confirmation | |
| 5 | Iterative Polishing | Reduce text, replace metaphors, and improve readability based on feedback | |
Scheduling Rules
How to determine the current stage:
- Requirements not clearly defined (content + scenario + audience + text density) → Stage 1
- Long article that needs chunking, or need to confirm "how many images / what each image covers" → Stage 2
- Image list confirmed, but "exact text for each image" not finalized → Stage 3
- Copy Spec confirmed, need to generate copyable prompts; (optional) batch generate images after prompt confirmation → Stage 4
- User feedbacks "too much text / not good-looking / doesn't fit the cover" → Stage 5 (roll back to Stage 1 to re-lock requirements and text density if necessary)
At the start of each stage:
- Inform the user of the current stage and deliverables for this stage
- Read the corresponding stage file and execute according to steps
Output Specifications (Must Follow)
- Each image has one "core message", do not include explanatory paragraphs in the image
- All Chinese text must be clearly readable: large font size, short sentences with minimal text, avoid dense small text
- Each prompt is output in an independent code block for easy copying
- Default output is 16:9 landscape (unless the user explicitly requests 3:4 comic/portrait)
- Default style: cream paper background + colored pencil watercolor hand-drawn + light doodles, fun but clean (can use )
templates/style-block.md - Once the Stage 3 deliverable (Copy Spec) is confirmed, the copy cannot be modified without permission in Stage 4; only packaging and parameter/constraint completion are allowed
Quick Start (Minimum Input for Users)
Users only need to provide these four items to start:
- Content to be illustrated (can be a paragraph, a section, or an entire article)
- Usage scenario + viewing distance (PPT projection for long-distance viewing / mobile phone for close viewing / poster)
- Target audience (beginners/practitioners/managers/students...)
- Preference: "more minimal and clean" or "higher information density"
Optional supplements (not required):
- What type of images you roughly want: cover/table of contents, single-page overview, lecture explanation, social media poster (I will recommend based on scenario and preference if unsure)
Your delivery order:
- First output: Image list (quantity + one-sentence purpose for each + template recommendation) (Stage 2)
- After user confirmation: Output Copy Spec for each image (finalize text word by word) (Stage 3)
- After Copy Spec confirmation: Output copyable prompts/call packages for each image (Stage 4)
- (Optional) After prompt confirmation: Batch call API to generate images (continuous process within Stage 4)
- Enter iteration when the user says "too much text / not good-looking" (Stage 5)
File Structure
stages/
├── 01-brief.md
├── 02-plan.md
├── 03-copy.md
├── 04-prompts.md
└── 05-iterate.md
templates/
├── style-block.md
├── 16x9-infographic.md
├── 16x9-contrast-2cards.md
├── 16x9-3cards-insights.md
├── 16x9-cover-roadmap.md
├── 16x9-5panel-comic.md
├── api-config.md
├── apimart-curl.md
├── apimart-requests-jsonl.md
└── checklist.md
examples/
└── ai-tools-selection.md
scripts/
├── apimart_batch_generate.py
├── apimart.env.example
└── README.md