image-assistant

Original：🇨🇳 Chinese

Translated

1 scripts

Image Assistant - Convert article/module content into 16:9 infographic prompts with a unified style, minimal text, and high readability; first determine "how many images are needed + what each image covers", then condense copy and metaphors, and finally output directly copyable image generation prompts and iterate.

8installs

Sourceyunshu0909/yunshu_skillshub

Added on2026-02-07

NPX Install

npx skill4agent add yunshu0909/yunshu_skillshub image-assistant

SKILL.md Content (Chinese)

View Translation Comparison →

Image Assistant

Trigger Methods

Trigger when the user says something like:

"Make a graphic for this content / create several supporting images?"
"Give me two (or multiple) image generation prompts"
"Too much text, make it more interesting and readable"
"Package this process into a prompt template/skills"
"/image " "/配图" "/出图"

Process Overview


stages/01-brief.md
stages/02-plan.md
stages/03-copy.md
stages/04-prompts.md
stages/05-iterate.md

Stage	Name	Goal	Detailed File
1	Requirements Clarification (Spec/DoD)	First explore requirements: content/scenario/audience/text density; default size and style; produce a one-sentence restatement and requirement summary	`stages/01-brief.md`
2	Image Planning (Chunking → List)	Chunk content → define image list (quantity / what each image covers / recommended template)	`stages/02-plan.md`
3	Copy Finalization (Copy Spec)	Finalize "what text to put on the image" word by word (single source of truth)	`stages/03-copy.md`
4	Prompt Packaging (Prompt Pack)	Package Copy Spec into copyable prompts; generate batch request packages and batch generate images after user confirmation	`stages/04-prompts.md`
5	Iterative Polishing	Reduce text, replace metaphors, and improve readability based on feedback	`stages/05-iterate.md`

Scheduling Rules

How to determine the current stage:

Requirements not clearly defined (content + scenario + audience + text density) → Stage 1
Long article that needs chunking, or need to confirm "how many images / what each image covers" → Stage 2
Image list confirmed, but "exact text for each image" not finalized → Stage 3
Copy Spec confirmed, need to generate copyable prompts; (optional) batch generate images after prompt confirmation → Stage 4
User feedbacks "too much text / not good-looking / doesn't fit the cover" → Stage 5 (roll back to Stage 1 to re-lock requirements and text density if necessary)

At the start of each stage:

Inform the user of the current stage and deliverables for this stage
Read the corresponding stage file and execute according to steps

Output Specifications (Must Follow)

Each image has one "core message", do not include explanatory paragraphs in the image
All Chinese text must be clearly readable: large font size, short sentences with minimal text, avoid dense small text
Each prompt is output in an independent code block for easy copying
Default output is 16:9 landscape (unless the user explicitly requests 3:4 comic/portrait)
Default style: cream paper background + colored pencil watercolor hand-drawn + light doodles, fun but clean (can use
```
templates/style-block.md
```
)
Once the Stage 3 deliverable (Copy Spec) is confirmed, the copy cannot be modified without permission in Stage 4; only packaging and parameter/constraint completion are allowed

Quick Start (Minimum Input for Users)

Users only need to provide these four items to start:

Content to be illustrated (can be a paragraph, a section, or an entire article)
Usage scenario + viewing distance (PPT projection for long-distance viewing / mobile phone for close viewing / poster)
Target audience (beginners/practitioners/managers/students...)
Preference: "more minimal and clean" or "higher information density"

Optional supplements (not required):

What type of images you roughly want: cover/table of contents, single-page overview, lecture explanation, social media poster (I will recommend based on scenario and preference if unsure)

Your delivery order:

First output: Image list (quantity + one-sentence purpose for each + template recommendation) (Stage 2)
After user confirmation: Output Copy Spec for each image (finalize text word by word) (Stage 3)
After Copy Spec confirmation: Output copyable prompts/call packages for each image (Stage 4)
(Optional) After prompt confirmation: Batch call API to generate images (continuous process within Stage 4)
Enter iteration when the user says "too much text / not good-looking" (Stage 5)

File Structure

stages/
├── 01-brief.md
├── 02-plan.md
├── 03-copy.md
├── 04-prompts.md
└── 05-iterate.md

templates/
├── style-block.md
├── 16x9-infographic.md
├── 16x9-contrast-2cards.md
├── 16x9-3cards-insights.md
├── 16x9-cover-roadmap.md
├── 16x9-5panel-comic.md
├── api-config.md
├── apimart-curl.md
├── apimart-requests-jsonl.md
└── checklist.md

examples/
└── ai-tools-selection.md

scripts/
├── apimart_batch_generate.py
├── apimart.env.example
└── README.md