image-assistant


Image Assistant - Convert article/module content into 16:9 infographic prompts with a unified style, minimal text, and high readability; first determine "how many images are needed + what each image covers", then condense copy and metaphors, and finally output directly copyable image generation prompts and iterate.


NPX Install

npx skill4agent add yunshu0909/yunshu_skillshub image-assistant

SKILL.md Content (Chinese)


Image Assistant

Trigger Methods

Trigger when the user says something like:
  • "Make a graphic for this content / create several supporting images?"
  • "Give me two (or multiple) image generation prompts"
  • "Too much text, make it more interesting and readable"
  • "Package this process into a prompt template/skills"
  • Slash commands: "/image", "/配图", "/出图"

Process Overview

| Stage | Name | Goal | Detailed File |
| --- | --- | --- | --- |
| 1 | Requirements Clarification (Spec/DoD) | First explore requirements: content/scenario/audience/text density; default size and style; produce a one-sentence restatement and requirement summary | stages/01-brief.md |
| 2 | Image Planning (Chunking → List) | Chunk content → define image list (quantity / what each image covers / recommended template) | stages/02-plan.md |
| 3 | Copy Finalization (Copy Spec) | Finalize "what text to put on the image" word by word (single source of truth) | stages/03-copy.md |
| 4 | Prompt Packaging (Prompt Pack) | Package Copy Spec into copyable prompts; generate batch request packages and batch generate images after user confirmation | stages/04-prompts.md |
| 5 | Iterative Polishing | Reduce text, replace metaphors, and improve readability based on feedback | stages/05-iterate.md |

Scheduling Rules

How to determine the current stage:
  1. Requirements not clearly defined (content + scenario + audience + text density) → Stage 1
  2. Long article that needs chunking, or need to confirm "how many images / what each image covers" → Stage 2
  3. Image list confirmed, but "exact text for each image" not finalized → Stage 3
  4. Copy Spec confirmed, need to generate copyable prompts; (optional) batch generate images after prompt confirmation → Stage 4
  5. User gives feedback such as "too much text / not good-looking / doesn't fit the cover" → Stage 5 (roll back to Stage 1 to re-lock requirements and text density if necessary)
At the start of each stage:
  • Inform the user of the current stage and deliverables for this stage
  • Read the corresponding stage file and execute according to steps
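The scheduling rules above can be sketched as a small dispatcher. This is only an illustration, not the skill's actual implementation: `SessionState`, its flag names, and `pick_stage` are hypothetical, and checking feedback right after the brief check is an assumption about rule priority. The stage file paths are the ones listed in the process overview.

```python
from dataclasses import dataclass

# Stage number -> stage file, as listed in the process overview.
STAGE_FILES = {
    1: "stages/01-brief.md",
    2: "stages/02-plan.md",
    3: "stages/03-copy.md",
    4: "stages/04-prompts.md",
    5: "stages/05-iterate.md",
}


@dataclass
class SessionState:
    """Hypothetical progress flags; the names are illustrative only."""
    brief_locked: bool = False          # content + scenario + audience + text density
    image_list_confirmed: bool = False  # "how many images / what each covers"
    copy_spec_confirmed: bool = False   # exact text for each image
    has_feedback: bool = False          # e.g. "too much text", "not good-looking"


def pick_stage(state: SessionState) -> int:
    """Return the stage to run next."""
    if not state.brief_locked:
        return 1
    # Assumption: feedback takes priority once requirements are locked.
    if state.has_feedback:
        return 5
    if not state.image_list_confirmed:
        return 2
    if not state.copy_spec_confirmed:
        return 3
    return 4


state = SessionState(brief_locked=True, image_list_confirmed=True)
stage = pick_stage(state)
print(f"Current stage: {stage}, read {STAGE_FILES[stage]}")
```

Rolling back from Stage 5 to Stage 1 would correspond to resetting `brief_locked` to `False` in this sketch.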

Output Specifications (Must Follow)

  • Each image carries exactly one "core message"; do not include explanatory paragraphs in the image
  • All Chinese text must be clearly readable: large font size, short sentences with minimal text, avoid dense small text
  • Each prompt is output in an independent code block for easy copying
  • Default output is 16:9 landscape (unless the user explicitly requests 3:4 comic/portrait)
  • Default style: cream paper background + colored pencil watercolor hand-drawn + light doodles, fun but clean (can use templates/style-block.md)
  • Once the Stage 3 deliverable (Copy Spec) is confirmed, the copy cannot be modified without permission in Stage 4; only packaging and parameter/constraint completion are allowed
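The copy-lock rule above (Stage 4 may package the Copy Spec but not change it) can be checked mechanically. The following is a minimal sketch under stated assumptions: the prompt layout produced by `package_prompt` is hypothetical and does not reproduce the real templates under templates/, and both function names are invented for illustration.

```python
def package_prompt(copy_spec: list[str], style_block: str, aspect: str = "16:9") -> str:
    """Assemble one prompt: aspect ratio, style block, then the locked copy.

    The layout is an assumed example, not the skill's actual template.
    """
    text_lines = "\n".join(f'- "{line}"' for line in copy_spec)
    return (
        f"Aspect ratio: {aspect}\n"
        f"Style: {style_block}\n"
        f"Text on image (use exactly as written):\n{text_lines}"
    )


def missing_copy_lines(copy_spec: list[str], prompt: str) -> list[str]:
    """Return Copy Spec lines that do not appear verbatim in the prompt.

    A plain substring check: enough to catch rewording, but it does not
    detect extra text added to the prompt.
    """
    return [line for line in copy_spec if line not in prompt]


copy_spec = ["Pick one tool", "3 steps"]
style = "cream paper background, colored pencil watercolor, light doodles"
prompt = package_prompt(copy_spec, style)

# An empty list means every confirmed line survived packaging unchanged.
print(missing_copy_lines(copy_spec, prompt))
```

Running the check before showing a prompt to the user keeps the Copy Spec as the single source of truth for on-image text.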

Quick Start (Minimum Input for Users)

Users only need to provide these four items to start:
  1. Content to be illustrated (can be a paragraph, a section, or an entire article)
  2. Usage scenario + viewing distance (PPT projection for long-distance viewing / mobile phone for close viewing / poster)
  3. Target audience (beginners/practitioners/managers/students...)
  4. Preference: "more minimal and clean" or "higher information density"
Optional supplements (not required):
  • What type of images you roughly want: cover/table of contents, single-page overview, lecture explanation, social media poster (I will recommend based on scenario and preference if unsure)
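The four required inputs and the optional supplement can be modeled as a small record, so that missing items are easy to ask for in Stage 1. This is a sketch only: the `Brief` class and its field names are hypothetical and are not defined by the skill.

```python
from dataclasses import dataclass


@dataclass
class Brief:
    """Hypothetical container for the Quick Start inputs."""
    content: str = ""      # text to be illustrated
    scenario: str = ""     # usage scenario + viewing distance
    audience: str = ""     # beginners / practitioners / managers / students ...
    density: str = ""      # "more minimal and clean" or "higher information density"
    image_types: str = ""  # optional: cover, overview, lecture, poster ...


REQUIRED = ("content", "scenario", "audience", "density")


def missing_items(brief: Brief) -> list[str]:
    """List the required fields that are still empty."""
    return [name for name in REQUIRED if not getattr(brief, name).strip()]


brief = Brief(content="An article on choosing AI tools", audience="beginners")
print(missing_items(brief))
```

An empty result would mean the brief is complete enough to move on to image planning.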
Your delivery order:
  • First output: Image list (quantity + one-sentence purpose for each + template recommendation) (Stage 2)
  • After user confirmation: Output Copy Spec for each image (finalize text word by word) (Stage 3)
  • After Copy Spec confirmation: Output copyable prompts/call packages for each image (Stage 4)
  • (Optional) After prompt confirmation: Batch call API to generate images (continuous process within Stage 4)
  • Enter iteration when the user says "too much text / not good-looking" (Stage 5)

File Structure

stages/
├── 01-brief.md
├── 02-plan.md
├── 03-copy.md
├── 04-prompts.md
└── 05-iterate.md

templates/
├── style-block.md
├── 16x9-infographic.md
├── 16x9-contrast-2cards.md
├── 16x9-3cards-insights.md
├── 16x9-cover-roadmap.md
├── 16x9-5panel-comic.md
├── api-config.md
├── apimart-curl.md
├── apimart-requests-jsonl.md
└── checklist.md

examples/
└── ai-tools-selection.md

scripts/
├── apimart_batch_generate.py
├── apimart.env.example
└── README.md