qwen-image

Original🇺🇸 English
Not Translated

Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI. Models: Qwen-Image-2.0 (fast), Qwen-Image-2.0-Pro (professional text rendering). Capabilities: text-to-image, multi-image editing, complex text rendering. Triggers: qwen image, qwen-image, alibaba image, dashscope image, qwen image 2, qwen image pro

3installs
Added on

NPX Install

npx skill4agent add toolshell/skills qwen-image

SKILL.md Content

Qwen-Image - Alibaba Image Generation

Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI.
Qwen-Image-2.0

Quick Start

Requires inference.sh CLI (
infsh
). Get installation instructions:
npx skills add inference-sh/skills@agent-tools
bash
infsh login

infsh app run alibaba/qwen-image-2 --input '{"prompt": "A serene mountain landscape at sunset"}'

Models

ModelApp IDSpeedText RenderingBest For
Qwen-Image-2.0
alibaba/qwen-image-2
FastGoodGeneral use
Qwen-Image-2.0-Pro
alibaba/qwen-image-2-pro
StandardProfessionalPosters, text-heavy designs

Search Qwen Image Apps

bash
infsh app list --search "qwen image"

Examples

Basic Text-to-Image

bash
infsh app run alibaba/qwen-image-2 --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'

Multiple Images

bash
infsh app run alibaba/qwen-image-2 --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

Custom Resolution

bash
infsh app run alibaba/qwen-image-2-pro --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "width": 1536,
  "height": 1024
}'

Text-Heavy Poster (Pro)

bash
infsh app run alibaba/qwen-image-2-pro --input '{
  "prompt": "Poster with title \"Summer Sale!\" in bold red text at the top. Subtitle \"50% Off Everything\" in blue below. Beach background with palm trees.",
  "width": 1024,
  "height": 1536,
  "prompt_extend": false
}'

Image Editing (Multi-Reference)

bash
infsh app run alibaba/qwen-image-2 --input '{
  "prompt": "Make the girl from Image 1 wear the dress from Image 2 in the pose from Image 3",
  "reference_images": [
    {"uri": "https://example.com/person.jpg"},
    {"uri": "https://example.com/dress.jpg"},
    {"uri": "https://example.com/pose.jpg"}
  ]
}'

With Negative Prompt

bash
infsh app run alibaba/qwen-image-2-pro --input '{
  "prompt": "Professional headshot portrait, studio lighting",
  "negative_prompt": "low resolution, blurry, deformed, oversaturated"
}'

Reproducible with Seed

bash
infsh app run alibaba/qwen-image-2 --input '{
  "prompt": "Abstract geometric art in blue and gold",
  "seed": 12345
}'

Input Options

ParameterTypeDescription
prompt
stringRequired. What to generate or edit (max 800 chars)
reference_images
arrayInput images for editing (1-3 images)
num_images
integerNumber of images to generate (1-6)
width
integerOutput width in pixels (512-2048)
height
integerOutput height in pixels (512-2048)
watermark
booleanAdd "Qwen-Image" watermark
negative_prompt
stringContent to avoid (max 500 chars)
prompt_extend
booleanEnable prompt rewriting (default: true)
seed
integerRandom seed for reproducibility (0-2147483647)
Size constraint: Total pixels must be between 512×512 and 2048×2048.

Output

FieldTypeDescription
images
arrayThe generated or edited images (PNG format)
output_meta
objectMetadata with dimensions and count

Prompt Tips

For Text Rendering (use Pro model):
  • Put exact text in quotes:
    "Title: \"Hello World!\""
  • Specify font style, color, position
  • Set
    prompt_extend: false
    for precise control
Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon

Sample Workflow

bash
# 1. Generate sample input to see all options
infsh app sample alibaba/qwen-image-2-pro --save input.json

# 2. Edit the prompt
# 3. Run
infsh app run alibaba/qwen-image-2-pro --input input.json

Model Comparison

Featureqwen-image-2qwen-image-2-pro
SpeedFasterStandard
Text RenderingGoodProfessional
RealismStandardFine-grained
Semantic AdherenceGoodEnhanced

Related Skills

bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@agent-tools

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
Browse all image apps:
infsh app list --category image

Documentation