generate-image
Generate or edit images using AI models (FLUX, Gemini). Use for general-purpose image generation including photos, illustrations, artwork, visual assets, concept art, and any image that isn't a technical diagram or schematic. For flowcharts, circuits, pathways, and technical diagrams, use the scientific-schematics skill instead.
Source: ynulihao/agentskillos
NPX Install
npx skill4agent add ynulihao/agentskillos generate-image
SKILL.md Content
Generate Image
Generate and edit high-quality images using OpenRouter's image generation models including FLUX.2 Pro and Gemini 3 Pro.
When to Use This Skill
Use generate-image for:
- Photos and photorealistic images
- Artistic illustrations and artwork
- Concept art and visual concepts
- Visual assets for presentations or documents
- Image editing and modifications
- Any general-purpose image generation needs
Use scientific-schematics instead for:
- Flowcharts and process diagrams
- Circuit diagrams and electrical schematics
- Biological pathways and signaling cascades
- System architecture diagrams
- CONSORT diagrams and methodology flowcharts
- Any technical/schematic diagrams
Quick Start
Use the scripts/generate_image.py script to generate or edit images:
bash
# Generate a new image
python scripts/generate_image.py "A beautiful sunset over mountains"
# Edit an existing image
python scripts/generate_image.py "Make the sky purple" --input photo.jpg
This generates/edits an image and saves it as generated_image.png in the current directory.
API Key Setup
CRITICAL: The script requires an OpenRouter API key. Before running, check if the user has configured their API key:
- Look for a .env file in the project directory or parent directories
- Check for OPENROUTER_API_KEY=<key> in the .env file
- If not found, inform the user they need to:
  - Create a .env file with OPENROUTER_API_KEY=your-api-key-here
  - Or set the environment variable: export OPENROUTER_API_KEY=your-api-key-here
  - Get an API key from: https://openrouter.ai/keys
The script will automatically detect the .env file and provide clear error messages if the API key is missing.
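The lookup described above (a .env file in the project directory or a parent directory, then the environment variable) can be sketched as follows. This is a minimal illustration, not the script's actual code: the helper names find_env_file, read_key_from_env_file, and find_openrouter_key are hypothetical, and the order of precedence between the .env file and the environment variable is an assumption.

```python
import os
from pathlib import Path
from typing import Optional


def find_env_file(start: Path) -> Optional[Path]:
    """Return the first .env found in start or any of its parent directories."""
    start = start.resolve()
    for directory in [start, *start.parents]:
        candidate = directory / ".env"
        if candidate.is_file():
            return candidate
    return None


def read_key_from_env_file(env_path: Path, name: str = "OPENROUTER_API_KEY") -> Optional[str]:
    """Parse simple NAME=value lines; blank lines and comments are skipped."""
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            value = value.strip().strip("'\"")
            return value or None
    return None


def find_openrouter_key(start: Path) -> Optional[str]:
    """Hypothetical lookup: .env file first, then the environment variable (assumed order)."""
    env_file = find_env_file(start)
    if env_file is not None:
        key = read_key_from_env_file(env_file)
        if key:
            return key
    return os.environ.get("OPENROUTER_API_KEY") or None
```

Calling find_openrouter_key(Path.cwd()) before running the script gives a quick yes/no answer on whether a key is configured. A None result means the user still needs to follow the setup steps.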
Model Selection
Default model: google/gemini-3-pro-image-preview (high quality, recommended)
Available models for generation and editing:
- google/gemini-3-pro-image-preview - High quality, supports generation + editing
- black-forest-labs/flux.2-pro - Fast, high quality, supports generation + editing
Generation only:
- black-forest-labs/flux.2-flex - Fast and cheap, but not as high quality as pro
Select based on:
- Quality: Use gemini-3-pro or flux.2-pro
- Editing: Use gemini-3-pro or flux.2-pro (both support image editing)
- Cost: Use flux.2-flex for generation only
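The selection criteria above can be written as a small decision helper. This is only a sketch: choose_model and its parameters are hypothetical names, and falling back to the default model when no preference is given is an assumption based on the stated default.

```python
GEMINI_PRO = "google/gemini-3-pro-image-preview"
FLUX_PRO = "black-forest-labs/flux.2-pro"
FLUX_FLEX = "black-forest-labs/flux.2-flex"

# Editing support as stated in this document.
SUPPORTS_EDITING = {GEMINI_PRO: True, FLUX_PRO: True, FLUX_FLEX: False}


def choose_model(editing: bool = False, prefer_low_cost: bool = False) -> str:
    """Hypothetical helper: pick a model ID from the criteria listed above."""
    if editing:
        # Only the two pro models support editing, so cost preference is ignored here.
        return GEMINI_PRO
    if prefer_low_cost:
        # flux.2-flex is generation only, so it is offered only when not editing.
        return FLUX_FLEX
    return GEMINI_PRO
```

The returned ID can be passed straight to the script's --model option.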
Common Usage Patterns
Basic generation
bash
python scripts/generate_image.py "Your prompt here"
Specify model
bash
python scripts/generate_image.py "A cat in space" --model "black-forest-labs/flux.2-pro"
Custom output path
bash
python scripts/generate_image.py "Abstract art" --output artwork.png
Edit an existing image
bash
python scripts/generate_image.py "Make the background blue" --input photo.jpg
Edit with a specific model
bash
python scripts/generate_image.py "Add sunglasses to the person" --input portrait.png --model "black-forest-labs/flux.2-pro"
Edit with custom output
bash
python scripts/generate_image.py "Remove the text from the image" --input screenshot.png --output cleaned.png
Multiple images
Run the script multiple times with different prompts or output paths:
bash
python scripts/generate_image.py "Image 1 description" --output image1.png
python scripts/generate_image.py "Image 2 description" --output image2.png
Script Parameters
- prompt (required): Text description of the image to generate, or editing instructions
- --input or -i: Input image path for editing (enables edit mode)
- --model or -m: OpenRouter model ID (default: google/gemini-3-pro-image-preview)
- --output or -o: Output file path (default: generated_image.png)
- --api-key: OpenRouter API key (overrides .env file)
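For readers who want to see how these parameters fit together, here is a sketch of an equivalent command-line interface. It assumes the script uses Python's argparse, which the source does not confirm. build_parser is a hypothetical name, and the help strings paraphrase the list above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented parameters and defaults."""
    parser = argparse.ArgumentParser(description="Generate or edit an image via OpenRouter")
    parser.add_argument("prompt", help="Image description, or editing instructions")
    parser.add_argument("-i", "--input", help="Input image path (enables edit mode)")
    parser.add_argument("-m", "--model", default="google/gemini-3-pro-image-preview",
                        help="OpenRouter model ID")
    parser.add_argument("-o", "--output", default="generated_image.png",
                        help="Output file path")
    parser.add_argument("--api-key", help="OpenRouter API key (overrides .env file)")
    return parser


# Parse an example command line instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(["Make the sky purple", "--input", "photo.jpg"])
edit_mode = args.input is not None  # supplying --input switches to editing
```

In this sketch, omitting --model and --output leaves args.model and args.output at the documented defaults.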
Example Use Cases
For Scientific Documents
bash
# Generate a conceptual illustration for a paper
python scripts/generate_image.py "Microscopic view of cancer cells being attacked by immunotherapy agents, scientific illustration style" --output figures/immunotherapy_concept.png
# Create a visual for a presentation
python scripts/generate_image.py "DNA double helix structure with highlighted mutation site, modern scientific visualization" --output slides/dna_mutation.png
For Presentations and Posters
bash
# Title slide background
python scripts/generate_image.py "Abstract blue and white background with subtle molecular patterns, professional presentation style" --output slides/background.png
# Poster hero image
python scripts/generate_image.py "Laboratory setting with modern equipment, photorealistic, well-lit" --output poster/hero.png
For General Visual Content
bash
# Website or documentation images
python scripts/generate_image.py "Professional team collaboration around a digital whiteboard, modern office" --output docs/team_collaboration.png
# Marketing materials
python scripts/generate_image.py "Futuristic AI brain concept with glowing neural networks" --output marketing/ai_concept.png
Error Handling
The script provides clear error messages for:
- Missing API key (with setup instructions)
- API errors (with status codes)
- Unexpected response formats
- Missing dependencies (requests library)
If the script fails, read the error message and address the issue before retrying.
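When the script is driven from another Python program, the "read the error message before retrying" advice can be applied along these lines. This is a sketch under two assumptions the source does not state: that the script exits with a non-zero status on failure, and that it writes its error message to stderr or stdout. run_command and generate_image_command are hypothetical helpers.

```python
import subprocess
import sys
from typing import List, Tuple


def run_command(cmd: List[str]) -> Tuple[bool, str]:
    """Run cmd and return (succeeded, message); on failure the message carries the error text."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode == 0:
        return True, result.stdout.strip()
    message = result.stderr.strip() or result.stdout.strip()
    return False, f"exit code {result.returncode}: {message}"


def generate_image_command(prompt: str, output: str = "generated_image.png") -> List[str]:
    """Build the documented command line using the current Python interpreter."""
    return [sys.executable, "scripts/generate_image.py", prompt, "--output", output]
```

A caller would run run_command(generate_image_command("Abstract art", "artwork.png")) and, if the first element is False, show the message to the user rather than retrying blindly.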
Notes
- Images are returned as base64-encoded data URLs and automatically saved as PNG files
- The script supports both images and content response formats from different OpenRouter models
- Generation time varies by model (typically 5-30 seconds)
- For image editing, the input image is encoded as base64 and sent to the model
- Supported input image formats: PNG, JPEG, GIF, WebP
- Check OpenRouter pricing for cost information: https://openrouter.ai/models
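The base64 handling mentioned in these notes can be illustrated with a short round trip: encoding an input image as a data URL and decoding a returned data URL into bytes that can be written to a PNG file. This is a sketch, not the script's code. The helper names and the extension-to-MIME-type table are assumptions that cover only the four input formats listed above.

```python
import base64
from pathlib import Path

# Assumed mapping for the supported input formats (PNG, JPEG, GIF, WebP).
MIME_TYPES = {
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".gif": "image/gif",
    ".webp": "image/webp",
}


def image_to_data_url(path: Path) -> str:
    """Encode an image file as a base64 data URL."""
    suffix = path.suffix.lower()
    if suffix not in MIME_TYPES:
        raise ValueError(f"Unsupported input image format: {suffix or path.name}")
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    return f"data:{MIME_TYPES[suffix]};base64,{encoded}"


def data_url_to_bytes(data_url: str) -> bytes:
    """Decode a base64 data URL into raw image bytes."""
    header, sep, payload = data_url.partition(",")
    if not sep or not header.startswith("data:") or not header.endswith(";base64"):
        raise ValueError("Expected a base64-encoded data URL")
    return base64.b64decode(payload)
```

Saving a result then amounts to Path("generated_image.png").write_bytes(data_url_to_bytes(url)), where url is the data URL taken from the model response.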
Image Editing Tips
- Be specific about what changes you want (e.g., "change the sky to sunset colors" vs "edit the sky")
- Reference specific elements in the image when possible
- For best results, use clear and detailed editing instructions
- Both Gemini 3 Pro and FLUX.2 Pro support image editing through OpenRouter
Integration with Other Skills
- scientific-schematics: Use for technical diagrams, flowcharts, circuits, pathways
- generate-image: Use for photos, illustrations, artwork, visual concepts
- scientific-slides: Combine with generate-image for visually rich presentations
- latex-posters: Use generate-image for poster visuals and hero images