Image generation skill using Gemini Web. Generates images from text prompts via Google Gemini. Also supports text generation. Use as the image generation backend for other skills like cover-image, xhs-images, article-illustrator.
npx skill4agent add zsyggg/paper-craft-skills baoyu-gemini-web
keepSession, generateVideo
npx -y bun scripts/main.ts "Hello, Gemini"
npx -y bun scripts/main.ts --prompt "Explain quantum computing"
npx -y bun scripts/main.ts --prompt "A cute cat" --image cat.png
npx -y bun scripts/main.ts --promptfiles system.md content.md --image out.png
# Multi-turn conversation (agent generates unique sessionId)
npx -y bun scripts/main.ts "Remember this: 42" --sessionId my-unique-id-123
npx -y bun scripts/main.ts "What number?" --sessionId my-unique-id-123
createGeminiWebExecutor(geminiOptions), scripts/executor.ts
GeminiWebOptions:
referenceImages?: string | string[]
keepSession?: boolean, chatMetadata
generateVideo?: string, video_gen_chip
generateVideo, generateImage, editImage, keepSession=true, referenceImages
# Simple prompt (positional)
npx -y bun scripts/main.ts "Your prompt here"
# Explicit prompt flag
npx -y bun scripts/main.ts --prompt "Your prompt here"
npx -y bun scripts/main.ts -p "Your prompt here"
# With model selection
npx -y bun scripts/main.ts -p "Hello" -m gemini-2.5-pro
# Pipe from stdin
echo "Summarize this" | npx -y bun scripts/main.ts
# Generate image with default path (./generated.png)
npx -y bun scripts/main.ts --prompt "A sunset over mountains" --image
# Generate image with custom path
npx -y bun scripts/main.ts --prompt "A cute robot" --image robot.png
# Shorthand
npx -y bun scripts/main.ts "A dragon" --image=dragon.png
# Plain text (default)
npx -y bun scripts/main.ts "Hello"
# JSON output
npx -y bun scripts/main.ts "Hello" --json

| Option | Description |
|---|---|
| --prompt, -p | Prompt text |
| --promptfiles | Read prompt from files (concatenated in order) |
| -m | Model: gemini-3-pro (default), gemini-2.5-pro, gemini-2.5-flash |
| --image | Generate image, save to path (default: generated.png) |
| --sessionId | Session ID for multi-turn conversation (agent generates unique ID) |
| --list-sessions | List saved sessions (max 100, sorted by update time) |
| --json | Output as JSON |
| --login | Refresh cookies only, then exit |
| | Custom cookie file path |
| | Chrome profile directory |
| | Show help |
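
When another skill drives this CLI programmatically, it helps to assemble the arguments in one place and to read the `--json` output defensively. The following is a minimal sketch, not part of the skill itself: `CliOptions`, `buildArgs`, and `extractText` are hypothetical names, only flags shown in this document are emitted, and the `text` field is an assumption inferred from the `jq '.text'` example.

```typescript
// Hypothetical option bag for illustration; it is not part of the skill.
// Only flags that appear in this document are emitted.
interface CliOptions {
  prompt: string;
  model?: string; // passed with -m
  image?: string | true; // true means bare --image (default output path)
  sessionId?: string;
  json?: boolean;
}

// Build the argument list that would follow `npx` on the command line.
function buildArgs(opts: CliOptions): string[] {
  const args: string[] = ["-y", "bun", "scripts/main.ts", "--prompt", opts.prompt];
  if (opts.model) args.push("-m", opts.model);
  if (typeof opts.image === "string") args.push("--image", opts.image);
  if (opts.sessionId) args.push("--sessionId", opts.sessionId);
  if (opts.json) args.push("--json");
  // Bare --image goes last, mirroring the document's example where it ends
  // the command, so no later flag can be mistaken for the output path.
  if (opts.image === true) args.push("--image");
  return args;
}

// Assumption: --json prints one JSON object with a string `text` field,
// inferred from the `jq '.text'` example; other fields are ignored here.
function extractText(stdout: string): string {
  const parsed: unknown = JSON.parse(stdout);
  if (typeof parsed === "object" && parsed !== null) {
    const text = (parsed as { text?: unknown }).text;
    if (typeof text === "string") return text;
  }
  throw new Error("JSON output has no string `text` field");
}

const exampleArgs = buildArgs({ prompt: "Hello", model: "gemini-2.5-pro", json: true });
console.log(JSON.stringify(exampleArgs));
console.log(extractText('{"text":"Hi there"}'));
```

The resulting array can be handed to whatever process-spawning API the calling skill already uses, with `npx` as the command.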
scripts/main.ts, --sessionId
Models: gemini-3-pro, gemini-2.5-pro, gemini-2.5-flash
# Force cookie refresh
npx -y bun scripts/main.ts --login

| Variable | Description |
|---|---|
| | Data directory |
| | Cookie file path |
| | Chrome profile directory |
| | Chrome executable path |
npx -y bun scripts/main.ts "What is the capital of France?"
npx -y bun scripts/main.ts "A photorealistic image of a golden retriever puppy" --image puppy.png
npx -y bun scripts/main.ts "Hello" --json | jq '.text'
# Concatenate system.md + content.md as prompt
npx -y bun scripts/main.ts --promptfiles system.md content.md --image output.png
# Start a session with unique ID (agent generates this)
npx -y bun scripts/main.ts "You are a helpful math tutor." --sessionId task-abc123
# Continue the conversation (remembers context)
npx -y bun scripts/main.ts "What is 2+2?" --sessionId task-abc123
npx -y bun scripts/main.ts "Now multiply that by 10" --sessionId task-abc123
# List recent sessions (max 100, sorted by update time)
npx -y bun scripts/main.ts --list-sessions
Session file: ~/Library/Application Support/baoyu-skills/gemini-web/sessions/<id>.json
Fields: id, metadata, messages ({role, content, timestamp, error?}), createdAt, updatedAt
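
A tool that inspects saved sessions might want to check a file's shape before trusting it. The sketch below models the session fields named above as TypeScript types and adds a small runtime guard. The field types are assumptions (the document names the fields but not their types), and `SessionRecord`, `SessionMessage`, `isSessionRecord`, and `sessionFilePath` are hypothetical names, not the skill's own API.

```typescript
// Field names come from the document; every type here is an assumption.
interface SessionMessage {
  role: string;
  content: string;
  timestamp: string | number;
  error?: string;
}

interface SessionRecord {
  id: string;
  metadata: unknown;
  messages: SessionMessage[];
  createdAt: string | number;
  updatedAt: string | number;
}

function isObject(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null;
}

// Runtime guard: checks that the named fields are present and that each
// message carries a string role and content plus a timestamp.
function isSessionRecord(value: unknown): value is SessionRecord {
  if (!isObject(value)) return false;
  if (typeof value["id"] !== "string") return false;
  if (!("createdAt" in value) || !("updatedAt" in value)) return false;
  const messages = value["messages"];
  if (!Array.isArray(messages)) return false;
  return messages.every(
    (m: unknown) =>
      isObject(m) &&
      typeof m["role"] === "string" &&
      typeof m["content"] === "string" &&
      "timestamp" in m,
  );
}

// Build the session file path under a given home directory, following the
// macOS location shown in the document.
function sessionFilePath(homeDir: string, id: string): string {
  return `${homeDir}/Library/Application Support/baoyu-skills/gemini-web/sessions/${id}.json`;
}

// Illustrative record; the "user" role and numeric timestamps are assumed.
const exampleSession: unknown = {
  id: "task-abc123",
  metadata: {},
  messages: [{ role: "user", content: "What is 2+2?", timestamp: 0 }],
  createdAt: 0,
  updatedAt: 0,
};
console.log(isSessionRecord(exampleSession));
console.log(sessionFilePath("/Users/me", "task-abc123"));
```

Passing the home directory as a parameter keeps the helper free of platform imports; a caller would supply the real value from its own runtime.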