ai-image
AI Image Generation Skill
Generate high-quality AI images using OpenAI's gpt-image-1 model with customizable styles and themes.
When to Use This Skill
Use this skill when the user wants to:
- Generate images from text descriptions
- Create artwork with specific artistic styles
- Generate images with particular aspect ratios (vertical, horizontal, square)
- Apply themed visual styles (Studio Ghibli, futuristic, Pixar, oil painting, Chinese painting)
Instructions
- Check for API Key: Verify that the OPENAI_API_KEY environment variable is set
- Gather Requirements: Ask the user for:
  - Image prompt (required)
  - Style/aspect ratio (optional, defaults to square): vertical (1024x1536), horizontal (1536x1024), or square (1024x1024)
  - Theme (optional): ghibli, futuristic, pixar, oil-paint, or chinese-paint
  - Output location (optional, defaults to ./generated_image.png)
- Run the CLI: Execute the main.py script with the appropriate parameters
- Report Results: Show the user where the image was saved and any relevant details
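The gather-then-run steps above can be sketched as a small helper that turns the collected answers into a main.py invocation. This is only an illustration: `build_command` is a hypothetical name, not part of the skill, and it assumes the flags behave as shown in the usage examples in this document.

```python
import shlex

# Choices copied from the option lists in this document.
STYLES = ("vertical", "horizontal", "square")
THEMES = ("ghibli", "futuristic", "pixar", "oil-paint", "chinese-paint")


def build_command(prompt, style="square", theme=None, output="./generated_image.png"):
    """Return the argv list for one main.py run (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("an image prompt is required")
    if style not in STYLES:
        raise ValueError(f"unknown style: {style!r}")
    if theme is not None and theme not in THEMES:
        raise ValueError(f"unknown theme: {theme!r}")

    argv = ["uv", "run", "main.py", "--prompt", prompt, "--style", style]
    if theme is not None:
        # The theme is optional, so the flag is only added when one was chosen.
        argv += ["--theme", theme]
    argv += ["--output", output]
    return argv


if __name__ == "__main__":
    argv = build_command("a sunset over mountains", "horizontal", "oil-paint", "./sunset.png")
    # shlex.join quotes the prompt so the line can be pasted into a shell.
    print(shlex.join(argv))
```

Returning a list rather than a single string keeps the prompt intact as one argument if the list is later passed to `subprocess.run`.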
- 检查API密钥:确认已设置OPENAI_API_KEY环境变量
- 收集需求信息:向用户询问以下内容:
- 图像提示词(必填)
- 风格/宽高比:竖版(1024x1536)、横版(1536x1024)或正方形(1024x1024)
- 主题:ghibli、futuristic、pixar、oil-paint或chinese-paint(可选)
- 输出位置(可选,默认值为./generated_image.png)
- 运行CLI:使用相应参数执行main.py脚本
- 反馈结果:告知用户图像的保存位置及相关细节
Available Options
Aspect Ratios (--style)
- vertical: 1024x1536 pixels (portrait orientation)
- horizontal: 1536x1024 pixels (landscape orientation)
- square: 1024x1024 pixels (default)
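As a rough illustration of how a --style value could be resolved to a pixel size, the mapping can be written as a lookup table. The names `STYLE_TO_SIZE` and `size_for_style` are assumptions made for this sketch; the document does not show how main.py implements the option.

```python
# Pixel sizes copied from the aspect-ratio list; the names are illustrative.
STYLE_TO_SIZE = {
    "vertical": "1024x1536",    # portrait
    "horizontal": "1536x1024",  # landscape
    "square": "1024x1024",      # default
}


def size_for_style(style="square"):
    """Return the size string for a --style value, or raise on an unknown one."""
    try:
        return STYLE_TO_SIZE[style]
    except KeyError:
        valid = ", ".join(sorted(STYLE_TO_SIZE))
        raise ValueError(f"unknown style {style!r}; expected one of: {valid}") from None
```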
Artistic Themes (--theme)
- ghibli: Studio Ghibli animation style with whimsical, dreamlike aesthetics
- futuristic: Sci-fi style with sleek designs and neon lights
- pixar: Vibrant 3D animation style with expressive characters
- oil-paint: Classical oil painting with rich textures and brushstrokes
- chinese-paint: Traditional Chinese ink painting with delicate brushwork
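This document does not say how main.py applies a theme. One plausible approach, shown here purely as an assumption, is to append the theme's description to the user's prompt before sending it to the model. `THEME_HINTS` and `apply_theme` are hypothetical names, and the hint text is taken from the list above.

```python
# Descriptions copied from the theme list; how main.py really uses them is unknown.
THEME_HINTS = {
    "ghibli": "Studio Ghibli animation style with whimsical, dreamlike aesthetics",
    "futuristic": "Sci-fi style with sleek designs and neon lights",
    "pixar": "Vibrant 3D animation style with expressive characters",
    "oil-paint": "Classical oil painting with rich textures and brushstrokes",
    "chinese-paint": "Traditional Chinese ink painting with delicate brushwork",
}


def apply_theme(prompt, theme=None):
    """Return the prompt unchanged, or with the theme description appended."""
    if theme is None:
        return prompt
    if theme not in THEME_HINTS:
        raise ValueError(f"unknown theme: {theme!r}")
    return f"{prompt}. Style: {THEME_HINTS[theme]}"
```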
Usage Examples
Basic Usage
```bash
uv run main.py --prompt "a cat sitting on a tree"
```
With Style and Theme
```bash
uv run main.py --prompt "a sunset over mountains" --style horizontal --theme oil-paint --output ./sunset.png
```
Futuristic Portrait
```bash
uv run main.py --prompt "a robot in a city" --style vertical --theme futuristic --output ./robot.png
```
Studio Ghibli Landscape
```bash
uv run main.py --prompt "a magical forest with spirits" --style horizontal --theme ghibli --output ./forest.png
```
Setup Requirements
This skill requires an OpenAI API key with access to the gpt-image-1 model:
```bash
export OPENAI_API_KEY='your-api-key-here'
```
Note: Using gpt-image-1 requires organization verification on the OpenAI platform.
Technical Details
- Model: OpenAI gpt-image-1 (released April 2025)
- Response Format: Base64 encoded images (b64_json)
- Supported Sizes: 1024x1024, 1024x1536, 1536x1024
- Maximum Resolution: Up to 4096x4096 pixels
- Dependencies: openai>=2.7.1
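To make the Base64 response format concrete, here is a minimal sketch of requesting an image through the OpenAI Python SDK and writing the decoded bytes to disk. It is not the skill's actual main.py: the function names are hypothetical, and `generate_image` assumes OPENAI_API_KEY is set and the organization is verified for gpt-image-1.

```python
import base64
from pathlib import Path


def save_b64_image(b64_json, output="./generated_image.png"):
    """Decode a Base64 payload and write the raw image bytes to disk."""
    path = Path(output)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(base64.b64decode(b64_json))
    return path


def generate_image(prompt, size="1024x1024", output="./generated_image.png"):
    """Request one image from gpt-image-1 and save it (makes a billed API call)."""
    # Imported lazily so save_b64_image stays usable without the SDK installed.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    result = client.images.generate(model="gpt-image-1", prompt=prompt, size=size)
    # The image arrives as a Base64 string in the b64_json field.
    return save_b64_image(result.data[0].b64_json, output)
```

Keeping the decoding step in its own function makes it possible to exercise the file-writing path without network access.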
Pricing Information
Usage is priced per token:
- Text tokens: $5 per million
- Image input tokens: $10 per million
- Image output tokens: $40 per million
Approximate costs per generated image:
- Low quality square: ~$0.02
- Medium quality square: ~$0.07
- High quality square: ~$0.19
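The per-token rates above translate into a simple cost formula: multiply each token count by its rate and divide by one million. The sketch below applies that arithmetic; `estimate_cost` is a hypothetical helper, and the token counts in any call are whatever the caller supplies, not figures from this document.

```python
# USD per million tokens, copied from the pricing list.
RATES_PER_MILLION = {"text": 5.0, "image_input": 10.0, "image_output": 40.0}


def estimate_cost(text_tokens=0, image_input_tokens=0, image_output_tokens=0):
    """Return the estimated cost in USD for the given token counts."""
    counts = {
        "text": text_tokens,
        "image_input": image_input_tokens,
        "image_output": image_output_tokens,
    }
    total = sum(counts[kind] * RATES_PER_MILLION[kind] for kind in counts)
    return total / 1_000_000
```

For example, under these rates a request with 200 text tokens and 1,000 image output tokens (illustrative numbers) would cost (200 × 5 + 1,000 × 40) / 1,000,000 = $0.041.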
Troubleshooting
API Key Not Set
If you see "Error: OPENAI_API_KEY environment variable not set", ensure your API key is exported in your shell session.
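A check along the following lines would produce that message. This is a sketch under the assumption that the script reads the key from the process environment; `require_api_key` is a hypothetical name, and the real main.py may perform the check differently.

```python
import os


def require_api_key(env=None):
    """Return the API key, or raise with the error message quoted above."""
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        # An empty or whitespace-only value is treated the same as a missing one.
        raise RuntimeError("Error: OPENAI_API_KEY environment variable not set")
    return key
```

Because the variable must exist in the process that runs the script, it has to be exported in the same shell session (or a parent of it) before `uv run main.py` is invoked.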
Organization Not Verified
gpt-image-1 requires organization verification on platform.openai.com. Visit your OpenAI account settings to complete verification.
Invalid Size Error
Ensure you're using one of the supported sizes: 1024x1024, 1024x1536, or 1536x1024.