image_generation

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

能力概述

Capability Overview

AI图像生成能力让你能够：

文生图：根据文字描述生成图像
图生图：基于参考图像生成新图像
图像编辑：修改现有图像的特定部分
风格转换：改变图像风格（写实、动漫、油画等）
文字渲染：在图像中生成清晰可读的文字

底层基于 Google Gemini 的 Nano Banana / Nano Banana Pro 模型。

AI image generation capabilities allow you to:

Text-to-Image: Generate images based on text descriptions
Image-to-Image: Generate new images based on reference images
Image Editing: Modify specific parts of existing images
Style Transfer: Change image styles (realistic, anime, oil painting, etc.)
Text Rendering: Generate clear and readable text in images

工作流程

Workflow

Phase 1: 需求理解

Phase 1: Requirement Understanding

理解用户的图像需求（主题、风格、用途）
确认输出格式（尺寸、分辨率、数量）
如有参考图，确认编辑意图

Understand the user's image requirements (theme, style, usage)
Confirm output format (dimensions, resolution, quantity)
If there are reference images, confirm the editing intent

Phase 2: Prompt 构建

Phase 2: Prompt Construction

将用户意图转化为英文 Prompt（效果更好）

遵循 Prompt 公式：

<subject> <action> <scene> <style> <quality>

补充必要的细节描述

Convert user intent into English prompts (better results)

Follow the prompt formula:

<subject> <action> <scene> <style> <quality>

Supplement necessary detailed descriptions

Phase 3: 图像生成

Phase 3: Image Generation

调用
```
generate_image
```
工具
如需编辑，调用
```
edit_image
```
工具
生成多个候选（如用户需要选择）

Call the
```
generate_image
```
tool
Call the
```
edit_image
```
tool if editing is needed
Generate multiple candidates (for user selection if required)

Phase 4: 交付

Phase 4: Delivery

展示生成结果
询问是否需要调整
保存到用户指定位置

Display the generated results
Ask if adjustments are needed
Save to the user-specified location

工具使用

Tool Usage

generate_image

用途：根据文字描述生成图像
参数：
- ```
prompt
```
  : 图像描述（英文效果更佳）
- ```
style
```
  : 风格预设（realistic, anime, oil_painting, watercolor, minimal, cinematic）
- ```
aspect_ratio
```
  : 宽高比（1:1, 16:9, 9:16, 4:3, 3:4）
- ```
resolution
```
  : 分辨率（1K, 2K, 4K）
- ```
num_images
```
  : 生成数量（1-4）

示例：

python

generate_image(
    prompt="A majestic horse galloping through cherry blossoms, golden hour lighting, Chinese New Year festive atmosphere",
    style="realistic",
    aspect_ratio="16:9",
    resolution="2K",
    num_images=2
)

Purpose: Generate images based on text descriptions
Parameters:
- ```
prompt
```
  : Image description (English yields better results)
- ```
style
```
  : Style presets (realistic, anime, oil_painting, watercolor, minimal, cinematic)
- ```
aspect_ratio
```
  : Aspect ratio (1:1, 16:9, 9:16, 4:3, 3:4)
- ```
resolution
```
  : Resolution (1K, 2K, 4K)
- ```
num_images
```
  : Number of images to generate (1-4)

Example:

python

generate_image(
    prompt="A majestic horse galloping through cherry blossoms, golden hour lighting, Chinese New Year festive atmosphere",
    style="realistic",
    aspect_ratio="16:9",
    resolution="2K",
    num_images=2
)

edit_image

用途：编辑现有图像
参数：
- ```
image_path
```
  : 原图路径或URL
- ```
prompt
```
  : 编辑指令（如："将背景改为夜景"）
- ```
preserve_subject
```
  : 是否保持主体不变（默认True）

示例：

python

edit_image(
    image_path="/workspace/photo.jpg",
    prompt="Add Chinese New Year decorations and red lanterns to the background",
    preserve_subject=True
)

Purpose: Edit existing images
Parameters:
- ```
image_path
```
  : Original image path or URL
- ```
prompt
```
  : Editing instructions (e.g., "Change the background to night scene")
- ```
preserve_subject
```
  : Whether to keep the subject unchanged (default True)

Example:

python

edit_image(
    image_path="/workspace/photo.jpg",
    prompt="Add Chinese New Year decorations and red lanterns to the background",
    preserve_subject=True
)

Prompt 最佳实践

Prompt Best Practices

基础公式

Basic Formula

[主体] + [动作/姿态] + [场景/背景] + [风格] + [氛围/光线]

[Subject] + [Action/Pose] + [Scene/Background] + [Style] + [Atmosphere/Lighting]

风格关键词

Style Keywords

写实：photorealistic, hyperrealistic, 8K, detailed
动漫：anime style, Ghibli style, cel shading
油画：oil painting style, impressionist, Van Gogh style
极简：minimal, flat design, vector art
电影感：cinematic, dramatic lighting, movie poster style

Realistic: photorealistic, hyperrealistic, 8K, detailed
Anime: anime style, Ghibli style, cel shading
Oil Painting: oil painting style, impressionist, Van Gogh style
Minimalist: minimal, flat design, vector art
Cinematic: cinematic, dramatic lighting, movie poster style

质量增强词

Quality Enhancement Words

```
high quality
```
,
```
detailed
```
,
```
sharp focus
```
```
professional photography
```
,
```
award winning
```
```
4K resolution
```
,
```
ultra detailed
```

```
high quality
```
,
```
detailed
```
,
```
sharp focus
```
```
professional photography
```
,
```
award winning
```
```
4K resolution
```
,
```
ultra detailed
```

避免事项

Avoidance Items

❌ 避免模糊描述："一张好看的图"
❌ 避免矛盾描述："写实风格的卡通"
❌ 避免敏感内容
✅ 具体、清晰、有层次

❌ Avoid vague descriptions: "A nice picture"
❌ Avoid contradictory descriptions: "Cartoon in realistic style"
❌ Avoid sensitive content
✅ Be specific, clear, and layered

应用场景模板

Application Scenario Templates

场景1：微信红包封面/节日祝福图

Scenario 1: WeChat Red Envelope Cover/Festival Greeting Image

yaml

prompt_template: |
  A {animal} in {pose}, surrounded by {decorations}, 
  Chinese New Year theme, festive red and gold colors, 
  {style} style, high quality, {text_content}
  
variables:
  animal: "majestic horse" # 马年
  pose: "running gracefully"
  decorations: "cherry blossoms, red lanterns, gold coins"
  style: "elegant illustration"
  text_content: "with Chinese text '恭喜发财' in golden calligraphy"

yaml

prompt_template: |
  A {animal} in {pose}, surrounded by {decorations}, 
  Chinese New Year theme, festive red and gold colors, 
  {style} style, high quality, {text_content}
  
variables:
  animal: "majestic horse" # Year of the Horse
  pose: "running gracefully"
  decorations: "cherry blossoms, red lanterns, gold coins"
  style: "elegant illustration"
  text_content: "with Chinese text '恭喜发财' in golden calligraphy"

场景2：演示文稿配图

Scenario 2: Presentation Illustration

yaml

prompt_template: |
  {concept} visualization, professional infographic style,
  clean white background, modern corporate aesthetic,
  subtle gradients, minimalist design

variables:
  concept: "AI workflow automation"

yaml

prompt_template: |
  {concept} visualization, professional infographic style,
  clean white background, modern corporate aesthetic,
  subtle gradients, minimalist design

variables:
  concept: "AI workflow automation"

场景3：社交媒体内容

Scenario 3: Social Media Content

yaml

prompt_template: |
  {subject} {action}, {platform} optimized aspect ratio,
  vibrant colors, eye-catching composition, 
  trending aesthetic, shareable content style
  
variables:
  subject: "coffee cup"
  action: "with steam rising"
  platform: "Instagram" # 1:1 or 4:5

yaml

prompt_template: |
  {subject} {action}, {platform} optimized aspect ratio,
  vibrant colors, eye-catching composition, 
  trending aesthetic, shareable content style
  
variables:
  subject: "coffee cup"
  action: "with steam rising"
  platform: "Instagram" # 1:1 or 4:5

输出格式

Output Format

生成结果展示

Generated Result Display

markdown

undefined

markdown

undefined

🎨 图像生成完成

🎨 Image Generation Completed

Prompt: [使用的英文Prompt]

参数:

风格: [style]
尺寸: [aspect_ratio]
分辨率: [resolution]

生成结果:

下一步:

满意，保存到指定位置
需要调整风格/颜色
需要修改特定部分
重新生成

undefined

Prompt: [English prompt used]

Parameters:

Style: [style]
Aspect Ratio: [aspect_ratio]
Resolution: [resolution]

Generated Results: Generated Image

Next Steps:

Satisfied, save to specified location
Need to adjust style/color
Need to modify specific parts
Regenerate

undefined

注意事项

Notes

版权合规：生成的图像带有 SynthID 水印
内容政策：遵守 Google 使用政策，不生成敏感内容
商业使用：支持商业用途（营销、产品）
文字渲染：Nano Banana Pro 支持多语言文字，但中文效果需要验证
角色一致性：跨图保持角色特征需要使用参考图功能

Copyright Compliance: Generated images have SynthID watermarks
Content Policy: Comply with Google's usage policies, do not generate sensitive content
Commercial Use: Supports commercial use (marketing, products)
Text Rendering: Nano Banana Pro supports multilingual text, but Chinese effects need verification
Character Consistency: Maintaining character features across images requires using the reference image function

资源引用

Resource References

```
resources/prompt_templates.yaml
```
- 预设 Prompt 模板
```
resources/style_presets.md
```
- 风格预设详解
```
resources/chinese_new_year_2026.md
```
- 马年专属模板

```
resources/prompt_templates.yaml
```
- Preset prompt templates
```
resources/style_presets.md
```
- Detailed style presets
```
resources/chinese_new_year_2026.md
```
- Year of the Horse exclusive templates