gemini-3-image-generation
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseGemini 3 Pro Image Generation (Nano Banana Pro)
Gemini 3 Pro Image 图像生成指南(Nano Banana Pro)
Comprehensive guide for generating images with Gemini 3 Pro Image (), also known as Nano Banana Pro. This skill focuses on IMAGE OUTPUT (generating images) - see for INPUT (analyzing images).
gemini-3-pro-image-previewgemini-3-multimodal本指南详细介绍如何使用Gemini 3 Pro Image(,又称Nano Banana Pro)生成图像。本技能聚焦于图像输出(生成图像)——如需了解图像输入(分析图像),请查看技能。
gemini-3-pro-image-previewgemini-3-multimodalOverview
概述
Gemini 3 Pro Image (Nano Banana Pro 🍌) is Google's image generation model featuring native 4K support, text rendering within images, grounded generation with Google Search, and conversational editing capabilities.
Gemini 3 Pro Image(Nano Banana Pro 🍌)是谷歌推出的图像生成模型,支持原生4K分辨率、图像内文本渲染、基于Google Search的事实锚定生成,以及对话式编辑功能。
Key Capabilities
核心功能
- 4K Resolution: Native 4K generation with upscaling to 2K/4K
- Text Rendering: High-quality text within images
- Grounded Generation: Fact-verified images using Google Search
- Conversational Editing: Multi-turn image modification preserving context
- Aspect Ratios: Supports 16:9 and custom ratios at 4K
- Quality Control: Fine-tuned generation parameters
- 4K分辨率: 原生4K生成,支持升级至2K/4K分辨率
- 文本渲染: 在图像中生成高质量文本
- 事实锚定生成: 利用Google Search生成符合事实的图像
- 对话式编辑: 多轮迭代修改图像,保留上下文
- 宽高比: 支持4K分辨率下的16:9及自定义宽高比
- 质量控制: 可微调生成参数
When to Use This Skill
适用场景
- Generating images from text prompts
- Creating 4K resolution images
- Rendering text within images
- Fact-verified image generation (grounded)
- Conversational image editing
- Multi-turn image refinement
- Custom aspect ratio images
- 从文本提示词生成图像
- 创建4K分辨率图像
- 在图像中渲染文本
- 生成符合事实的锚定图像
- 对话式图像编辑
- 多轮图像优化
- 生成自定义宽高比的图像
Quick Start
快速入门
Prerequisites
前置条件
- Gemini API setup (see skill)
gemini-3-pro-api - Model:
gemini-3-pro-image-preview
- Gemini API配置完成(详见技能)
gemini-3-pro-api - 模型:
gemini-3-pro-image-preview
Python Quick Start
Python快速入门
python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")Use the image generation model
使用图像生成模型
model = genai.GenerativeModel("gemini-3-pro-image-preview")
model = genai.GenerativeModel("gemini-3-pro-image-preview")
Generate image
生成图像
response = model.generate_content("A serene mountain landscape at sunset")
response = model.generate_content("A serene mountain landscape at sunset")
Save image
保存图像
if response.parts:
with open("generated_image.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
print("Image saved!")
undefinedif response.parts:
with open("generated_image.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
print("Image saved!")
undefinedNode.js Quick Start
Node.js快速入门
typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import fs from "fs";
const genAI = new GoogleGenerativeAI("YOUR_API_KEY");
const model = genAI.getGenerativeModel({ model: "gemini-3-pro-image-preview" });
const result = await model.generateContent("A serene mountain landscape at sunset");
const imageData = result.response.parts[0].inlineData.data;
fs.writeFileSync("generated_image.png", Buffer.from(imageData, "base64"));
console.log("Image saved!");typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import fs from "fs";
const genAI = new GoogleGenerativeAI("YOUR_API_KEY");
const model = genAI.getGenerativeModel({ model: "gemini-3-pro-image-preview" });
const result = await model.generateContent("A serene mountain landscape at sunset");
const imageData = result.response.parts[0].inlineData.data;
fs.writeFileSync("generated_image.png", Buffer.from(imageData, "base64"));
console.log("Image saved!");Core Tasks
核心任务
Task 1: Generate Image from Text Prompt
任务1:从文本提示词生成图像
Goal: Create high-quality images from text descriptions.
Python Example:
python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel(
"gemini-3-pro-image-preview",
generation_config={
"thinking_level": "high", # Best quality
"temperature": 1.0
}
)目标: 根据文本描述生成高质量图像。
Python示例:
python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel(
"gemini-3-pro-image-preview",
generation_config={
"thinking_level": "high", # 最佳质量
"temperature": 1.0
}
)Generate image
生成图像
prompt = """A futuristic cityscape at night with:
- Neon lights and holographic advertisements
- Flying vehicles
- Tall skyscrapers with unique architecture
- Rain-slicked streets reflecting the lights
- Cinematic, detailed, 4K quality"""
response = model.generate_content(prompt)
prompt = """A futuristic cityscape at night with:
- Neon lights and holographic advertisements
- Flying vehicles
- Tall skyscrapers with unique architecture
- Rain-slicked streets reflecting the lights
- Cinematic, detailed, 4K quality"""
response = model.generate_content(prompt)
Save image
保存图像
if response.parts and hasattr(response.parts[0], 'inline_data'):
image_data = response.parts[0].inline_data.data
with open("futuristic_city.png", "wb") as f:
f.write(image_data)
print("Image generated successfully!")
else:
print("No image generated")
**Tips for Better Prompts:**
- Be specific and detailed
- Specify art style (realistic, cartoon, oil painting, etc.)
- Include lighting, mood, and atmosphere
- Mention quality level (4K, detailed, high-quality)
- Describe colors, textures, composition
**See:** `references/generation-guide.md` for comprehensive prompting techniques
---if response.parts and hasattr(response.parts[0], 'inline_data'):
image_data = response.parts[0].inline_data.data
with open("futuristic_city.png", "wb") as f:
f.write(image_data)
print("Image generated successfully!")
else:
print("No image generated")
**提示词优化技巧:**
- 描述要具体、详细
- 指定艺术风格(写实、卡通、油画等)
- 包含光线、氛围和情绪描述
- 提及质量等级(4K、细节丰富、高质量)
- 描述颜色、纹理和构图
**参考:** `references/generation-guide.md` 中的完整提示词技巧
---Task 2: Generate 4K Images
任务2:生成4K图像
Goal: Create high-resolution 4K images with upscaling.
Python Example:
python
undefined目标: 创建高分辨率4K图像,支持分辨率升级。
Python示例:
python
undefinedGenerate with 4K quality specification
生成4K质量的图像
prompt = """A photorealistic portrait of a scientist in a modern lab:
- 4K ultra-high definition
- Sharp focus on subject
- Soft bokeh background
- Professional studio lighting
- Fine detail in textures
- Cinema-grade quality"""
response = model.generate_content(prompt)
prompt = """A photorealistic portrait of a scientist in a modern lab:
- 4K ultra-high definition
- Sharp focus on subject
- Soft bokeh background
- Professional studio lighting
- Fine detail in textures
- Cinema-grade quality"""
response = model.generate_content(prompt)
4K image will be generated
将生成4K图像
if response.parts:
with open("scientist_4k.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
**4K Features:**
- Native 4K resolution support
- Upscaling to 2K/4K
- 16:9 aspect ratio at 4K
- Enhanced detail and clarity
**See:** `references/resolution-guide.md` for resolution control
---if response.parts:
with open("scientist_4k.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
**4K特性:**
- 原生4K分辨率支持
- 可升级至2K/4K
- 4K分辨率下支持16:9宽高比
- 细节更丰富、清晰度更高
**参考:** `references/resolution-guide.md` 中的分辨率控制指南
---Task 3: Render Text in Images
任务3:在图像中渲染文本
Goal: Generate images with readable, high-quality text.
Python Example:
python
prompt = """Create a professional business card design with:
- Company name: "TechVision AI"
- Text: "Dr. Sarah Chen"
- Text: "Chief AI Officer"
- Text: "sarah.chen@techvision.ai"
- Text: "+1 (555) 123-4567"
- Modern, clean design
- Professional fonts
- Blue and white color scheme
- All text clearly readable"""
response = model.generate_content(prompt)
if response.parts:
with open("business_card.png", "wb") as f:
f.write(response.parts[0].inline_data.data)Text Rendering Best Practices:
- Explicitly specify text content in quotes
- Request "readable" or "clearly visible" text
- Keep text short and simple
- Specify font style if desired
- Use high contrast backgrounds
See: for text rendering techniques
references/generation-guide.md目标: 生成包含清晰可读高质量文本的图像。
Python示例:
python
prompt = """Create a professional business card design with:
- Company name: "TechVision AI"
- Text: "Dr. Sarah Chen"
- Text: "Chief AI Officer"
- Text: "sarah.chen@techvision.ai"
- Text: "+1 (555) 123-4567"
- Modern, clean design
- Professional fonts
- Blue and white color scheme
- All text clearly readable"""
response = model.generate_content(prompt)
if response.parts:
with open("business_card.png", "wb") as f:
f.write(response.parts[0].inline_data.data)文本渲染最佳实践:
- 用引号明确指定文本内容
- 要求“可读”或“清晰可见”的文本
- 文本内容简洁简短
- 如需指定字体风格请明确说明
- 使用高对比度背景
参考: 中的文本渲染技巧
references/generation-guide.mdTask 4: Grounded Generation (Fact-Verified Images)
任务4:事实锚定生成(事实校验图像)
Goal: Generate factually accurate images using Google Search grounding.
Python Example:
python
undefined目标: 利用Google Search锚定功能生成符合事实的准确图像。
Python示例:
python
undefinedEnable Google Search grounding for factual accuracy
启用Google Search锚定以确保事实准确性
model_grounded = genai.GenerativeModel(
"gemini-3-pro-image-preview",
tools=[{"google_search_retrieval": {}}] # Enable grounding
)
prompt = """Generate an accurate image of the International Space Station
with Earth in the background. Use current ISS configuration."""
response = model_grounded.generate_content(prompt)
if response.parts:
with open("iss_grounded.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
# Check if grounding was used
if hasattr(response, 'grounding_metadata'):
print(f"Grounding sources used: {len(response.grounding_metadata.grounding_chunks)}")
**Grounded Generation Use Cases:**
- Historical scenes (accurate to period)
- Scientific visualizations
- Current events
- Famous landmarks
- Product representations
**Benefits:**
- Factual accuracy
- Real-world grounding
- Reduced hallucination
- Up-to-date information
**Note:** Uses free Google Search quota (1,500 queries/day)
**See:** `references/grounded-generation.md` for comprehensive guide
---model_grounded = genai.GenerativeModel(
"gemini-3-pro-image-preview",
tools=[{"google_search_retrieval": {}}] # 启用锚定功能
)
prompt = """Generate an accurate image of the International Space Station
with Earth in the background. Use current ISS configuration."""
response = model_grounded.generate_content(prompt)
if response.parts:
with open("iss_grounded.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
# 检查是否使用了锚定功能
if hasattr(response, 'grounding_metadata'):
print(f"Grounding sources used: {len(response.grounding_metadata.grounding_chunks)}")
**事实锚定生成适用场景:**
- 历史场景(符合时代特征)
- 科学可视化
- 当前事件
- 著名地标
- 产品展示
**优势:**
- 事实准确性
- 真实世界锚定
- 减少幻觉
- 信息实时更新
**注意:** 免费Google Search配额(每日1500次查询)
**参考:** `references/grounded-generation.md` 中的完整指南
---Task 5: Conversational Image Editing
任务5:对话式图像编辑
Goal: Iteratively refine images through multi-turn conversation.
Python Example:
python
model = genai.GenerativeModel("gemini-3-pro-image-preview")目标: 通过多轮对话迭代优化图像。
Python示例:
python
model = genai.GenerativeModel("gemini-3-pro-image-preview")Start a chat session for conversational editing
开启对话会话进行对话式编辑
chat = model.start_chat()
chat = model.start_chat()
First generation
第一次生成
response1 = chat.send_message("Create a cozy coffee shop interior")
if response1.parts:
with open("coffee_shop_v1.png", "wb") as f:
f.write(response1.parts[0].inline_data.data)
response1 = chat.send_message("Create a cozy coffee shop interior")
if response1.parts:
with open("coffee_shop_v1.png", "wb") as f:
f.write(response1.parts[0].inline_data.data)
Refine the image
优化图像
response2 = chat.send_message("Add more plants and warm lighting")
if response2.parts:
with open("coffee_shop_v2.png", "wb") as f:
f.write(response2.parts[0].inline_data.data)
response2 = chat.send_message("Add more plants and warm lighting")
if response2.parts:
with open("coffee_shop_v2.png", "wb") as f:
f.write(response2.parts[0].inline_data.data)
Further refinement
进一步优化
response3 = chat.send_message("Make it more minimalist, remove some decorations")
if response3.parts:
with open("coffee_shop_v3.png", "wb") as f:
f.write(response3.parts[0].inline_data.data)
**Conversational Editing Features:**
- Preserves visual context across turns
- Incremental modifications
- Natural language instructions
- Multi-turn refinement
- Context-aware changes
**Example Editing Commands:**
- "Make it darker/lighter"
- "Add more [element]"
- "Change the color scheme to [colors]"
- "Make it more realistic/artistic"
- "Remove [element]"
**See:** `references/conversational-editing.md` for advanced patterns
---response3 = chat.send_message("Make it more minimalist, remove some decorations")
if response3.parts:
with open("coffee_shop_v3.png", "wb") as f:
f.write(response3.parts[0].inline_data.data)
**对话式编辑特性:**
- 多轮对话中保留视觉上下文
- 增量式修改
- 自然语言指令
- 多轮优化
- 上下文感知的变更
**示例编辑指令:**
- "调暗/调亮画面"
- "添加更多[元素]"
- "将配色改为[颜色]"
- "让画面更写实/更具艺术感"
- "移除[元素]"
**参考:** `references/conversational-editing.md` 中的高级模式
---Task 6: Custom Aspect Ratios
任务6:自定义宽高比
Goal: Generate images in specific aspect ratios.
Python Example:
python
undefined目标: 生成特定宽高比的图像。
Python示例:
python
undefined16:9 aspect ratio (4K supported)
16:9宽高比(支持4K)
prompt_169 = "A cinematic landscape in 16:9 aspect ratio, 4K quality"
prompt_169 = "A cinematic landscape in 16:9 aspect ratio, 4K quality"
Square aspect ratio
正方形宽高比
prompt_square = "A square logo design for a tech company"
prompt_square = "A square logo design for a tech company"
Portrait orientation
竖版方向
prompt_portrait = "A portrait-oriented movie poster"
response = model.generate_content(prompt_169)
prompt_portrait = "A portrait-oriented movie poster"
response = model.generate_content(prompt_169)
Image will be generated in specified ratio
图像将按指定宽高比生成
**Supported Ratios:**
- **16:9** - Wide, cinematic (4K supported)
- **1:1** - Square
- **4:3** - Standard
- **9:16** - Vertical/portrait
---
**支持的宽高比:**
- **16:9** - 宽屏、电影级(支持4K)
- **1:1** - 正方形
- **4:3** - 标准比例
- **9:16** - 竖版/肖像模式
---Task 7: Optimize Image Generation Costs
任务7:优化图像生成成本
Goal: Balance quality and cost for image generation.
Pricing:
- Text Input: $1-2 per 1M tokens
- Text Output: $6-9 per 1M tokens
- Image Output: $0.134 per image (varies by resolution)
Python Cost Optimization:
python
def generate_with_cost_tracking(prompt):
"""Generate image and track costs"""
response = model.generate_content(prompt)
# Calculate cost
usage = response.usage_metadata
input_cost = (usage.prompt_token_count / 1_000_000) * 2.00
output_cost = (usage.candidates_token_count / 1_000_000) * 9.00
image_cost = 0.134 # Per image
total_cost = input_cost + output_cost + image_cost
print(f"Input tokens: {usage.prompt_token_count} (${input_cost:.6f})")
print(f"Output tokens: {usage.candidates_token_count} (${output_cost:.6f})")
print(f"Image cost: ${image_cost:.6f}")
print(f"Total: ${total_cost:.6f}")
return response
response = generate_with_cost_tracking("A beautiful sunset over mountains")Cost Optimization Strategies:
- Batch Requests: Generate multiple images in one session
- Reuse Chat Sessions: Conversational editing is more efficient
- Specific Prompts: Clear prompts reduce regeneration needs
- Monitor Usage: Track costs per project
- Use Appropriate Quality: Not all images need 4K
See: for detailed strategies
references/pricing-optimization.md目标: 在图像生成的质量与成本间取得平衡。
定价:
- 文本输入: 每100万令牌1-2美元
- 文本输出: 每100万令牌6-9美元
- 图像输出: 每张图像0.134美元(根据分辨率有所不同)
Python成本优化示例:
python
def generate_with_cost_tracking(prompt):
"""生成图像并跟踪成本"""
response = model.generate_content(prompt)
# 计算成本
usage = response.usage_metadata
input_cost = (usage.prompt_token_count / 1_000_000) * 2.00
output_cost = (usage.candidates_token_count / 1_000_000) * 9.00
image_cost = 0.134 # 每张图像成本
total_cost = input_cost + output_cost + image_cost
print(f"Input tokens: {usage.prompt_token_count} (${input_cost:.6f})")
print(f"Output tokens: {usage.candidates_token_count} (${output_cost:.6f})")
print(f"Image cost: ${image_cost:.6f}")
print(f"Total: ${total_cost:.6f}")
return response
response = generate_with_cost_tracking("A beautiful sunset over mountains")成本优化策略:
- 批量请求: 在一个会话中生成多张图像
- 复用对话会话: 对话式编辑更高效
- 明确的提示词: 清晰的提示词减少重复生成需求
- 监控使用情况: 跟踪每个项目的成本
- 选择合适的质量: 并非所有图像都需要4K分辨率
参考: 中的详细策略
references/pricing-optimization.mdBatch Image Generation
批量图像生成
python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-image-preview")
prompts = [
"A serene mountain lake at dawn",
"A bustling market in Morocco",
"A futuristic robot assistant",
"An abstract geometric pattern"
]
for i, prompt in enumerate(prompts):
print(f"Generating image {i+1}/{len(prompts)}: {prompt}")
response = model.generate_content(prompt)
if response.parts:
with open(f"generated_{i+1}.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
print(f" Saved: generated_{i+1}.png")python
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-image-preview")
prompts = [
"A serene mountain lake at dawn",
"A bustling market in Morocco",
"A futuristic robot assistant",
"An abstract geometric pattern"
]
for i, prompt in enumerate(prompts):
print(f"Generating image {i+1}/{len(prompts)}: {prompt}")
response = model.generate_content(prompt)
if response.parts:
with open(f"generated_{i+1}.png", "wb") as f:
f.write(response.parts[0].inline_data.data)
print(f" Saved: generated_{i+1}.png")Error Handling
错误处理
python
from google.api_core import exceptions
def safe_image_generation(prompt):
"""Generate image with error handling"""
try:
response = model.generate_content(prompt)
if not response.parts:
return {"success": False, "error": "No image generated"}
if not hasattr(response.parts[0], 'inline_data'):
return {"success": False, "error": "Invalid response format"}
return {
"success": True,
"image_data": response.parts[0].inline_data.data,
"mime_type": response.parts[0].inline_data.mime_type
}
except exceptions.InvalidArgument as e:
return {"success": False, "error": f"Invalid prompt: {e}"}
except exceptions.ResourceExhausted as e:
return {"success": False, "error": f"Rate limit exceeded: {e}"}
except Exception as e:
return {"success": False, "error": f"Error: {e}"}python
from google.api_core import exceptions
def safe_image_generation(prompt):
"""带有错误处理的图像生成"""
try:
response = model.generate_content(prompt)
if not response.parts:
return {"success": False, "error": "No image generated"}
if not hasattr(response.parts[0], 'inline_data'):
return {"success": False, "error": "Invalid response format"}
return {
"success": True,
"image_data": response.parts[0].inline_data.data,
"mime_type": response.parts[0].inline_data.mime_type
}
except exceptions.InvalidArgument as e:
return {"success": False, "error": f"Invalid prompt: {e}"}
except exceptions.ResourceExhausted as e:
return {"success": False, "error": f"Rate limit exceeded: {e}"}
except Exception as e:
return {"success": False, "error": f"Error: {e}"}References
参考资料
Core Guides
- Model Setup - Nano Banana Pro configuration
- Generation Guide - Comprehensive prompting techniques
- Grounded Generation - Fact-verified image creation
- Conversational Editing - Multi-turn refinement
Optimization
- Resolution Guide - 4K and quality control
- Pricing Optimization - Cost management
Scripts
- Generate Image Script - Production-ready generation
- Grounded Generation Script - Fact-verified images
- Edit Image Script - Conversational editing
Official Resources
核心指南
- Model Setup - Nano Banana Pro配置
- Generation Guide - 完整提示词技巧
- Grounded Generation - 事实校验图像创建
- Conversational Editing - 多轮优化
优化指南
- Resolution Guide - 4K与质量控制
- Pricing Optimization - 成本管理
脚本
- Generate Image Script - 生产环境可用的生成脚本
- Grounded Generation Script - 事实校验图像脚本
- Edit Image Script - 对话式编辑脚本
官方资源
Related Skills
相关技能
- gemini-3-pro-api - Basic setup, authentication, text generation
- gemini-3-multimodal - Image INPUT (analyzing images)
- gemini-3-advanced - Advanced features (caching, batch, tools)
- gemini-3-pro-api - 基础配置、身份验证、文本生成
- gemini-3-multimodal - 图像输入(分析图像)
- gemini-3-advanced - 高级功能(缓存、批量、工具)
Best Practices
最佳实践
- Be Specific: Detailed prompts produce better results
- Specify Quality: Request 4K or high quality explicitly
- Use Grounding: Enable for factual accuracy
- Iterate Conversationally: Use chat for refinements
- Monitor Costs: Track usage, especially for 4K
- Handle Errors: Implement retry logic
- Save Images Properly: Use binary mode for writing
- 描述具体: 详细的提示词产出更好的结果
- 指定质量: 明确要求4K或高质量
- 使用锚定功能: 启用以确保事实准确性
- 对话式迭代: 使用会话进行优化
- 监控成本: 跟踪使用情况,尤其是4K图像
- 处理错误: 实现重试逻辑
- 正确保存图像: 使用二进制模式写入
Troubleshooting
故障排除
Issue: No image generated
问题:未生成图像
Solution: Check exists and has attribute
response.partsinline_data解决方案: 检查是否存在,且包含属性
response.partsinline_dataIssue: Low quality images
问题:图像质量低
Solution: Add "4K", "high quality", "detailed" to prompt
解决方案: 在提示词中添加“4K”、“高质量”、“细节丰富”
Issue: Text in images unreadable
问题:图像中的文本不可读
Solution: Specify text explicitly in quotes, request "readable text"
解决方案: 用引号明确指定文本,要求“可读文本”
Issue: Images not factually accurate
问题:图像不符合事实
Solution: Enable grounded generation with Google Search
解决方案: 启用基于Google Search的事实锚定生成
Issue: High costs
问题:成本过高
Solution: Optimize prompts, batch requests, monitor usage
解决方案: 优化提示词、批量请求、监控使用情况
Summary
总结
This skill provides complete image generation capabilities:
✅ Text-to-image generation
✅ Native 4K support
✅ Text rendering in images
✅ Grounded generation (fact-verified)
✅ Conversational editing
✅ Custom aspect ratios
✅ Cost optimization
✅ Production-ready examples
Ready to generate images? Start with Task 1: Generate Image from Text Prompt above!
本技能提供完整的图像生成功能:
✅ 文本转图像生成
✅ 原生4K支持
✅ 图像内文本渲染
✅ 事实锚定生成(事实校验)
✅ 对话式编辑
✅ 自定义宽高比
✅ 成本优化
✅ 生产环境可用示例
准备好生成图像了吗? 从上方的任务1:从文本提示词生成图像开始吧!