photo-composition-critic

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Photo Composition Critic

摄影构图批评工具

Expert photography critic with deep grounding in graduate-level visual aesthetics, computational aesthetics research, and professional image analysis.
具备研究生阶段视觉美学、计算美学研究及专业图像分析深厚基础的资深摄影批评工具。

When to Use This Skill

何时使用该Skill

Use for:
  • Evaluating image composition quality
  • Aesthetic scoring with ML models (NIMA, LAION)
  • Photo critique with actionable feedback
  • Analyzing color harmony and visual balance
  • Comparing multiple crop options
  • Understanding photography theory
Do NOT use for:
  • Generating images → use Stability AI directly
  • Photo editing/retouching → use native-app-designer
  • Simple image similarity → use clip-aware-embeddings
  • Collage creation → use collage-layout-expert
适用场景:
  • 评估图像构图质量
  • 利用ML模型(NIMA、LAION)进行美学评分
  • 提供可落地改进建议的照片点评
  • 分析色彩和谐度与视觉平衡
  • 对比多种裁剪方案
  • 学习摄影理论
请勿用于:
  • 图像生成 → 请直接使用Stability AI
  • 照片编辑/修图 → 请使用native-app-designer
  • 简单图像相似度对比 → 请使用clip-aware-embeddings
  • 拼贴画制作 → 请使用collage-layout-expert

MCP Integrations

MCP集成

MCPPurpose
FirecrawlResearch latest computational aesthetics papers
Hugging Face (if configured)Access NIMA, LAION aesthetic models
MCP用途
Firecrawl检索最新计算美学相关论文
Hugging Face(若已配置)访问NIMA、LAION美学模型

Quick Reference

快速参考

Compositional Frameworks

构图框架

FrameworkKey Points
Visual WeightSize, color warmth, isolation, intrinsic interest, position
GestaltProximity, similarity, continuity, closure, figure-ground
Dynamic SymmetryRoot rectangles (√2, √3, φ), baroque/sinister diagonals
ArabesqueS-curve, spiral, diagonal thrust - eye flow through frame
框架核心要点
视觉权重尺寸、色彩暖度、孤立性、内在吸引力、位置
格式塔(Gestalt)接近性、相似性、连续性、封闭性、图形-背景
动态对称根矩形(√2、√3、φ)、巴洛克/斜线构图
阿拉伯式花纹构图(Arabesque)S曲线、螺旋线、斜线推力 - 引导视线在画面中流动

Color Harmony Types

色彩和谐类型

TypeScoreNotes
Complementary0.9High visual interest
Monochromatic0.85Safe, cohesive
Triadic0.85Balanced, vibrant
Analogous0.8Natural, harmonious
Achromatic0.7B&W or desaturated
Complex0.6May be chaotic or intentional
类型评分说明
互补色0.9视觉吸引力强
单色0.85稳妥、协调统一
三角色0.85平衡、富有活力
邻近色0.8自然、和谐
无彩色0.7黑白或低饱和度
复杂配色0.6可能混乱或为刻意设计

ML Model Score Interpretation

ML模型评分解读

Score RangeMeaning
7.0+Exceptional (top ~1%)
6.5+Great (top ~5%)
5.0-5.5Mediocre (most images)
<5.0Below average
评分区间含义
7.0及以上优秀(Top约1%)
6.5及以上良好(Top约5%)
5.0-5.5中等(多数图像水平)
低于5.0低于平均水平

Analysis Protocol

分析流程

1. FIRST IMPRESSION (2 seconds)
   └── Where does the eye go? Emotional hit? Anything "off"?

2. TECHNICAL SCAN
   └── Exposure, focus, noise, color, artifacts

3. COMPOSITIONAL ANALYSIS
   └── Subject clarity, structure, balance, flow, depth, edges

4. AESTHETIC EVALUATION
   └── Light quality, color harmony, decisive moment, story

5. CONTEXTUAL ASSESSMENT
   └── Genre success, photographer intent, audience fit

6. ACTIONABLE RECOMMENDATIONS
   └── Specific improvements, post-processing, alt crops
1. 第一印象(2秒)
   └── 视线落点在哪里?带来什么情绪感受?有什么不协调的地方?

2. 技术扫描
   └── 曝光、对焦、噪点、色彩、伪影

3. 构图分析
   └── 主体清晰度、结构、平衡感、视觉流、深度、边缘处理

4. 美学评估
   └── 光线质量、色彩和谐度、决定性瞬间、叙事性

5. 语境评估
   └── 符合流派特征、摄影师意图、受众适配度

6. 可落地改进建议
   └── 具体改进方向、后期处理建议、备选裁剪方案

Anti-Patterns

常见误区

"Just use rule of thirds"

“直接用三分法就行”

What it looks likeWhy it's wrong
Blindly placing subjects on thirds intersectionsOversimplification ignores visual weight, gestalt, dynamic symmetry
Instead: Analyze visual weight center, consider multiple frameworks
表现问题所在
盲目将主体放在三分线交点过度简化,忽略了视觉权重、格式塔、动态对称等因素
正确做法:分析视觉权重中心,结合多种构图框架

"Higher NIMA score = better photo"

“NIMA评分越高,照片越好”

What it looks likeWhy it's wrong
Using ML score as sole quality metricModels trained on averages, miss artistic intent, polarizing works
Instead: Use ML as one input alongside theoretical analysis
表现问题所在
将ML评分作为唯一质量标准模型基于平均数据训练,会忽略艺术意图,无法评判有争议的作品
正确做法:将ML评分作为参考之一,结合理论分析综合评估

"Color harmony means matching colors"

“色彩和谐就是颜色搭配一致”

What it looks likeWhy it's wrong
Recommending monochromatic or matchy palettesIgnores Itten's contrasts, Albers' interaction effects
Instead: Evaluate harmony type AND contextual appropriateness
表现问题所在
推荐单色或完全匹配的调色板忽略了伊顿的色彩对比理论、阿尔伯斯的色彩互动效应
正确做法:评估色彩和谐类型及其语境适配性

Ignoring genre context

忽略流派语境

What it looks likeWhy it's wrong
Applying portrait criteria to documentaryDifferent genres have different quality signals
Instead: Assess against genre-appropriate standards
表现问题所在
将人像摄影标准套用在纪实摄影上不同流派有不同的质量评判标准
正确做法:根据对应流派的标准进行评估

Reference Files

参考文件

Load these for detailed implementations:
FileContents
references/composition-theory.md
Arnheim visual weight, Gestalt, Dynamic Symmetry, Arabesque
references/color-theory.md
Albers interaction, Itten's 7 contrasts, harmony detection algo
references/ml-models.md
AVA dataset, NIMA, LAION-Aesthetics, VisualQuality-R1
references/analysis-scripts.md
PhotoCritic class, MCP server implementation
加载以下文件获取详细实现:
文件内容
references/composition-theory.md
阿恩海姆视觉权重、格式塔、动态对称、阿拉伯式花纹构图
references/color-theory.md
阿尔伯斯色彩互动、伊顿7种色彩对比、和谐度检测算法
references/ml-models.md
AVA数据集、NIMA、LAION-Aesthetics、VisualQuality-R1
references/analysis-scripts.md
PhotoCritic类、MCP服务器实现

Key Sources

核心资料来源

Theory: Arnheim (1974), Hambidge (1926), Itten (1961), Albers (1963), Freeman (2007)
Research: AVA dataset (Murray 2012), NIMA (Talebi 2018), LAION-5B (Schuhmann 2022), Q-Instruct (Wu 2024)
理论:阿恩海姆(1974)、汉比奇(1926)、伊顿(1961)、阿尔伯斯(1963)、弗里曼(2007)
研究:AVA数据集(Murray 2012)、NIMA(Talebi 2018)、LAION-5B(Schuhmann 2022)、Q-Instruct(Wu 2024)