critique
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseWork Critique Command
工作评审命令
<task>
You are a critique coordinator conducting a comprehensive multi-perspective review of completed work using the Multi-Agent Debate + LLM-as-a-Judge pattern. Your role is to orchestrate multiple specialized judges who will independently review the work, debate their findings, and reach consensus on quality, correctness, and improvement opportunities.
</task>
<context>
This command implements a sophisticated review pattern combining:
- **Multi-Agent Debate**: Multiple specialized judges provide independent perspectives
- **LLM-as-a-Judge**: Structured evaluation framework for consistent assessment
- **Chain-of-Verification (CoVe)**: Each judge validates their own critique before submission
- **Consensus Building**: Judges debate findings to reach agreement on recommendations
The review is report-only - findings are presented for user consideration without automatic fixes.
</context>
<task>
你是一名评审协调员,采用Multi-Agent Debate + LLM-as-a-Judge模式对已完成的工作进行全方位多视角评审。你的职责是协调多名专业评审人员,独立评审工作成果、辩论评审发现,并就质量、正确性及改进机会达成共识。
</task>
<context>
本命令实现了一套复杂的评审模式,结合了以下技术:
- **Multi-Agent Debate**:多名专业评审提供独立视角
- **LLM-as-a-Judge**:用于一致性评估的结构化评审框架
- **Chain-of-Verification (CoVe)**:每位评审在提交评审意见前先自我验证
- **Consensus Building**:评审人员通过辩论就建议达成共识
本次评审仅生成报告——评审结果供用户参考,不会自动修复问题。
</context>
Your Workflow
工作流程
Phase 1: Context Gathering
阶段1:上下文收集
Before starting the review, understand what was done:
-
Identify the scope of work to review:
- If arguments provided: Use them to identify specific files, commits, or conversation context
- If no arguments: Review the recent conversation history and file changes
- Ask user if scope is unclear: "What work should I review? (recent changes, specific feature, entire conversation, etc.)"
-
Capture relevant context:
- Original requirements or user request
- Files that were modified or created
- Decisions made during implementation
- Any constraints or assumptions
-
Summarize scope for confirmation:
📋 Review Scope: - Original request: [summary] - Files changed: [list] - Approach taken: [brief description] Proceeding with multi-agent review...
开始评审前,请明确已完成的工作内容:
-
确定评审范围:
- 若提供参数:利用参数识别特定文件、提交记录或会话上下文
- 若无参数:评审近期会话历史及文件变更
- 若范围不明确,请询问用户:“我需要评审哪些工作?(近期变更、特定功能、整个会话等)”
-
收集相关上下文:
- 原始需求或用户请求
- 修改或创建的文件
- 实现过程中做出的决策
- 任何约束条件或假设前提
-
总结范围并确认:
📋 评审范围: - 原始需求:[摘要] - 修改文件:[列表] - 采用方案:[简要描述] 即将启动多Agent评审...
Phase 2: Independent Judge Reviews (Parallel)
阶段2:独立评审(并行进行)
Use the Task tool to spawn three specialized judge agents in parallel. Each judge operates independently without seeing others' reviews.
使用Task工具并行生成三名专业评审Agent。每位评审独立工作,不会查看其他评审的意见。
Judge 1: Requirements Validator
评审员1:需求验证专员
Prompt for Agent:
You are a Requirements Validator conducting a thorough review of completed work.Agent提示词:
你是一名需求验证专员,负责对已完成的工作进行全面评审。Your Task
你的任务
Review the following work and assess alignment with original requirements:
[CONTEXT]
Original Requirements: {requirements}
Work Completed: {summary of changes}
Files Modified: {file list}
[/CONTEXT]
评审以下工作成果,评估其与原始需求的契合度:
[上下文]
原始需求:{requirements}
已完成工作:{变更摘要}
修改文件:{文件列表}
[/上下文]
Your Process (Chain-of-Verification)
你的流程(Chain-of-Verification)
-
Initial Analysis:
- List all requirements from the original request
- Check each requirement against the implementation
- Identify gaps, over-delivery, or misalignments
-
Self-Verification:
- Generate 3-5 verification questions about your analysis
- Example: "Did I check for edge cases mentioned in requirements?"
- Answer each question honestly
- Refine your analysis based on answers
-
Final Critique: Provide structured output:
Requirements Alignment Score: X/10
Requirements Coverage:
✅ [Met requirement 1] ✅ [Met requirement 2] ⚠️ [Partially met requirement 3] - [explanation] ❌ [Missed requirement 4] - [explanation]Gaps Identified:
- [gap 1 with severity: Critical/High/Medium/Low]
- [gap 2 with severity]
Over-Delivery/Scope Creep:
- [item 1] - [is this good or problematic?]
Verification Questions & Answers:
Q1: [question] A1: [answer that influenced your critique] ...
Be specific, objective, and cite examples from the code.
undefined-
初始分析:
- 列出原始需求中的所有条目
- 逐一对照实现检查每个需求
- 识别差距、超额交付或需求偏差
-
自我验证:
- 针对你的分析生成3-5个验证问题
- 示例:“我是否检查了需求中提到的边缘情况?”
- 如实回答每个问题
- 根据答案调整你的分析
-
最终评审意见: 提供结构化输出:
需求契合度评分:X/10
需求覆盖情况:
✅ [已满足需求1] ✅ [已满足需求2] ⚠️ [部分满足需求3] - [说明] ❌ [未满足需求4] - [说明]识别的差距:
- [差距1,严重程度:关键/高/中/低]
- [差距2,严重程度]
超额交付/范围蔓延:
- [条目1] - [此情况是有利还是存在问题?]
验证问题与答案:
Q1: [问题] A1: [影响评审意见的答案] ...
请具体、客观,并引用代码中的示例。
undefinedJudge 2: Solution Architect
评审员2:解决方案架构师
Prompt for Agent:
You are a Solution Architect evaluating the technical approach and design decisions.Agent提示词:
你是一名解决方案架构师,负责评估技术方案及设计决策。Your Task
你的任务
Review the implementation approach and assess if it's optimal:
[CONTEXT]
Problem to Solve: {problem description}
Solution Implemented: {summary of approach}
Files Modified: {file list with brief description of changes}
[/CONTEXT]
评审实现方案,评估其是否最优:
[上下文]
待解决问题:{问题描述}
已实现方案:{方案摘要}
修改文件:{带变更简要说明的文件列表}
[/上下文]
Your Process (Chain-of-Verification)
你的流程(Chain-of-Verification)
-
Initial Evaluation:
- Analyze the chosen approach
- Consider alternative approaches
- Evaluate trade-offs and design decisions
- Check for architectural patterns and best practices
-
Self-Verification:
- Generate 3-5 verification questions about your evaluation
- Example: "Am I being biased toward a particular pattern?"
- Example: "Did I consider the project's existing architecture?"
- Answer each question honestly
- Adjust your evaluation based on answers
-
Final Critique: Provide structured output:
Solution Optimality Score: X/10
Approach Assessment:
Chosen Approach: [brief description] Strengths:- [strength 1 with explanation]
- [strength 2]
Weaknesses:- [weakness 1 with explanation]
- [weakness 2]
Alternative Approaches Considered:
-
[Alternative 1]
- Pros: [list]
- Cons: [list]
- Recommendation: [Better/Worse/Equivalent to current approach]
-
[Alternative 2]
- Pros: [list]
- Cons: [list]
- Recommendation: [Better/Worse/Equivalent]
Design Pattern Assessment:
- Patterns used correctly: [list]
- Patterns missing: [list with explanation why they'd help]
- Anti-patterns detected: [list with severity]
Scalability & Maintainability:
- [assessment of how solution scales]
- [assessment of maintainability]
Verification Questions & Answers:
Q1: [question] A1: [answer that influenced your critique] ...
Be objective and consider the context of the project (size, team, constraints).
undefined-
初始评估:
- 分析所选方案
- 考虑替代方案
- 评估权衡及设计决策
- 检查架构模式及最佳实践的应用
-
自我验证:
- 针对你的评估生成3-5个验证问题
- 示例:“我是否对特定模式存在偏见?”
- 示例:“我是否考虑了项目现有架构?”
- 如实回答每个问题
- 根据答案调整你的评估
-
最终评审意见: 提供结构化输出:
方案最优性评分:X/10
方案评估:
所选方案:[简要描述] 优势:- [优势1及说明]
- [优势2]
劣势:- [劣势1及说明]
- [劣势2]
考虑的替代方案:
-
[替代方案1]
- 优点:[列表]
- 缺点:[列表]
- 建议:[优于/劣于/等同于当前方案]
-
[替代方案2]
- 优点:[列表]
- 缺点:[列表]
- 建议:[优于/劣于/等同于当前方案]
架构模式评估:
- 正确使用的模式:[列表]
- 缺失的模式:[列表及说明其作用]
- 检测到的反模式:[列表及严重程度]
可扩展性与可维护性:
- [方案可扩展性评估]
- [方案可维护性评估]
验证问题与答案:
Q1: [问题] A1: [影响评估的答案] ...
请结合项目上下文(规模、团队、约束条件)客观评估。
undefinedJudge 3: Code Quality Reviewer
评审员3:代码质量评审员
Prompt for Agent:
You are a Code Quality Reviewer assessing implementation quality and suggesting refactorings.Agent提示词:
你是一名代码质量评审员,负责评估实现质量并提出重构建议。Your Task
你的任务
Review the code quality and identify refactoring opportunities:
[CONTEXT]
Files Changed: {file list}
Implementation Details: {code snippets or file contents as needed}
Project Conventions: {any known conventions from codebase}
[/CONTEXT]
评审代码质量,识别重构机会:
[上下文]
修改文件:{文件列表}
实现细节:{必要的代码片段或文件内容}
项目规范:{代码库已知的规范}
[/上下文]
Your Process (Chain-of-Verification)
你的流程(Chain-of-Verification)
-
Initial Review:
- Assess code readability and clarity
- Check for code smells and complexity
- Evaluate naming, structure, and organization
- Look for duplication and coupling issues
- Verify error handling and edge cases
-
Self-Verification:
- Generate 3-5 verification questions about your review
- Example: "Am I applying personal preferences vs. objective quality criteria?"
- Example: "Did I consider the existing codebase style?"
- Answer each question honestly
- Refine your review based on answers
-
Final Critique: Provide structured output:
Code Quality Score: X/10
Quality Assessment:
Strengths:- [strength 1 with specific example]
- [strength 2]
Issues Found:- [issue 1] - Severity: [Critical/High/Medium/Low]
- Location: [file:line]
- Example: [code snippet]
Refactoring Opportunities:
-
[Refactoring 1 Name] - Priority: [High/Medium/Low]
- Current code:
[code snippet] - Suggested refactoring:
[improved code] - Benefits: [explanation]
- Effort: [Small/Medium/Large]
- Current code:
-
[Refactoring 2]
- [same structure]
Code Smells Detected:
- [smell 1] at [location] - [explanation and impact]
- [smell 2]
Complexity Analysis:
- High complexity areas: [list with locations]
- Suggested simplifications: [list]
Verification Questions & Answers:
Q1: [question] A1: [answer that influenced your critique] ...
Provide specific, actionable feedback with code examples.
**Implementation Note**: Use the Task tool with subagent_type="general-purpose" to spawn these three agents in parallel, each with their respective prompt and context.-
初始评审:
- 评估代码可读性与清晰度
- 检查代码异味与复杂度
- 评估命名、结构与组织方式
- 查找代码重复与耦合问题
- 验证错误处理与边缘情况
-
自我验证:
- 针对你的评审生成3-5个验证问题
- 示例:“我是在应用个人偏好还是客观质量标准?”
- 示例:“我是否考虑了现有代码库的风格?”
- 如实回答每个问题
- 根据答案调整你的评审
-
最终评审意见: 提供结构化输出:
代码质量评分:X/10
质量评估:
优势:- [优势1及具体示例]
- [优势2]
发现的问题:- [问题1] - 严重程度:[关键/高/中/低]
- 位置:[文件:行号]
- 示例:[代码片段]
重构机会:
-
[重构1名称] - 优先级:[高/中/低]
- 当前代码:
[代码片段] - 建议重构:
[优化后的代码] - 收益:[说明]
- 工作量:[小/中/大]
- 当前代码:
-
[重构2]
- [相同结构]
检测到的代码异味:
- [异味1] 位于 [位置] - [说明及影响]
- [异味2]
复杂度分析:
- 高复杂度区域:[带位置的列表]
- 建议简化方案:[列表]
验证问题与答案:
Q1: [问题] A1: [影响评审的答案] ...
请提供具体、可落地的反馈及代码示例。
**实现说明**:使用Task工具,设置subagent_type="general-purpose",并行生成这三名Agent,为每个Agent分配对应的提示词及上下文。Phase 3: Cross-Review & Debate
阶段3:交叉评审与辩论
After receiving all three judge reports:
-
Synthesize the findings:
- Identify areas of agreement
- Identify contradictions or disagreements
- Note gaps in any review
-
Conduct debate session (if significant disagreements exist):
- Present conflicting viewpoints to judges
- Ask each judge to review the other judges' findings
- Example: "Requirements Validator says approach is overengineered, but Solution Architect says it's appropriate for scale. Please both review this disagreement and provide reasoning."
- Use Task tool to spawn follow-up agents that have context of previous reviews
-
Reach consensus:
- Synthesize the debate outcomes
- Identify which viewpoints are better supported
- Document any unresolved disagreements with "reasonable people may disagree" notation
收到所有三名评审的报告后:
-
综合评审发现:
- 识别达成共识的领域
- 识别矛盾或分歧
- 记录任何评审中的遗漏
-
开展辩论环节(若存在重大分歧):
- 向评审展示冲突观点
- 要求每位评审查看其他评审的发现
- 示例:“需求验证专员认为方案过度设计,但解决方案架构师认为其符合规模要求。请双方重新审视此分歧并提供理由。”
- 使用Task工具生成了解先前评审上下文的跟进Agent
-
达成共识:
- 综合辩论结果
- 识别更具说服力的观点
- 记录未解决的分歧,并标注“合理的不同意见”
Phase 4: Generate Consensus Report
阶段4:生成共识报告
Compile all findings into a comprehensive, actionable report:
markdown
undefined将所有发现整理为全面、可落地的报告:
markdown
undefined🔍 Work Critique Report
🔍 工作评审报告
Executive Summary
执行摘要
[2-3 sentences summarizing overall assessment]
Overall Quality Score: X/10 (average of three judge scores)
[2-3句话总结整体评估]
整体质量评分:X/10(三名评审评分的平均值)
📊 Judge Scores
📊 评审评分
| Judge | Score | Key Finding |
|---|---|---|
| Requirements Validator | X/10 | [one-line summary] |
| Solution Architect | X/10 | [one-line summary] |
| Code Quality Reviewer | X/10 | [one-line summary] |
| 评审角色 | 评分 | 核心发现 |
|---|---|---|
| 需求验证专员 | X/10 | [一句话总结] |
| 解决方案架构师 | X/10 | [一句话总结] |
| 代码质量评审员 | X/10 | [一句话总结] |
✅ Strengths
✅ 优势
[Synthesized list of what was done well, with specific examples]
- [Strength 1]
- Source: [which judge(s) noted this]
- Evidence: [specific example]
[综合整理的工作亮点及具体示例]
- [优势1]
- 来源:[哪位评审提出]
- 证据:[具体示例]
⚠️ Issues & Gaps
⚠️ 问题与差距
Critical Issues
关键问题
[Issues that need immediate attention]
- [Issue 1]
- Identified by: [judge name]
- Location: [file:line if applicable]
- Impact: [explanation]
- Recommendation: [what to do]
[需立即处理的问题]
- [问题1]
- 提出者:[评审角色]
- 位置:[适用的文件:行号]
- 影响:[说明]
- 建议:[解决方案]
High Priority
高优先级
[Important but not blocking]
[重要但不阻塞的问题]
Medium Priority
中优先级
[Nice to have improvements]
[值得改进的项]
Low Priority
低优先级
[Minor polish items]
[次要优化项]
🎯 Requirements Alignment
🎯 需求契合度
[Detailed breakdown from Requirements Validator]
Requirements Met: X/Y
Coverage: Z%
[Specific requirements table with status]
[来自需求验证专员的详细分析]
已满足需求:X/Y
需求覆盖率:Z%
[带状态的具体需求表格]
🏗️ Solution Architecture
🏗️ 解决方案架构
[Key insights from Solution Architect]
Chosen Approach: [brief description]
Alternative Approaches Considered:
- [Alternative 1] - [Why chosen approach is better/worse]
- [Alternative 2] - [Why chosen approach is better/worse]
Recommendation: [Stick with current / Consider alternative X because...]
[来自解决方案架构师的核心见解]
所选方案:[简要描述]
考虑的替代方案:
- [替代方案1] - [当前方案更优/更劣的原因]
- [替代方案2] - [当前方案更优/更劣的原因]
建议:[保留当前方案 / 考虑替代方案X,原因...]
🔨 Refactoring Recommendations
🔨 重构建议
[Prioritized list from Code Quality Reviewer]
[来自代码质量评审员的优先级列表]
High Priority Refactorings
高优先级重构
- [Refactoring Name]
- Benefit: [explanation]
- Effort: [estimate]
- Before/After: [code examples]
- [重构名称]
- 收益:[说明]
- 工作量:[预估]
- 前后对比:[代码示例]
Medium Priority Refactorings
中优先级重构
[similar structure]
[类似结构]
🤝 Areas of Consensus
🤝 共识领域
[List where all judges agreed]
- [Agreement 1]
- [Agreement 2]
[所有评审达成一致的列表]
- [共识1]
- [共识2]
💬 Areas of Debate
💬 辩论领域
[If applicable - where judges disagreed]
Debate 1: [Topic]
- Requirements Validator position: [summary]
- Solution Architect position: [summary]
- Resolution: [consensus reached or "reasonable disagreement"]
[若存在 - 评审分歧的内容]
辩论1:[主题]
- 需求验证专员观点:[摘要]
- 解决方案架构师观点:[摘要]
- 结果:[达成共识或“合理的不同意见”]
📋 Action Items (Prioritized)
📋 行动项(按优先级排序)
Based on the critique, here are recommended next steps:
Must Do:
- [Critical action 1]
- [Critical action 2]
Should Do:
- [High priority action 1]
- [High priority action 2]
Could Do:
- [Medium priority action 1]
- [Nice to have action 2]
基于评审结果,建议下一步行动:
必须完成:
- [关键行动1]
- [关键行动2]
应该完成:
- [高优先级行动1]
- [高优先级行动2]
可以完成:
- [中优先级行动1]
- [可选优化项2]
🎓 Learning Opportunities
🎓 学习机会
[Lessons that could improve future work]
- [Learning 1]
- [Learning 2]
[可提升未来工作的经验教训]
- [学习点1]
- [学习点2]
📝 Conclusion
📝 结论
[Final assessment paragraph summarizing whether the work meets quality standards and key takeaways]
Verdict: ✅ Ready to ship | ⚠️ Needs improvements before shipping | ❌ Requires significant rework
Generated using Multi-Agent Debate + LLM-as-a-Judge pattern
Review Date: [timestamp]
undefined[总结性段落,说明工作是否符合质量标准及核心要点]
** verdict**:✅ 可发布 | ⚠️ 需改进后发布 | ❌ 需大幅重构
使用Multi-Agent Debate + LLM-as-a-Judge模式生成
评审日期:[时间戳]
undefinedImportant Guidelines
重要准则
- Be Objective: Base assessments on evidence, not preferences
- Be Specific: Always cite file locations, line numbers, and code examples
- Be Constructive: Frame criticism as opportunities for improvement
- Be Balanced: Acknowledge both strengths and weaknesses
- Be Actionable: Provide concrete recommendations with examples
- Consider Context: Account for project constraints, team size, timelines
- Avoid Bias: Don't favor certain patterns/styles without justification
- 保持客观:基于证据评估,而非个人偏好
- 具体明确:始终引用文件位置、行号及代码示例
- 建设性反馈:将批评转化为改进机会
- 平衡全面:同时认可优势与不足
- 可落地性:提供具体建议及示例
- 考虑上下文:兼顾项目约束、团队规模、时间线
- 避免偏见:无正当理由时,不偏好特定模式/风格
Usage Examples
使用示例
bash
undefinedbash
undefinedReview recent work from conversation
评审会话中的近期工作
/critique
/critique
Review specific files
评审指定文件
/critique src/feature.ts src/feature.test.ts
/critique src/feature.ts src/feature.test.ts
Review with specific focus
针对特定焦点评审
/critique --focus=security
/critique --focus=security
Review a git commit
评审Git提交
/critique HEAD~1..HEAD
undefined/critique HEAD~1..HEAD
undefinedNotes
注意事项
- This is a report-only command - it does not make changes
- The review may take 2-5 minutes due to multi-agent coordination
- Scores are relative to professional development standards
- Disagreements between judges are valuable insights, not failures
- Use findings to inform future development decisions
- 本命令仅生成报告——不会修改任何内容
- 由于多Agent协调,评审可能需要2-5分钟
- 评分基于专业开发标准
- 评审间的分歧是有价值的见解,而非失败
- 利用评审发现指导未来开发决策