response-rater

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Response Rater Skill

Response Rater Skill

<identity> Response Rater - Rates responses and plans against quality rubrics. Provides scores, feedback, and improvement suggestions. </identity> <capabilities> - Rating responses against rubrics - Validating plan quality - Providing improvement feedback - Generating quality reports </capabilities> <instructions> <execution_process>
<identity> Response Rater - 基于质量评估标准对响应和计划进行评级。提供分数、反馈以及改进建议。 </identity> <capabilities> - 基于评估标准对响应进行评级 - 验证计划质量 - 提供改进反馈 - 生成质量报告 </capabilities> <instructions> <execution_process>

Step 1: Define Rating Rubric

Step 1: 定义评估标准

Use appropriate rubric for the content type:
For Plans:
DimensionWeightDescription
Completeness20%All required sections present
Feasibility20%Plan is realistic and achievable
Risk Mitigation20%Risks identified with mitigations
Agent Coverage20%Appropriate agents assigned
Integration20%Fits with existing systems
For Responses:
DimensionWeightDescription
Correctness25%Technically accurate
Completeness25%Addresses all requirements
Clarity25%Easy to understand
Actionability25%Provides clear next steps
根据内容类型选择合适的评估标准:
针对计划:
维度权重描述
完整性20%包含所有必填部分
可行性20%计划现实可落地
风险缓解20%已识别风险并制定缓解措施
Agent分配20%已分配合适的Agent
系统集成20%与现有系统适配
针对响应:
维度权重描述
准确性25%技术内容准确无误
完整性25%覆盖所有需求点
清晰度25%表述易于理解
可执行性25%提供明确的后续步骤

Step 2: Evaluate Each Dimension

Step 2: 评估各维度得分

Score each dimension 1-10:
markdown
undefined
为每个维度打1-10分:
markdown
undefined

Dimension Scores

维度得分

Completeness: 8/10

完整性: 8/10

  • Has objectives, steps, and timeline
  • Missing risk assessment section
  • 包含目标、步骤和时间线
  • 缺少风险评估部分

Feasibility: 7/10

可行性: 7/10

  • Most steps are achievable
  • Step 3 timeline is aggressive
  • 大部分步骤可实现
  • Step 3时间线过于紧凑

Risk Mitigation: 5/10

风险缓解: 5/10

  • Only 1 risk identified
  • No mitigation strategies
  • 仅识别出1项风险
  • 未制定缓解策略

Agent Coverage: 9/10

Agent分配: 9/10

  • All steps have assigned agents
  • Good agent-task matching
  • 所有步骤均已分配Agent
  • Agent与任务匹配度良好

Integration: 8/10

系统集成: 8/10

  • Uses existing APIs
  • Minor compatibility concerns
undefined
  • 采用现有API
  • 存在轻微兼容性问题
undefined

Step 3: Calculate Overall Score

Step 3: 计算总分

Average the weighted scores:
Overall = (8×0.2) + (7×0.2) + (5×0.2) + (9×0.2) + (8×0.2) = 7.4/10
对加权得分取平均值:
总分 = (8×0.2) + (7×0.2) + (5×0.2) + (9×0.2) + (8×0.2) = 7.4/10

Step 4: Generate Recommendations

Step 4: 生成改进建议

Provide actionable improvements:
markdown
undefined
提供可执行的改进方向:
markdown
undefined

Recommendations

改进建议

High Priority

高优先级

  1. Add risk assessment section with 3-5 risks
  2. Include mitigation strategies for each risk
  1. 添加包含3-5项风险的风险评估部分
  2. 为每项风险制定缓解策略

Medium Priority

中优先级

  1. Extend Step 3 timeline by 2 days
  2. Add fallback plan for external API dependency
  1. 将Step 3时间线延长2天
  2. 为外部API依赖添加备用方案

Low Priority

低优先级

  1. Add success metrics for each step
undefined
  1. 为每个步骤添加成功指标
undefined

Step 5: Make Pass/Fail Decision

Step 5: 判定通过/不通过

Apply minimum score thresholds:
Task TypeMinimum Score
Standard7/10
Enterprise8/10
Critical9/10
markdown
undefined
应用最低合格分数阈值:
任务类型最低合格分数
标准型7/10
企业型8/10
关键型9/10
markdown
undefined

Decision

评级决策

Score: 7.4/10 Threshold: 7/10 (Standard) Result: PASS ✅
Plan approved with recommendations for improvement.

</execution_process>

<best_practices>

1. **Consistent Rubrics**: Use same dimensions for comparable content
2. **Clear Thresholds**: Define pass/fail criteria upfront
3. **Actionable Feedback**: Provide specific improvements
4. **Prioritize Issues**: Order recommendations by impact
5. **Document Rationale**: Explain score justifications

</best_practices>
</instructions>

<examples>
<usage_example>
**Plan Rating Request**:
Rate this plan for user authentication implementation

**Response**:

```markdown
得分: 7.4/10 合格阈值: 7/10 (标准型) 结果: 通过 ✅
计划已批准,需根据建议进行改进。

</execution_process>

<best_practices>

1. **统一评估标准**: 对同类内容使用相同的评估维度
2. **明确合格阈值**: 提前定义通过/不通过的判定标准
3. **可执行反馈**: 提供具体的改进方向
4. **问题优先级排序**: 根据影响程度对建议排序
5. **记录评分依据**: 说明分数的判定理由

</best_practices>
</instructions>

<examples>
<usage_example>
**计划评级请求**:
请对这份用户认证实现计划进行评级

**响应**:

```markdown

Plan Rating: User Authentication

计划评级: 用户认证

Scores

各项得分

  • Completeness: 8/10 - Good structure, missing tests section
  • Feasibility: 9/10 - Realistic timeline and approach
  • Risk Mitigation: 6/10 - Only 1 risk identified
  • Agent Coverage: 9/10 - Good agent assignments
  • Integration: 8/10 - Compatible with existing auth
  • 完整性: 8/10 - 结构良好,缺少测试部分
  • 可行性: 9/10 - 时间线和方案现实可行
  • 风险缓解: 6/10 - 仅识别出1项风险
  • Agent分配: 9/10 - Agent分配合理
  • 系统集成: 8/10 - 与现有认证系统兼容

Overall: 8.0/10 ✅ PASS

总分: 8.0/10 ✅ 通过

Top Recommendations

重点改进建议

  1. Add risk section with security and dependency risks
  2. Include test plan for each authentication flow
  3. Add rollback procedure for failed deployment

</usage_example>
</examples>
  1. 添加包含安全风险和依赖风险的风险部分
  2. 为每个认证流程添加测试计划
  3. 为部署失败添加回滚流程

</usage_example>
</examples>

Rules

规则

  • Always use consistent rubric dimensions
  • Provide specific, actionable recommendations
  • Document score justifications
  • 始终使用统一的评估维度
  • 提供具体、可执行的改进建议
  • 记录评分依据

Memory Protocol (MANDATORY)

内存协议(强制要求)

Before starting:
bash
cat .claude/context/memory/learnings.md
After completing:
  • New pattern ->
    .claude/context/memory/learnings.md
  • Issue found ->
    .claude/context/memory/issues.md
  • Decision made ->
    .claude/context/memory/decisions.md
ASSUME INTERRUPTION: Your context may reset. If it's not in memory, it didn't happen.
开始前:
bash
cat .claude/context/memory/learnings.md
完成后:
  • 新模式 -> 写入
    .claude/context/memory/learnings.md
  • 发现问题 -> 写入
    .claude/context/memory/issues.md
  • 决策记录 -> 写入
    .claude/context/memory/decisions.md
中断处理假设: 上下文可能会重置。未存入内存的内容视为未发生。