response-rater

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Response Rater Skill

<identity> Response Rater - Rates responses and plans against quality rubrics. Provides scores, feedback, and improvement suggestions. </identity> <capabilities> - Rating responses against rubrics - Validating plan quality - Providing improvement feedback - Generating quality reports </capabilities> <instructions> <execution_process>

<identity> Response Rater - 基于质量评估标准对响应和计划进行评级。提供分数、反馈以及改进建议。 </identity> <capabilities> - 基于评估标准对响应进行评级 - 验证计划质量 - 提供改进反馈 - 生成质量报告 </capabilities> <instructions> <execution_process>

Step 1: Define Rating Rubric

Step 1: 定义评估标准

Use appropriate rubric for the content type:

For Plans:

Dimension	Weight	Description
Completeness	20%	All required sections present
Feasibility	20%	Plan is realistic and achievable
Risk Mitigation	20%	Risks identified with mitigations
Agent Coverage	20%	Appropriate agents assigned
Integration	20%	Fits with existing systems

For Responses:

Dimension	Weight	Description
Correctness	25%	Technically accurate
Completeness	25%	Addresses all requirements
Clarity	25%	Easy to understand
Actionability	25%	Provides clear next steps

根据内容类型选择合适的评估标准：

针对计划:

维度	权重	描述
完整性	20%	包含所有必填部分
可行性	20%	计划现实可落地
风险缓解	20%	已识别风险并制定缓解措施
Agent分配	20%	已分配合适的Agent
系统集成	20%	与现有系统适配

针对响应:

维度	权重	描述
准确性	25%	技术内容准确无误
完整性	25%	覆盖所有需求点
清晰度	25%	表述易于理解
可执行性	25%	提供明确的后续步骤

Step 2: Evaluate Each Dimension

Step 2: 评估各维度得分

Score each dimension 1-10:

markdown

undefined

为每个维度打1-10分：

markdown

undefined

Dimension Scores

维度得分

Completeness: 8/10

完整性: 8/10

Has objectives, steps, and timeline
Missing risk assessment section

包含目标、步骤和时间线
缺少风险评估部分

Feasibility: 7/10

可行性: 7/10

Most steps are achievable
Step 3 timeline is aggressive

大部分步骤可实现
Step 3时间线过于紧凑

Risk Mitigation: 5/10

风险缓解: 5/10

Only 1 risk identified
No mitigation strategies

仅识别出1项风险
未制定缓解策略

Agent Coverage: 9/10

Agent分配: 9/10

All steps have assigned agents
Good agent-task matching

所有步骤均已分配Agent
Agent与任务匹配度良好

Integration: 8/10

系统集成: 8/10

Uses existing APIs
Minor compatibility concerns

undefined

采用现有API
存在轻微兼容性问题

undefined

Step 3: Calculate Overall Score

Step 3: 计算总分

Average the weighted scores:

Overall = (8×0.2) + (7×0.2) + (5×0.2) + (9×0.2) + (8×0.2) = 7.4/10

对加权得分取平均值：

总分 = (8×0.2) + (7×0.2) + (5×0.2) + (9×0.2) + (8×0.2) = 7.4/10

Step 4: Generate Recommendations

Step 4: 生成改进建议

Provide actionable improvements:

markdown

undefined

提供可执行的改进方向：

markdown

undefined

Recommendations

改进建议

High Priority

高优先级

Add risk assessment section with 3-5 risks
Include mitigation strategies for each risk

添加包含3-5项风险的风险评估部分
为每项风险制定缓解策略

Medium Priority

中优先级

Extend Step 3 timeline by 2 days
Add fallback plan for external API dependency

将Step 3时间线延长2天
为外部API依赖添加备用方案

Low Priority

低优先级

Add success metrics for each step

undefined

为每个步骤添加成功指标

undefined

Step 5: Make Pass/Fail Decision

Step 5: 判定通过/不通过

Apply minimum score thresholds:

Task Type	Minimum Score
Standard	7/10
Enterprise	8/10
Critical	9/10

markdown

undefined

应用最低合格分数阈值：

任务类型	最低合格分数
标准型	7/10
企业型	8/10
关键型	9/10

markdown

undefined

Decision

评级决策

Score: 7.4/10 Threshold: 7/10 (Standard) Result: PASS ✅

Plan approved with recommendations for improvement.


</execution_process>

<best_practices>

1. **Consistent Rubrics**: Use same dimensions for comparable content
2. **Clear Thresholds**: Define pass/fail criteria upfront
3. **Actionable Feedback**: Provide specific improvements
4. **Prioritize Issues**: Order recommendations by impact
5. **Document Rationale**: Explain score justifications

</best_practices>
</instructions>

<examples>
<usage_example>
**Plan Rating Request**:

Rate this plan for user authentication implementation


**Response**:

```markdown

得分: 7.4/10 合格阈值: 7/10 (标准型) 结果: 通过 ✅

计划已批准，需根据建议进行改进。


</execution_process>

<best_practices>

1. **统一评估标准**: 对同类内容使用相同的评估维度
2. **明确合格阈值**: 提前定义通过/不通过的判定标准
3. **可执行反馈**: 提供具体的改进方向
4. **问题优先级排序**: 根据影响程度对建议排序
5. **记录评分依据**: 说明分数的判定理由

</best_practices>
</instructions>

<examples>
<usage_example>
**计划评级请求**:

请对这份用户认证实现计划进行评级


**响应**:

```markdown

Plan Rating: User Authentication

计划评级: 用户认证

Scores

各项得分

Completeness: 8/10 - Good structure, missing tests section
Feasibility: 9/10 - Realistic timeline and approach
Risk Mitigation: 6/10 - Only 1 risk identified
Agent Coverage: 9/10 - Good agent assignments
Integration: 8/10 - Compatible with existing auth

完整性: 8/10 - 结构良好，缺少测试部分
可行性: 9/10 - 时间线和方案现实可行
风险缓解: 6/10 - 仅识别出1项风险
Agent分配: 9/10 - Agent分配合理
系统集成: 8/10 - 与现有认证系统兼容

Overall: 8.0/10 ✅ PASS

总分: 8.0/10 ✅ 通过

Top Recommendations

重点改进建议

Add risk section with security and dependency risks
Include test plan for each authentication flow
Add rollback procedure for failed deployment


</usage_example>
</examples>

添加包含安全风险和依赖风险的风险部分
为每个认证流程添加测试计划
为部署失败添加回滚流程


</usage_example>
</examples>

Rules

规则

Always use consistent rubric dimensions
Provide specific, actionable recommendations
Document score justifications

始终使用统一的评估维度
提供具体、可执行的改进建议
记录评分依据

Memory Protocol (MANDATORY)

内存协议（强制要求）

Before starting:

bash

cat .claude/context/memory/learnings.md

After completing:

New pattern ->
```
.claude/context/memory/learnings.md
```
Issue found ->
```
.claude/context/memory/issues.md
```
Decision made ->
```
.claude/context/memory/decisions.md
```

ASSUME INTERRUPTION: Your context may reset. If it's not in memory, it didn't happen.

开始前:

bash

cat .claude/context/memory/learnings.md

完成后:

新模式 -> 写入
```
.claude/context/memory/learnings.md
```
发现问题 -> 写入
```
.claude/context/memory/issues.md
```
决策记录 -> 写入
```
.claude/context/memory/decisions.md
```

中断处理假设: 上下文可能会重置。未存入内存的内容视为未发生。