Loading...
Loading...
Compare original and translation side by side
build-skillsreview-prbuild-skillsreview-prSkill under test: [name]
Test task: [one-line description]
Date: [YYYY-MM-DD]
Method: Follow SKILL.md steps N–M exactly as writtenSkill under test: [name]
Test task: [one-line description]
Date: [YYYY-MM-DD]
Method: Follow SKILL.md steps N–M exactly as writtenreferences/references/**F-[NN] — [short title]** (P[0-2])
[What happened, what the instructions said, what was missing or ambiguous.]
Fix: [Specific text edit that would prevent this derailment.]references/friction-classification.md**F-[NN] — [简短标题]** (P[0-2])
[事件经过、指令原文、缺失或模糊的内容。]
Fix: [可避免此卡点的具体文本编辑方案。]references/friction-classification.md| Metric | Value |
|---|---|
| Total steps attempted | |
| Clean passes | |
| P0 (blocks progress) | |
| P1 (causes confusion) | |
| P2 (minor annoyance) |
references/root-cause-taxonomy.md| Metric | Value |
|---|---|
| Total steps attempted | |
| Clean passes | |
| P0 (blocks progress) | |
| P1 (causes confusion) | |
| P2 (minor annoyance) |
references/root-cause-taxonomy.mdderail-notes/NN-dogfood-[topic].mdundefinedderail-notes/NN-dogfood-[topic].mdundefined| Priority | Count | Friction points |
|---|---|---|
| P0 | N | F-xx, ... |
| P1 | N | F-xx, ... |
| P2 | N | F-xx, ... |
undefined| Priority | Count | Friction points |
|---|---|---|
| P0 | N | F-xx, ... |
| P1 | N | F-xx, ... |
| P2 | N | F-xx, ... |
undefinedreferences/fix-patterns.mdreferences/fix-patterns.mdundefinedundefinedundefinedundefined02-dogfood-[topic].md02-dogfood-[topic].mdexternalexternal| Do this | Not that |
|---|---|
| Follow each step literally as written | Fill in gaps from personal knowledge |
| Record every uncertainty as a friction point | Skip ambiguities that seem "minor" |
| Fix the source files directly | Create a separate errata or known-issues file |
| Test on a real task within the skill's scope | Use a toy example or hypothetical scenario |
| Write structured derail notes with IDs and severities | Write prose complaints without classification |
| Verify fixes with grep and routing checks | Assume fixes are correct without verification |
| Report what worked well alongside what broke | Write a purely negative report |
| Do this | Not that |
|---|---|
| 严格遵循每一步的字面描述 | 凭借个人知识填补空白 |
| 将所有不确定性记录为卡点 | 忽略看似“微小”的歧义 |
| 直接修改源文件进行修复 | 创建单独的勘误表或已知问题文件 |
| 在Skill范围内的真实任务上测试 | 使用玩具示例或假设场景 |
| 撰写带ID和级别的结构化卡点记录 | 撰写无分类的散文式反馈 |
| 使用grep和路由检查验证修复 | 未经验证即假设修复正确 |
| 同时报告有效部分与问题部分 | 撰写纯负面报告 |
| File | Read when |
|---|---|
| Assigning severity (P0/P1/P2) to a friction point or choosing between severity levels |
| Tagging friction points with root cause codes for pattern analysis |
| Matching a derailment type to a proven fix pattern |
| Tracking improvement across multiple test runs or building cross-run reports |
| Applying Derailment Testing to non-skill instruction sets (runbooks, SOPs, API docs) |
| File | Read when |
|---|---|
| 为卡点分配级别(P0/P1/P2)或在级别间做选择时 |
| 为卡点标记根本原因代码以进行模式分析时 |
| 将卡点类型与已验证的修复模式匹配时 |
| 跟踪多次测试的改进效果或构建跨测试报告时 |
| 将脱轨测试应用于非Skill指令集(运行手册、标准操作流程、API文档)时 |
build-skillsbuild-skillsderail-notes/NN-dogfood-[topic].mdderail-notes/NN-dogfood-[topic].md