skill-tuning

Skill Tuning

Autonomous diagnosis and optimization for skill execution issues.

Architecture

┌─────────────────────────────────────────────────────┐
│  Phase 0: Read Specs (mandatory)                    │
│  → problem-taxonomy.md, tuning-strategies.md         │
└─────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────┐
│  Orchestrator (state-driven)                         │
│  Read state → Select action → Execute → Update → ✓ │
└─────────────────────────────────────────────────────┘
        ↓                           ↓
┌──────────────────────┐   ┌──────────────────┐
│  Diagnosis Phase     │   │ Gemini CLI       │
│  • Context          │   │ Deep analysis    │
│  • Memory           │   │ (on-demand)      │
│  • DataFlow         │   │                  │
│  • Agent            │   │ Complex issues   │
│  • Docs             │   │ Architecture     │
│  • Token Usage      │   │ Performance      │
└──────────────────────┘   └──────────────────┘
        ┌───────────────────┐
        │  Fix & Verify     │
        │  Apply → Re-test  │
        └───────────────────┘
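
The orchestrator row of the diagram (read state → select action → execute → update) can be sketched as a small loop. This is a minimal illustration, not the skill's actual implementation: `run_orchestrator`, `toy_select`, `toy_actions`, and the `max_steps` guard are hypothetical names introduced here, and the real decision logic lives in phases/orchestrator.md.

```python
from typing import Callable, Optional

State = dict
Action = Callable[[State], State]


def run_orchestrator(
    state: State,
    select: Callable[[State], Optional[str]],
    actions: dict[str, Action],
    max_steps: int = 50,
) -> State:
    """Read state -> select action -> execute -> update, until select returns None."""
    for _ in range(max_steps):
        name = select(state)          # the decision depends only on the current state
        if name is None:              # nothing left to do: terminate
            return state
        state = actions[name](state)  # each action returns the updated state
    raise RuntimeError("orchestrator did not terminate within max_steps")


# Toy usage with two placeholder actions (assumed behavior, for illustration only).
def toy_select(state: State) -> Optional[str]:
    if state["status"] == "pending":
        return "action-init"
    if state["status"] == "running":
        return "action-complete"
    return None


toy_actions = {
    "action-init": lambda s: {**s, "status": "running"},
    "action-complete": lambda s: {**s, "status": "completed"},
}

final_state = run_orchestrator({"status": "pending"}, toy_select, toy_actions)
```

Keeping the selector a pure function of the state makes a run resumable: restarting from a saved state picks up at the same action.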

Core Issues Detected

| Priority | Problem | Root Cause | Fix Strategy |
|----------|---------|------------|--------------|
| P0 | Authoring Violation | Intermediate files, state bloat, file relay | eliminate_intermediate, minimize_state |
| P1 | Data Flow Disruption | Scattered state, inconsistent formats | state_centralization, schema_enforcement |
| P2 | Agent Coordination | Fragile chains, no error handling | error_wrapping, result_validation |
| P3 | Context Explosion | Unbounded history, full content passing | sliding_window, path_reference |
| P4 | Long-tail Forgetting | Early constraint loss | constraint_injection, checkpoint_restore |
| P5 | Token Consumption | Verbose prompts, state bloat | prompt_compression, lazy_loading |
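
As one possible reading of the two P3 strategies, the sketch below bounds a history list and replaces full file content with a path plus a short preview. The function signatures, the `max_items` and `max_chars` parameters, and the returned dictionary shape are assumptions made for illustration; specs/tuning-strategies.md defines the actual patterns.

```python
def sliding_window(history: list[str], max_items: int) -> list[str]:
    """Keep only the most recent max_items entries of a growing history."""
    if max_items <= 0:
        return []  # guard: history[-0:] would return the whole list
    return history[-max_items:]


def path_reference(path: str, content: str, max_chars: int = 200) -> dict:
    """Pass a file path and a short preview instead of the full content."""
    return {
        "path": path,
        "preview": content[:max_chars],
        "length": len(content),  # lets the consumer decide whether to load the file
    }


recent = sliding_window(["turn-1", "turn-2", "turn-3", "turn-4"], max_items=2)
ref = path_reference("notes/analysis.md", "x" * 1000, max_chars=10)
```

Both helpers trade completeness for a bounded payload, so a consumer that needs the full content has to read it from `path` on demand.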

Problem Categories (Detailed Specs)

See specs/problem-taxonomy.md for:
  • Detection patterns (regex/checks)
  • Severity calculations
  • Impact assessments

Tuning Strategies (Detailed Specs)

See specs/tuning-strategies.md for:
  • 10+ strategies per category
  • Implementation patterns
  • Verification methods

Workflow

| Step | Action | Orchestrator Decision | Output |
|------|--------|-----------------------|--------|
| 1 | action-init | status='pending' | Backup, session created |
| 2 | action-analyze-requirements | After init | Required dimensions + coverage |
| 3 | Diagnosis (6 types) | Focus areas | state.diagnosis.{type} |
| 4 | action-gemini-analysis | Critical issues OR user request | Deep findings |
| 5 | action-generate-report | All diagnosis complete | state.final_report |
| 6 | action-propose-fixes | Issues found | state.proposed_fixes[] |
| 7 | action-apply-fix | Pending fixes | Applied + verified |
| 8 | action-complete | Quality gates pass | session.status='completed' |
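
The decision column above can be read as a function from state to the next action name. The sketch below is one hedged interpretation. The keys `requirements`, `focus_areas`, `deep_analysis_requested`, and `gemini_analysis`, and the severity value `"critical"`, do not appear in this document and are assumed for illustration. Returning `action-abort` on a failed gate is also a simplification, since a real orchestrator may start another round instead.

```python
from typing import Optional

DIAGNOSIS_TYPES = ["context", "memory", "dataflow", "agent", "docs", "token_consumption"]


def select_next_action(state: dict) -> Optional[str]:
    """Map the current state to the next action name, or None when finished."""
    status = state.get("status")
    if status in ("completed", "failed"):
        return None
    if status == "pending":
        return "action-init"
    if "requirements" not in state:                      # assumed key
        return "action-analyze-requirements"

    diagnosis = state.get("diagnosis", {})
    for kind in state.get("focus_areas", DIAGNOSIS_TYPES):  # assumed key
        if kind not in diagnosis:
            return f"action-diagnose-{kind}"

    issues = state.get("issues", [])
    wants_deep = state.get("deep_analysis_requested", False) or any(
        issue.get("severity") == "critical" for issue in issues  # assumed value
    )
    if wants_deep and "gemini_analysis" not in state:    # assumed key
        return "action-gemini-analysis"
    if not state.get("final_report"):
        return "action-generate-report"
    if not issues:
        return "action-complete"
    if "proposed_fixes" not in state:
        return "action-propose-fixes"
    if len(state.get("applied_fixes", [])) < len(state["proposed_fixes"]):
        return "action-apply-fix"

    gate = state.get("quality_gate")
    if gate is None:
        return "action-verify"
    return "action-complete" if gate == "pass" else "action-abort"
```

Checks run in workflow order, so the first unmet condition determines the next action.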

Action Reference

| Category | Actions | Purpose |
|----------|---------|---------|
| Setup | action-init | Initialize backup, session state |
| Analysis | action-analyze-requirements | Decompose user request via Gemini CLI |
| Diagnosis | action-diagnose-{context,memory,dataflow,agent,docs,token_consumption} | Detect category-specific issues |
| Deep Analysis | action-gemini-analysis | Gemini CLI: complex/critical issues |
| Reporting | action-generate-report | Consolidate findings → final_report |
| Fixing | action-propose-fixes, action-apply-fix | Generate + apply fixes |
| Verify | action-verify | Re-run diagnosis, check gates |
| Exit | action-complete, action-abort | Finalize or rollback |

Full action details: phases/actions/

State Management

Single source of truth:
.workflow/.scratchpad/skill-tuning-{ts}/state.json
{
  "status": "pending|running|completed|failed",
  "target_skill": { "name": "...", "path": "..." },
  "diagnosis": {
    "context": {...},
    "memory": {...},
    "dataflow": {...},
    "agent": {...},
    "docs": {...},
    "token_consumption": {...}
  },
  "issues": [{"id":"...", "severity":"...", "category":"...", "strategy":"..."}],
  "proposed_fixes": [...],
  "applied_fixes": [...],
  "quality_gate": "pass|fail",
  "final_report": "..."
}
See phases/state-schema.md for complete schema.
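
Because every action reads and writes the same state.json, a partial write would corrupt the single source of truth. The sketch below shows one way to load the file with a basic status check and to save it atomically. `load_state`, `save_state`, and `VALID_STATUS` are hypothetical helpers; only the four status values are taken from the schema above.

```python
import json
import os
import tempfile
from pathlib import Path

VALID_STATUS = {"pending", "running", "completed", "failed"}


def load_state(path: Path) -> dict:
    """Read state.json and reject a status value outside the documented set."""
    state = json.loads(path.read_text(encoding="utf-8"))
    if state.get("status") not in VALID_STATUS:
        raise ValueError(f"unexpected status: {state.get('status')!r}")
    return state


def save_state(path: Path, state: dict) -> None:
    """Write to a temp file in the same directory, then atomically replace."""
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        json.dump(state, handle, indent=2)
    os.replace(tmp_name, path)  # readers see either the old or the new file
```

The temp file is created next to the target so the rename stays on one filesystem, which is what makes `os.replace` atomic.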

Orchestrator Logic

See phases/orchestrator.md for:
  • Decision logic (termination checks → action selection)
  • State transitions
  • Error recovery

Key Principles

  1. Problem-First: Diagnosis before any fix
  2. Data-Driven: Record traces, token counts, snapshots
  3. Iterative: Multiple rounds until quality gates pass
  4. Reversible: All changes with backup checkpoints
  5. Non-Invasive: Minimal changes, maximum clarity
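
The "Reversible" principle can be illustrated with a directory-level checkpoint taken before a fix and restored on failure. This is a sketch under stated assumptions: `create_checkpoint`, `rollback`, and the directory layout are hypothetical, and the skill's own backup mechanism (set up by action-init) may work differently.

```python
import shutil
import tempfile
from pathlib import Path


def create_checkpoint(skill_dir: Path, backup_root: Path, label: str) -> Path:
    """Copy the skill directory to backup_root/label before applying a fix.

    backup_root must live outside skill_dir, otherwise the copy would
    recurse into its own destination.
    """
    target = backup_root / label
    shutil.copytree(skill_dir, target)  # raises FileExistsError if label is reused
    return target


def rollback(skill_dir: Path, checkpoint: Path) -> None:
    """Discard the current skill directory and restore the checkpoint copy."""
    shutil.rmtree(skill_dir)
    shutil.copytree(checkpoint, skill_dir)


# Usage: checkpoint, change a file, then restore the earlier version.
root = Path(tempfile.mkdtemp())
skill = root / "skill"
skill.mkdir()
(skill / "SKILL.md").write_text("v1", encoding="utf-8")
checkpoint = create_checkpoint(skill, root / "backups", "before-fix")
(skill / "SKILL.md").write_text("v2", encoding="utf-8")
rollback(skill, checkpoint)
```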

Usage Examples

Basic skill diagnosis

/skill-tuning "Fix memory leaks in my skill"

Deep analysis with Gemini

/skill-tuning "Architecture issues in async workflow"

Focus on specific areas

/skill-tuning "Optimize token consumption and fix agent coordination"

Custom issue

/skill-tuning "My skill produces inconsistent outputs"

Output

After completion, review:
  • .workflow/.scratchpad/skill-tuning-{ts}/state.json
    - Full state with final_report
  • state.final_report
    - Markdown summary (in state.json)
  • state.applied_fixes
    - List of applied fixes with verification results

Reference Documents

| Document | Purpose |
|----------|---------|
| specs/problem-taxonomy.md | Classification + detection patterns |
| specs/tuning-strategies.md | Fix implementation guide |
| specs/dimension-mapping.md | Dimension ↔ Spec mapping |
| specs/quality-gates.md | Quality verification criteria |
| phases/orchestrator.md | Workflow orchestration |
| phases/state-schema.md | State structure definition |
| phases/actions/ | Individual action implementations |