verify

Verify Feature

Comprehensive verification using parallel specialized agents with nuanced grading (0-10 scale) and improvement suggestions.

Quick Start

bash
/ork:verify authentication flow
/ork:verify --model=opus user profile feature
/ork:verify --scope=backend database migrations

Argument Resolution

python
SCOPE = "$ARGUMENTS"       # Full argument string, e.g., "authentication flow"
SCOPE_TOKEN = "$ARGUMENTS[0]"  # First token for flag detection (e.g., "--scope=backend")
# $ARGUMENTS[0], $ARGUMENTS[1] etc. for indexed access (CC 2.1.59)

# Model override detection (CC 2.1.72)
MODEL_OVERRIDE = None
for token in "$ARGUMENTS".split():
    if token.startswith("--model="):
        MODEL_OVERRIDE = token.split("=", 1)[1]  # "opus", "sonnet", "haiku"
        SCOPE = SCOPE.replace(token, "").strip()

Pass `MODEL_OVERRIDE` to all Agent() calls via `model=MODEL_OVERRIDE` when set. Accepts symbolic names (`opus`, `sonnet`, `haiku`) or full IDs (`claude-opus-4-6`) per CC 2.1.74.
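
The override detection described above can be tried outside the skill runtime. The following is a minimal sketch, assuming the raw argument string is available as an ordinary Python string; `parse_verify_args` is a hypothetical helper name, not part of the skill. It filters tokens instead of calling `str.replace`, which avoids leaving a double space when the flag sits in the middle of the scope text.

```python
from typing import Optional, Tuple


def parse_verify_args(arguments: str) -> Tuple[str, Optional[str]]:
    """Split a raw argument string into (scope, model_override).

    Hypothetical helper: only the --model= flag is handled here;
    any other flag (e.g. --scope=backend) is left in the scope text.
    """
    model_override = None
    scope_tokens = []
    for token in arguments.split():
        if token.startswith("--model="):
            # Keep everything after the first "=", e.g. "opus"
            model_override = token.split("=", 1)[1]
        else:
            scope_tokens.append(token)
    return " ".join(scope_tokens), model_override


parse_verify_args("--model=opus user profile feature")
# → ("user profile feature", "opus")
```

When no `--model=` token is present the second element is `None`, which matches the "when set" condition for passing `model=MODEL_OVERRIDE`.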

> **Opus 4.6**: Agents use native adaptive thinking (no MCP sequential-thinking needed). Extended 128K output supports comprehensive verification reports.

---

STEP 0: Effort-Aware Verification Scaling (CC 2.1.76)

Scale verification depth based on `/effort` level:

| Effort Level | Phases Run | Agents | Output |
| --- | --- | --- | --- |
| low | Run tests only → pass/fail | 0 agents | Quick check |
| medium | Tests + code quality + security | 3 agents | Score + top issues |
| high (default) | All 8 phases + visual capture | 6-7 agents | Full report + grades |

Override: Explicit user selection (e.g., "Full verification") overrides `/effort` downscaling.
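
The effort scaling and its override rule could be encoded as a small lookup. This is an illustrative sketch only: `EFFORT_PLANS` and `resolve_effort` are hypothetical names, and matching the user's choice by the prefix "Full verification" is an assumption about how the selected label is passed along.

```python
from typing import Optional

# Mirrors the effort table; the dictionary layout itself is an assumption.
EFFORT_PLANS = {
    "low": {"phases": "tests only", "agents": "0", "output": "Quick check"},
    "medium": {"phases": "tests + code quality + security", "agents": "3",
               "output": "Score + top issues"},
    "high": {"phases": "all 8 phases + visual capture", "agents": "6-7",
             "output": "Full report + grades"},
}


def resolve_effort(effort: Optional[str], user_choice: Optional[str] = None) -> str:
    """Pick the effective effort level.

    An explicit "Full verification" selection overrides downscaling;
    unknown or missing levels fall back to the default, "high".
    """
    if user_choice and user_choice.startswith("Full verification"):
        return "high"
    if effort in EFFORT_PLANS:
        return effort
    return "high"


plan = EFFORT_PLANS[resolve_effort("medium")]
```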

STEP 0a: Verify User Intent with AskUserQuestion

BEFORE creating tasks, clarify verification scope:
python
AskUserQuestion(
  questions=[{
    "question": "What scope for this verification?",
    "header": "Scope",
    "options": [
      {"label": "Full verification (Recommended)", "description": "All tests + security + code quality + visual + grades", "markdown": "```\nFull Verification (10 phases)\n─────────────────────────────\n  7 parallel agents:\n  ┌────────────┐ ┌────────────┐\n  │ Code       │ │ Security   │\n  │ Quality    │ │ Auditor    │\n  ├────────────┤ ├────────────┤\n  │ Test       │ │ Backend    │\n  │ Generator  │ │ Architect  │\n  ├────────────┤ ├────────────┤\n  │ Frontend   │ │ Performance│\n  │ Developer  │ │ Engineer   │\n  ├────────────┤ └────────────┘\n  │ Visual     │\n  │ Capture    │ → gallery.html\n  └────────────┘\n         ▼\n    Composite Score (0-10)\n    8 dimensions + Grade\n    + Visual Gallery\n```"},
      {"label": "Tests only", "description": "Run unit + integration + e2e tests", "markdown": "```\nTests Only\n──────────\n  npm test ──▶ Results\n  ┌─────────────────────┐\n  │ Unit tests     ✓/✗  │\n  │ Integration    ✓/✗  │\n  │ E2E            ✓/✗  │\n  │ Coverage       NN%  │\n  └─────────────────────┘\n  Skip: security, quality, UI\n  Output: Pass/fail + coverage\n```"},
      {"label": "Security audit", "description": "Focus on security vulnerabilities", "markdown": "```\nSecurity Audit\n──────────────\n  security-auditor agent:\n  ┌─────────────────────────┐\n  │ OWASP Top 10       ✓/✗ │\n  │ Dependency CVEs    ✓/✗ │\n  │ Secrets scan       ✓/✗ │\n  │ Auth flow review   ✓/✗ │\n  │ Input validation   ✓/✗ │\n  └─────────────────────────┘\n  Output: Security score 0-10\n          + vulnerability list\n```"},
      {"label": "Code quality", "description": "Lint, types, complexity analysis", "markdown": "```\nCode Quality\n────────────\n  code-quality-reviewer agent:\n  ┌─────────────────────────┐\n  │ Lint errors         N   │\n  │ Type coverage       NN% │\n  │ Cyclomatic complex  N.N │\n  │ Dead code           N   │\n  │ Pattern violations  N   │\n  └─────────────────────────┘\n  Output: Quality score 0-10\n          + refactor suggestions\n```"},
      {"label": "Quick check", "description": "Just run tests, skip detailed analysis", "markdown": "```\nQuick Check (~1 min)\n────────────────────\n  Run tests ──▶ Pass/Fail\n\n  Output:\n  ├── Test results\n  ├── Build status\n  └── Lint status\n  No agents, no grading,\n  no report generation\n```"}
    ],
    "multiSelect": true
  }]
)
Based on answer, adjust workflow:
  • Full verification: All 10 phases (8 + 2.5 + 8.5), 7 parallel agents including visual capture
  • Tests only: Skip phases 2 (security), 5 (UI/UX analysis)
  • Security audit: Focus on security-auditor agent
  • Code quality: Focus on code-quality-reviewer agent
  • Quick check: Run tests only, skip grading and suggestions

STEP 0b: Select Orchestration Mode

Load details: `Read("${CLAUDE_SKILL_DIR}/references/orchestration-mode.md")` for env var check logic, Agent Teams vs Task Tool comparison, and mode selection rules.

Choose Agent Teams (mesh: verifiers share findings) or Task tool (star: all report to lead) based on the orchestration mode reference.

MCP Probe + Resume

python
ToolSearch(query="select:mcp__memory__search_nodes")
Write(".claude/chain/capabilities.json", { memory, timestamp })

Read(".claude/chain/state.json")  # resume if exists

Handoff File

After verification completes, write results:
python
Write(".claude/chain/verify-results.json", JSON.stringify({
  "phase": "verify", "skill": "verify",
  "timestamp": now(), "status": "completed",
  "outputs": {
    "tests_passed": N, "tests_failed": N,
    "coverage": "87%", "security_scan": "clean"
  }
}))
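
The handoff call above is pseudocode (it mixes `JSON.stringify` with a Python label). A runnable equivalent might look like the sketch below. `build_handoff` and `write_handoff` are hypothetical helper names, and the ISO 8601 UTC timestamp is an assumption, since the original only writes `now()`.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def build_handoff(tests_passed: int, tests_failed: int, coverage: str,
                  security_scan: str, timestamp: Optional[str] = None) -> dict:
    """Assemble the handoff payload with the same keys as the example above."""
    return {
        "phase": "verify",
        "skill": "verify",
        # Assumption: ISO 8601 in UTC; the source does not specify a format.
        "timestamp": timestamp or datetime.now(timezone.utc).isoformat(),
        "status": "completed",
        "outputs": {
            "tests_passed": tests_passed,
            "tests_failed": tests_failed,
            "coverage": coverage,
            "security_scan": security_scan,
        },
    }


def write_handoff(payload: dict, chain_dir: str = ".claude/chain") -> Path:
    """Write the payload to <chain_dir>/verify-results.json and return the path."""
    path = Path(chain_dir) / "verify-results.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(payload, indent=2), encoding="utf-8")
    return path
```

Keeping payload construction separate from the file write makes the structure easy to check without touching the filesystem.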

Regression Monitor (CC 2.1.71)

Optionally schedule post-verification monitoring:
python
# Guard: Skip cron in headless/CI (CLAUDE_CODE_DISABLE_CRON)
# if env CLAUDE_CODE_DISABLE_CRON is set, run a single check instead
CronCreate(
  schedule="0 8 * * *",
  prompt="Daily regression check: npm test. If 7 consecutive passes → CronDelete. If failures → alert with details."
)
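
The guard and the "7 consecutive passes" stop condition can be expressed as two small checks. This is a sketch under stated assumptions: `monitoring_mode` and `should_delete_cron` are hypothetical names, and "set" is interpreted here as "present with a non-empty value", which the source does not spell out.

```python
import os
from typing import List, Mapping, Optional


def monitoring_mode(env: Optional[Mapping[str, str]] = None) -> str:
    """Return "single-check" when cron is disabled, otherwise "cron"."""
    env = os.environ if env is None else env
    # Assumption: any non-empty value counts as "set".
    return "single-check" if env.get("CLAUDE_CODE_DISABLE_CRON") else "cron"


def should_delete_cron(history: List[bool], required: int = 7) -> bool:
    """True once the most recent `required` runs (oldest first) all passed."""
    if len(history) < required:
        return False
    return all(history[-required:])
```

A single failure inside the trailing window resets the streak, so the schedule stays in place until seven clean runs in a row have been recorded.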

---

Task Management (CC 2.1.16)

python
# Create main verification task
TaskCreate(
  subject="Verify [feature-name] implementation",
  description="Comprehensive verification with nuanced grading",
  activeForm="Verifying [feature-name] implementation"
)

# Create subtasks for 8-phase process
# Each entry pairs the subject with its progressive form; appending "ing"
# to the whole phrase would yield strings like "Run code quality checksing".
phases = [
  ("Run code quality checks", "Running code quality checks"),
  ("Execute security audit", "Executing security audit"),
  ("Verify test coverage", "Verifying test coverage"),
  ("Validate API", "Validating API"),
  ("Check UI/UX", "Checking UI/UX"),
  ("Calculate grades", "Calculating grades"),
  ("Generate suggestions", "Generating suggestions"),
  ("Compile report", "Compiling report"),
]
for subject, active_form in phases:
    TaskCreate(subject=subject, activeForm=active_form)

---

8-Phase Workflow

Load details: `Read("${CLAUDE_SKILL_DIR}/references/verification-phases.md")` for complete phase details, agent spawn definitions, Agent Teams alternative, and team teardown.

| Phase | Activities | Output |
| --- | --- | --- |
| 1. Context Gathering | Git diff, commit history | Changes summary |
| 2. Parallel Agent Dispatch | 6 agents evaluate | 0-10 scores |
| 2.5 Visual Capture | Screenshot routes, AI vision eval | Gallery + visual score |
| 3. Test Execution | Backend + frontend tests | Coverage data |
| 4. Nuanced Grading | Composite score calculation | Grade (A-F) |
| 5. Improvement Suggestions | Effort vs impact analysis | Prioritized list |
| 6. Alternative Comparison | Compare approaches (optional) | Recommendation |
| 7. Metrics Tracking | Trend analysis | Historical data |
| 8. Report Compilation | Evidence artifacts + gallery.html | Final report |
| 8.5 Agentation Loop | User annotates, ui-feedback fixes | Before/after diffs |

Phase 2 Agents (Quick Reference)

| Agent | Focus | Output |
| --- | --- | --- |
| code-quality-reviewer | Lint, types, patterns | Quality 0-10 |
| security-auditor | OWASP, secrets, CVEs | Security 0-10 |
| test-generator | Coverage, test quality | Coverage 0-10 |
| backend-system-architect | API design, async | API 0-10 |
| frontend-ui-developer | React 19, Zod, a11y | UI 0-10 |
| python-performance-engineer | Latency, resources, scaling | Performance 0-10 |

Launch ALL agents in ONE message with `run_in_background=True` and `max_turns=25`.

Progressive Output (CC 2.1.76)

Output each agent's score as soon as it completes — don't wait for all 6-7 agents:
Security:     8.2/10 — No critical vulnerabilities found
Code Quality: 7.5/10 — 3 complexity hotspots identified
[...remaining agents still running...]
This gives users real-time visibility into multi-agent verification. If any dimension scores below the `security_minimum` threshold (default 5.0), flag it as a blocker immediately; the user can terminate early without waiting for remaining agents.
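
Progressive reporting can be sketched as a generator that emits one line per finished agent and marks low scores on the spot. `stream_scores` is a hypothetical name, the `[BLOCKER]` suffix is an illustrative format, and the input is assumed to arrive in completion order as `(dimension, score, note)` tuples.

```python
from typing import Iterable, Iterator, Tuple


def stream_scores(completed: Iterable[Tuple[str, float, str]],
                  blocker_threshold: float = 5.0) -> Iterator[str]:
    """Yield one report line per finished agent, flagging blockers immediately."""
    for dimension, score, note in completed:
        line = f"{dimension}: {score:.1f}/10 - {note}"
        if score < blocker_threshold:
            # Assumption: a plain-text marker is enough to signal a blocker.
            line += " [BLOCKER]"
        yield line


for line in stream_scores([
    ("Security", 8.2, "No critical vulnerabilities found"),
    ("Code Quality", 7.5, "3 complexity hotspots identified"),
]):
    print(line)
```

Because the function is a generator, each line is available as soon as its tuple arrives, so a caller can stop iterating after the first blocker instead of waiting for the remaining agents.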

Phase 2.5: Visual Capture (NEW — runs in parallel with Phase 2)

Load details: `Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")` for auto-detection, route discovery, screenshot capture, and AI vision evaluation.

Summary: Auto-detects the project framework, starts the dev server, discovers routes, uses agent-browser to screenshot each route, evaluates with Claude vision, and generates a self-contained `gallery.html` with base64-embedded images.

Output: `verification-output/{timestamp}/gallery.html`. Open it in a browser to see all screenshots with AI evaluations, scores, and annotation diffs.

Graceful degradation: If no frontend is detected or the server won't start, visual capture is skipped with a warning; it never blocks verification.

Phase 8.5: Agentation Visual Feedback (opt-in)

Load details: `Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")` (Phase 8.5 section) for agentation loop workflow.

Trigger: Only when agentation MCP is configured. Offers the user the choice to annotate the live UI. The `ui-feedback` agent processes annotations, and re-screenshots show before/after.

Grading & Scoring

Load `Read("${CLAUDE_PLUGIN_ROOT}/skills/quality-gates/references/unified-scoring-framework.md")` for dimensions, weights, grade thresholds, and improvement prioritization. Load `Read("${CLAUDE_SKILL_DIR}/references/quality-model.md")` for verify-specific extensions (Visual dimension). Load `Read("${CLAUDE_SKILL_DIR}/references/grading-rubric.md")` for per-agent scoring criteria.

Evidence & Test Execution

Load details: `Read("${CLAUDE_SKILL_DIR}/rules/evidence-collection.md")` for git commands, test execution patterns, metrics tracking, and post-verification feedback.

Policy-as-Code

Load details: `Read("${CLAUDE_SKILL_DIR}/references/policy-as-code.md")` for configuration.

Define verification rules in `.claude/policies/verification-policy.json`:
json
{
  "thresholds": {
    "composite_minimum": 6.0,
    "security_minimum": 7.0,
    "coverage_minimum": 70
  },
  "blocking_rules": [
    {"dimension": "security", "below": 5.0, "action": "block"}
  ]
}
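
A policy of this shape could be evaluated against a set of scores roughly as follows. This is a sketch, not the skill's actual evaluator: `evaluate_policy` is a hypothetical name, and mapping a missed threshold to "IMPROVEMENTS RECOMMENDED" and a triggered blocking rule to "BLOCKED" is an assumption based on the verdict labels in the report format.

```python
import json
from typing import Dict, List, Tuple

POLICY_JSON = """
{
  "thresholds": {"composite_minimum": 6.0, "security_minimum": 7.0, "coverage_minimum": 70},
  "blocking_rules": [{"dimension": "security", "below": 5.0, "action": "block"}]
}
"""


def evaluate_policy(policy: dict, scores: Dict[str, float]) -> Tuple[str, List[str]]:
    """Return (verdict, reasons) for the given scores under the policy."""
    reasons: List[str] = []
    blocked = False
    for rule in policy.get("blocking_rules", []):
        value = scores.get(rule["dimension"])
        if value is not None and value < rule["below"] and rule["action"] == "block":
            blocked = True
            reasons.append(f"{rule['dimension']} {value} is below blocking limit {rule['below']}")
    for key, minimum in policy.get("thresholds", {}).items():
        # "security_minimum" -> "security" (assumed naming convention)
        dimension = key[: -len("_minimum")] if key.endswith("_minimum") else key
        value = scores.get(dimension)
        if value is not None and value < minimum:
            reasons.append(f"{dimension} {value} is below minimum {minimum}")
    if blocked:
        return "BLOCKED", reasons
    if reasons:
        return "IMPROVEMENTS RECOMMENDED", reasons
    return "READY FOR MERGE", reasons


policy = json.loads(POLICY_JSON)
verdict, reasons = evaluate_policy(policy, {"composite": 8.0, "security": 6.0, "coverage": 85})
```

With these inputs the security score clears the blocking limit of 5.0 but misses the 7.0 minimum, so the sketch reports a recommendation, not a block.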

Report Format

Load details: `Read("${CLAUDE_SKILL_DIR}/references/report-template.md")` for full format. Summary:

Feature Verification Report

Composite Score: [N.N]/10 (Grade: [LETTER])

Verdict

[READY FOR MERGE | IMPROVEMENTS RECOMMENDED | BLOCKED]

---

References

Load on demand with `Read("${CLAUDE_SKILL_DIR}/references/<file>")`:

| File | Content |
| --- | --- |
| verification-phases.md | 8-phase workflow, agent spawn definitions, Agent Teams mode |
| visual-capture.md | Phase 2.5 + 8.5: screenshot capture, AI vision, gallery generation, agentation loop |
| quality-model.md | Scoring dimensions and weights (8 unified) |
| grading-rubric.md | Per-agent scoring criteria |
| report-template.md | Full report format with visual evidence section |
| alternative-comparison.md | Approach comparison template |
| orchestration-mode.md | Agent Teams vs Task Tool |
| policy-as-code.md | Verification policy configuration |
| verification-checklist.md | Pre-flight checklist |

Rules

Load on demand with `Read("${CLAUDE_SKILL_DIR}/rules/<file>")`:

| File | Content |
| --- | --- |
| scoring-rubric.md | Composite scoring, grades, verdicts |
| evidence-collection.md | Evidence gathering and test patterns |

Related Skills

  • `ork:implement` - Full implementation with verification
  • `ork:review-pr` - PR-specific verification
  • `testing-unit` / `testing-integration` / `testing-e2e` - Test execution patterns
  • `ork:quality-gates` - Quality gate patterns
  • `browser-tools` - Browser automation for visual capture

Version: 4.2.0 (March 2026) — Added progressive output for incremental agent scores