verify
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseVerify Feature
功能验证
Comprehensive verification using parallel specialized agents with nuanced grading (0-10 scale) and improvement suggestions.
借助并行的专用Agent实现全面验证,提供精细化评分(0-10分制)和改进建议。
Quick Start
快速开始
bash
/ork:verify authentication flow
/ork:verify --model=opus user profile feature
/ork:verify --scope=backend database migrationsbash
/ork:verify authentication flow
/ork:verify --model=opus user profile feature
/ork:verify --scope=backend database migrationsArgument Resolution
参数解析
python
SCOPE = "$ARGUMENTS" # Full argument string, e.g., "authentication flow"
SCOPE_TOKEN = "$ARGUMENTS[0]" # First token for flag detection (e.g., "--scope=backend")python
SCOPE = "$ARGUMENTS" # 完整参数字符串,例如:"authentication flow"
SCOPE_TOKEN = "$ARGUMENTS[0]" # 用于检测标志的第一个令牌(例如:"--scope=backend")$ARGUMENTS[0], $ARGUMENTS[1] etc. for indexed access (CC 2.1.59)
$ARGUMENTS[0], $ARGUMENTS[1] 等用于索引访问(CC 2.1.59)
Model override detection (CC 2.1.72)
模型覆盖检测(CC 2.1.72)
MODEL_OVERRIDE = None
for token in "$ARGUMENTS".split():
if token.startswith("--model="):
MODEL_OVERRIDE = token.split("=", 1)[1] # "opus", "sonnet", "haiku"
SCOPE = SCOPE.replace(token, "").strip()
Pass `MODEL_OVERRIDE` to all Agent() calls via `model=MODEL_OVERRIDE` when set. Accepts symbolic names (`opus`, `sonnet`, `haiku`) or full IDs (`claude-opus-4-6`) per CC 2.1.74.
> **Opus 4.6**: Agents use native adaptive thinking (no MCP sequential-thinking needed). Extended 128K output supports comprehensive verification reports.
---MODEL_OVERRIDE = None
for token in "$ARGUMENTS".split():
if token.startswith("--model="):
MODEL_OVERRIDE = token.split("=", 1)[1] # "opus", "sonnet", "haiku"
SCOPE = SCOPE.replace(token, "").strip()
当设置`MODEL_OVERRIDE`时,通过`model=MODEL_OVERRIDE`将其传递给所有Agent()调用。支持符号名称(`opus`、`sonnet`、`haiku`)或完整ID(`claude-opus-4-6`),符合CC 2.1.74规范。
> **Opus 4.6**:Agent采用原生自适应思维(无需MCP顺序思维)。扩展的128K输出支持生成全面的验证报告。
---STEP 0: Effort-Aware Verification Scaling (CC 2.1.76)
步骤0:基于工作量的验证规模调整(CC 2.1.76)
Scale verification depth based on level:
/effort| Effort Level | Phases Run | Agents | Output |
|---|---|---|---|
| low | Run tests only → pass/fail | 0 agents | Quick check |
| medium | Tests + code quality + security | 3 agents | Score + top issues |
| high (default) | All 8 phases + visual capture | 6-7 agents | Full report + grades |
Override: Explicit user selection (e.g., "Full verification") overridesdownscaling./effort
根据级别调整验证深度:
/effort| 工作量级别 | 运行阶段 | Agent数量 | 输出结果 |
|---|---|---|---|
| 低 | 仅运行测试 → 合格/不合格 | 0个Agent | 快速检查结果 |
| 中 | 测试 + 代码质量 + 安全 | 3个Agent | 评分 + 主要问题 |
| 高(默认) | 全部8个阶段 + 视觉捕获 | 6-7个Agent | 完整报告 + 评分 |
覆盖设置:用户明确选择(例如“全面验证”)会覆盖的降级设置。/effort
STEP 0a: Verify User Intent with AskUserQuestion
步骤0a:通过AskUserQuestion验证用户意图
BEFORE creating tasks, clarify verification scope:
python
AskUserQuestion(
questions=[{
"question": "What scope for this verification?",
"header": "Scope",
"options": [
{"label": "Full verification (Recommended)", "description": "All tests + security + code quality + visual + grades", "markdown": "```\nFull Verification (10 phases)\n─────────────────────────────\n 7 parallel agents:\n ┌────────────┐ ┌────────────┐\n │ Code │ │ Security │\n │ Quality │ │ Auditor │\n ├────────────┤ ├────────────┤\n │ Test │ │ Backend │\n │ Generator │ │ Architect │\n ├────────────┤ ├────────────┤\n │ Frontend │ │ Performance│\n │ Developer │ │ Engineer │\n ├────────────┤ └────────────┘\n │ Visual │\n │ Capture │ → gallery.html\n └────────────┘\n ▼\n Composite Score (0-10)\n 8 dimensions + Grade\n + Visual Gallery\n```"},
{"label": "Tests only", "description": "Run unit + integration + e2e tests", "markdown": "```\nTests Only\n──────────\n npm test ──▶ Results\n ┌─────────────────────┐\n │ Unit tests ✓/✗ │\n │ Integration ✓/✗ │\n │ E2E ✓/✗ │\n │ Coverage NN% │\n └─────────────────────┘\n Skip: security, quality, UI\n Output: Pass/fail + coverage\n```"},
{"label": "Security audit", "description": "Focus on security vulnerabilities", "markdown": "```\nSecurity Audit\n──────────────\n security-auditor agent:\n ┌─────────────────────────┐\n │ OWASP Top 10 ✓/✗ │\n │ Dependency CVEs ✓/✗ │\n │ Secrets scan ✓/✗ │\n │ Auth flow review ✓/✗ │\n │ Input validation ✓/✗ │\n └─────────────────────────┘\n Output: Security score 0-10\n + vulnerability list\n```"},
{"label": "Code quality", "description": "Lint, types, complexity analysis", "markdown": "```\nCode Quality\n────────────\n code-quality-reviewer agent:\n ┌─────────────────────────┐\n │ Lint errors N │\n │ Type coverage NN% │\n │ Cyclomatic complex N.N │\n │ Dead code N │\n │ Pattern violations N │\n └─────────────────────────┘\n Output: Quality score 0-10\n + refactor suggestions\n```"},
{"label": "Quick check", "description": "Just run tests, skip detailed analysis", "markdown": "```\nQuick Check (~1 min)\n────────────────────\n Run tests ──▶ Pass/Fail\n\n Output:\n ├── Test results\n ├── Build status\n └── Lint status\n No agents, no grading,\n no report generation\n```"}
],
"multiSelect": true
}]
)Based on answer, adjust workflow:
- Full verification: All 10 phases (8 + 2.5 + 8.5), 7 parallel agents including visual capture
- Tests only: Skip phases 2 (security), 5 (UI/UX analysis)
- Security audit: Focus on security-auditor agent
- Code quality: Focus on code-quality-reviewer agent
- Quick check: Run tests only, skip grading and suggestions
在创建任务之前,明确验证范围:
python
AskUserQuestion(
questions=[{
"question": "本次验证的范围是什么?",
"header": "范围",
"options": [
{"label": "全面验证(推荐)", "description": "所有测试 + 安全 + 代码质量 + 视觉 + 评分", "markdown": "```\n全面验证(10个阶段)\n─────────────────────────────\n 7个并行Agent:\n ┌────────────┐ ┌────────────┐\n │ 代码质量 │ │ 安全审计 │\n │ 检查Agent │ │ Agent │\n ├────────────┤ ├────────────┤\n │ 测试生成 │ │ 后端架构 │\n │ Agent │ │ Agent │\n ├────────────┤ ├────────────┤\n │ 前端开发 │ │ 性能工程 │\n │ Agent │ │ Agent │\n ├────────────┤ └────────────┘\n │ 视觉捕获 │\n │ Agent │ → gallery.html\n └────────────┘\n ▼\n 综合评分(0-10)\n 8个维度 + 等级\n + 视觉图库\n```"},
{"label": "仅测试", "description": "运行单元测试 + 集成测试 + E2E测试", "markdown": "```\n仅测试\n──────────\n npm test ──▶ 结果\n ┌─────────────────────┐\n │ 单元测试 ✓/✗ │\n │ 集成测试 ✓/✗ │\n │ E2E测试 ✓/✗ │\n │ 覆盖率 NN% │\n └─────────────────────┘\n 跳过:安全、质量、UI\n 输出:合格/不合格 + 覆盖率\n```"},
{"label": "安全审计", "description": "重点关注安全漏洞", "markdown": "```\n安全审计\n──────────────\n 安全审计Agent:\n ┌─────────────────────────┐\n │ OWASP Top 10 ✓/✗ │\n │ 依赖项CVE检查 ✓/✗ │\n │ 敏感信息扫描 ✓/✗ │\n │ 认证流程审查 ✓/✗ │\n │ 输入验证检查 ✓/✗ │\n └─────────────────────────┘\n 输出:安全评分 0-10\n + 漏洞列表\n```"},
{"label": "代码质量", "description": "代码规范、类型检查、复杂度分析", "markdown": "```\n代码质量\n────────────\n 代码质量审查Agent:\n ┌─────────────────────────┐\n │ 代码规范错误数 N │\n │ 类型覆盖率 NN% │\n │ 圈复杂度 N.N │\n │ 死代码数量 N │\n │ 模式违规数 N │\n └─────────────────────────┘\n 输出:质量评分 0-10\n + 重构建议\n```"},
{"label": "快速检查", "description": "仅运行测试,跳过详细分析", "markdown": "```\n快速检查(约1分钟)\n────────────────────\n 运行测试 ──▶ 合格/不合格\n\n 输出:\n ├── 测试结果\n ├── 构建状态\n └── 代码规范检查状态\n 无Agent参与,无评分,\n 不生成报告\n```"}
],
"multiSelect": true
}]
)根据回答调整工作流:
- 全面验证:所有10个阶段(8 + 2.5 + 8.5),7个并行Agent,包括视觉捕获
- 仅测试:跳过阶段2(安全)、阶段5(UI/UX分析)
- 安全审计:重点使用安全审计Agent
- 代码质量:重点使用代码质量审查Agent
- 快速检查:仅运行测试,跳过评分和建议
STEP 0b: Select Orchestration Mode
步骤0b:选择编排模式
Load details: for env var check logic, Agent Teams vs Task Tool comparison, and mode selection rules.
Read("${CLAUDE_SKILL_DIR}/references/orchestration-mode.md")Choose Agent Teams (mesh -- verifiers share findings) or Task tool (star -- all report to lead) based on the orchestration mode reference.
加载详细信息: 用于环境变量检查逻辑、Agent团队与任务工具的对比,以及模式选择规则。
Read("${CLAUDE_SKILL_DIR}/references/orchestration-mode.md")根据编排模式参考,选择Agent团队(网状结构——验证者共享发现结果)或任务工具(星型结构——所有结果上报给主导者)。
MCP Probe + Resume
MCP探测与恢复
python
ToolSearch(query="select:mcp__memory__search_nodes")
Write(".claude/chain/capabilities.json", { memory, timestamp })
Read(".claude/chain/state.json") # resume if existspython
ToolSearch(query="select:mcp__memory__search_nodes")
Write(".claude/chain/capabilities.json", { memory, timestamp })
Read(".claude/chain/state.json") # 如果存在则恢复Handoff File
交接文件
After verification completes, write results:
python
Write(".claude/chain/verify-results.json", JSON.stringify({
"phase": "verify", "skill": "verify",
"timestamp": now(), "status": "completed",
"outputs": {
"tests_passed": N, "tests_failed": N,
"coverage": "87%", "security_scan": "clean"
}
}))验证完成后,写入结果:
python
Write(".claude/chain/verify-results.json", JSON.stringify({
"phase": "verify", "skill": "verify",
"timestamp": now(), "status": "completed",
"outputs": {
"tests_passed": N, "tests_failed": N,
"coverage": "87%", "security_scan": "clean"
}
}))Regression Monitor (CC 2.1.71)
回归监控(CC 2.1.71)
Optionally schedule post-verification monitoring:
python
undefined可选:安排验证后的监控任务:
python
undefinedGuard: Skip cron in headless/CI (CLAUDE_CODE_DISABLE_CRON)
防护:在无头/CI环境中跳过定时任务(CLAUDE_CODE_DISABLE_CRON)
if env CLAUDE_CODE_DISABLE_CRON is set, run a single check instead
如果设置了环境变量CLAUDE_CODE_DISABLE_CRON,则仅运行一次检查
CronCreate(
schedule="0 8 * * *",
prompt="Daily regression check: npm test.
If 7 consecutive passes → CronDelete.
If failures → alert with details."
)
---CronCreate(
schedule="0 8 * * *",
prompt="每日回归检查:npm test.\n 如果连续7次通过 → 删除定时任务.\n 如果失败 → 发送包含详细信息的警报."
)
---Task Management (CC 2.1.16)
任务管理(CC 2.1.16)
python
undefinedpython
undefinedCreate main verification task
创建主验证任务
TaskCreate(
subject="Verify [feature-name] implementation",
description="Comprehensive verification with nuanced grading",
activeForm="Verifying [feature-name] implementation"
)
TaskCreate(
subject="验证[功能名称]实现",
description="借助精细化评分实现全面验证",
activeForm="正在验证[功能名称]实现"
)
Create subtasks for 8-phase process
为8阶段流程创建子任务
phases = ["Run code quality checks", "Execute security audit",
"Verify test coverage", "Validate API", "Check UI/UX",
"Calculate grades", "Generate suggestions", "Compile report"]
for phase in phases:
TaskCreate(subject=phase, activeForm=f"{phase}ing")
---phases = ["运行代码质量检查", "执行安全审计",
"验证测试覆盖率", "验证API", "检查UI/UX",
"计算评分", "生成改进建议", "编译报告"]
for phase in phases:
TaskCreate(subject=phase, activeForm=f"正在{phase}")
---8-Phase Workflow
8阶段工作流
Load details: for complete phase details, agent spawn definitions, Agent Teams alternative, and team teardown.
Read("${CLAUDE_SKILL_DIR}/references/verification-phases.md")| Phase | Activities | Output |
|---|---|---|
| 1. Context Gathering | Git diff, commit history | Changes summary |
| 2. Parallel Agent Dispatch | 6 agents evaluate | 0-10 scores |
| 2.5 Visual Capture | Screenshot routes, AI vision eval | Gallery + visual score |
| 3. Test Execution | Backend + frontend tests | Coverage data |
| 4. Nuanced Grading | Composite score calculation | Grade (A-F) |
| 5. Improvement Suggestions | Effort vs impact analysis | Prioritized list |
| 6. Alternative Comparison | Compare approaches (optional) | Recommendation |
| 7. Metrics Tracking | Trend analysis | Historical data |
| 8. Report Compilation | Evidence artifacts + gallery.html | Final report |
| 8.5 Agentation Loop | User annotates, ui-feedback fixes | Before/after diffs |
加载详细信息: 用于完整的阶段详情、Agent生成定义、Agent团队替代方案以及团队拆解流程。
Read("${CLAUDE_SKILL_DIR}/references/verification-phases.md")| 阶段 | 活动 | 输出 |
|---|---|---|
| 1. 上下文收集 | Git差异、提交历史 | 变更摘要 |
| 2. 并行Agent调度 | 6个Agent评估 | 0-10分评分 |
| 2.5 视觉捕获 | 路由截图、AI视觉评估 | 图库 + 视觉评分 |
| 3. 测试执行 | 后端 + 前端测试 | 覆盖率数据 |
| 4. 精细化评分 | 综合评分计算 | 等级(A-F) |
| 5. 改进建议 | 工作量与影响分析 | 优先级列表 |
| 6. 方案对比 | 对比不同实现方式(可选) | 推荐方案 |
| 7. 指标跟踪 | 趋势分析 | 历史数据 |
| 8. 报告编译 | 证据工件 + gallery.html | 最终报告 |
| 8.5 Agent化视觉反馈 | 用户标注、ui-feedback修复 | 前后对比差异 |
Phase 2 Agents (Quick Reference)
阶段2 Agent快速参考
| Agent | Focus | Output |
|---|---|---|
| code-quality-reviewer | Lint, types, patterns | Quality 0-10 |
| security-auditor | OWASP, secrets, CVEs | Security 0-10 |
| test-generator | Coverage, test quality | Coverage 0-10 |
| backend-system-architect | API design, async | API 0-10 |
| frontend-ui-developer | React 19, Zod, a11y | UI 0-10 |
| python-performance-engineer | Latency, resources, scaling | Performance 0-10 |
Launch ALL agents in ONE message with and .
run_in_background=Truemax_turns=25| Agent | 关注点 | 输出 |
|---|---|---|
| code-quality-reviewer | 代码规范、类型、模式 | 质量评分0-10 |
| security-auditor | OWASP、敏感信息、CVE | 安全评分0-10 |
| test-generator | 覆盖率、测试质量 | 覆盖率评分0-10 |
| backend-system-architect | API设计、异步处理 | API评分0-10 |
| frontend-ui-developer | React 19、Zod、无障碍 | UI评分0-10 |
| python-performance-engineer | 延迟、资源、扩展性 | 性能评分0-10 |
使用和在一条消息中启动所有Agent。
run_in_background=Truemax_turns=25Progressive Output (CC 2.1.76)
渐进式输出(CC 2.1.76)
Output each agent's score as soon as it completes — don't wait for all 6-7 agents:
Security: 8.2/10 — No critical vulnerabilities found
Code Quality: 7.5/10 — 3 complexity hotspots identified
[...remaining agents still running...]This gives users real-time visibility into multi-agent verification. If any dimension scores below the threshold (default 5.0), flag it as a blocker immediately — the user can terminate early without waiting for remaining agents.
security_minimum每个Agent完成后立即输出其评分——无需等待所有6-7个Agent完成:
安全: 8.2/10 — 未发现严重漏洞
代码质量: 7.5/10 — 识别出3个复杂度热点
[...剩余Agent仍在运行...]这让用户可以实时了解多Agent验证的进度。如果任何维度的评分低于阈值(默认5.0),会立即标记为阻塞项——用户无需等待剩余Agent完成即可提前终止验证。
security_minimumPhase 2.5: Visual Capture (NEW — runs in parallel with Phase 2)
阶段2.5:视觉捕获(新增 — 与阶段2并行运行)
Load details: for auto-detection, route discovery, screenshot capture, and AI vision evaluation.
Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")Summary: Auto-detects project framework, starts dev server, discovers routes, uses agent-browser to screenshot each route, evaluates with Claude vision, generates self-contained with base64-embedded images.
gallery.htmlOutput: — open in browser to see all screenshots with AI evaluations, scores, and annotation diffs.
verification-output/{timestamp}/gallery.htmlGraceful degradation: If no frontend detected or server won't start, skips visual capture with a warning — never blocks verification.
加载详细信息: 用于自动检测、路由发现、截图捕获以及AI视觉评估。
Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")摘要:自动检测项目框架,启动开发服务器,发现路由,使用agent-browser为每个路由截图,通过Claude视觉进行评估,生成包含base64嵌入图片的独立文件。
gallery.html输出: — 在浏览器中打开即可查看所有截图及AI评估、评分和标注差异。
verification-output/{timestamp}/gallery.html优雅降级:如果未检测到前端或服务器无法启动,会跳过视觉捕获并发出警告——绝不会阻塞验证流程。
Phase 8.5: Agentation Visual Feedback (opt-in)
阶段8.5:Agent化视觉反馈(可选)
Load details: (Phase 8.5 section) for agentation loop workflow.
Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")Trigger: Only when agentation MCP is configured. Offers user the choice to annotate the live UI. agent processes annotations, re-screenshots show before/after.
ui-feedback加载详细信息:(阶段8.5部分)用于Agent化循环工作流。
Read("${CLAUDE_SKILL_DIR}/references/visual-capture.md")触发条件:仅当配置了Agent化MCP时生效。为用户提供标注实时UI的选项。Agent处理标注内容,重新截图展示前后对比。
ui-feedbackGrading & Scoring
评分机制
Load for dimensions, weights, grade thresholds, and improvement prioritization. Load for verify-specific extensions (Visual dimension). Load for per-agent scoring criteria.
Read("${CLAUDE_PLUGIN_ROOT}/skills/quality-gates/references/unified-scoring-framework.md")Read("${CLAUDE_SKILL_DIR}/references/quality-model.md")Read("${CLAUDE_SKILL_DIR}/references/grading-rubric.md")加载 用于维度、权重、等级阈值以及改进优先级设置。加载 用于验证专用扩展(视觉维度)。加载 用于每个Agent的评分标准。
Read("${CLAUDE_PLUGIN_ROOT}/skills/quality-gates/references/unified-scoring-framework.md")Read("${CLAUDE_SKILL_DIR}/references/quality-model.md")Read("${CLAUDE_SKILL_DIR}/references/grading-rubric.md")Evidence & Test Execution
证据与测试执行
Load details: for git commands, test execution patterns, metrics tracking, and post-verification feedback.
Read("${CLAUDE_SKILL_DIR}/rules/evidence-collection.md")加载详细信息: 用于Git命令、测试执行模式、指标跟踪以及验证后反馈。
Read("${CLAUDE_SKILL_DIR}/rules/evidence-collection.md")Policy-as-Code
策略即代码
Load details: for configuration.
Read("${CLAUDE_SKILL_DIR}/references/policy-as-code.md")Define verification rules in :
.claude/policies/verification-policy.jsonjson
{
"thresholds": {
"composite_minimum": 6.0,
"security_minimum": 7.0,
"coverage_minimum": 70
},
"blocking_rules": [
{"dimension": "security", "below": 5.0, "action": "block"}
]
}加载详细信息: 用于配置说明。
Read("${CLAUDE_SKILL_DIR}/references/policy-as-code.md")在中定义验证规则:
.claude/policies/verification-policy.jsonjson
{
"thresholds": {
"composite_minimum": 6.0,
"security_minimum": 7.0,
"coverage_minimum": 70
},
"blocking_rules": [
{"dimension": "security", "below": 5.0, "action": "block"}
]
}Report Format
报告格式
Load details: for full format. Summary:
Read("${CLAUDE_SKILL_DIR}/references/report-template.md")markdown
undefined加载详细信息: 用于完整格式说明。摘要:
Read("${CLAUDE_SKILL_DIR}/references/report-template.md")markdown
undefinedFeature Verification Report
功能验证报告
Composite Score: [N.N]/10 (Grade: [LETTER])
综合评分:[N.N]/10(等级:[字母])
Verdict
结论
[READY FOR MERGE | IMPROVEMENTS RECOMMENDED | BLOCKED]
---[可合并 | 建议改进 | 阻塞]
---References
参考资料
Load on demand with :
Read("${CLAUDE_SKILL_DIR}/references/<file>")| File | Content |
|---|---|
| 8-phase workflow, agent spawn definitions, Agent Teams mode |
| Phase 2.5 + 8.5: screenshot capture, AI vision, gallery generation, agentation loop |
| Scoring dimensions and weights (8 unified) |
| Per-agent scoring criteria |
| Full report format with visual evidence section |
| Approach comparison template |
| Agent Teams vs Task Tool |
| Verification policy configuration |
| Pre-flight checklist |
按需加载:
Read("${CLAUDE_SKILL_DIR}/references/<file>")| 文件 | 内容 |
|---|---|
| 8阶段工作流、Agent生成定义、Agent团队模式 |
| 阶段2.5 + 8.5:截图捕获、AI视觉、图库生成、Agent化循环 |
| 评分维度与权重(8个统一维度) |
| 每个Agent的评分标准 |
| 包含视觉证据部分的完整报告格式 |
| 方案对比模板 |
| Agent团队 vs 任务工具 |
| 验证策略配置 |
| 预检查清单 |
Rules
规则
Load on demand with :
Read("${CLAUDE_SKILL_DIR}/rules/<file>")| File | Content |
|---|---|
| Composite scoring, grades, verdicts |
| Evidence gathering and test patterns |
按需加载:
Read("${CLAUDE_SKILL_DIR}/rules/<file>")| 文件 | 内容 |
|---|---|
| 综合评分、等级、结论 |
| 证据收集与测试模式 |
Related Skills
相关技能
- - Full implementation with verification
ork:implement - - PR-specific verification
ork:review-pr - /
testing-unit/testing-integration- Test execution patternstesting-e2e - - Quality gate patterns
ork:quality-gates - - Browser automation for visual capture
browser-tools
Version: 4.2.0 (March 2026) — Added progressive output for incremental agent scores
- - 带验证的完整实现
ork:implement - - 针对PR的验证
ork:review-pr - /
testing-unit/testing-integration- 测试执行模式testing-e2e - - 质量门禁模式
ork:quality-gates - - 用于视觉捕获的浏览器自动化
browser-tools
版本: 4.2.0(2026年3月)—— 新增了增量Agent评分的渐进式输出