council
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
Chinese/council — Multi-Model Consensus Council
/council — 多模型共识委员会
Spawn parallel judges with different perspectives, consolidate into consensus. Works for any task — validation, research, brainstorming.
生成具有不同视角的并行评审Agent,最终整合为共识结论。适用于任何任务——验证、研究、头脑风暴。
Quick Start
快速开始
bash
/council --quick validate recent # fast inline check
/council validate this plan # validation (2 agents)
/council brainstorm caching approaches # brainstorm
/council validate the implementation # validation (critique triggers map here)
/council research kubernetes upgrade strategies # research
/council research the CI/CD pipeline bottlenecks # research (analyze triggers map here)
/council --preset=security-audit validate the auth system # preset personas
/council --deep --explorers=3 research upgrade automation # deep + explorers
/council --debate validate the auth system # adversarial 2-round review
/council --deep --debate validate the migration plan # thorough + debate
/council # infers from contextCouncil works independently — no RPI workflow, no ratchet chain, no CLI required. Zero setup beyond plugin install.
aobash
/council --quick validate recent # 快速内联检查
/council validate this plan # 验证(2个Agent)
/council brainstorm caching approaches # 头脑风暴
/council validate the implementation # 验证(critique触发词映射至此)
/council research kubernetes upgrade strategies # 研究
/council research the CI/CD pipeline bottlenecks # 研究(analyze触发词映射至此)
/council --preset=security-audit validate the auth system # 使用预设角色
/council --deep --explorers=3 research upgrade automation # 深度模式 + 探索者Agent
/council --debate validate the auth system # 对抗式两轮评审
/council --deep --debate validate the migration plan # 全面评审 + 辩论模式
/council # 从上下文推断任务Council可独立运行——无需RPI工作流、无需棘轮链、无需 CLI。仅需安装插件,零额外配置。
aoModes
模式
| Mode | Agents | Execution Backend | Use Case |
|---|---|---|---|
| 0 (inline) | Self | Fast single-agent check, no spawning |
| default | 2 | Runtime-native (Codex sub-agents preferred; Claude teams fallback) | Independent judges (no perspective labels) |
| 3 | Runtime-native | Thorough review |
| 3+3 | Runtime-native + Codex CLI | Cross-vendor consensus |
| 2+ | Runtime-native | Adversarial refinement (2 rounds) |
bash
/council --quick validate recent # inline single-agent check, no spawning
/council recent # 2 runtime-native judges
/council --deep recent # 3 runtime-native judges
/council --mixed recent # runtime-native + Codex CLI| 模式 | Agent数量 | 执行后端 | 适用场景 |
|---|---|---|---|
| 0(内联) | 当前Agent自身 | 快速单Agent检查,不生成新Agent |
| 默认 | 2 | 原生运行时(优先使用Codex子Agent;回退至Claude团队) | 独立评审(无视角标签) |
| 3 | 原生运行时 | 全面评审 |
| 3+3 | 原生运行时 + Codex CLI | 跨供应商共识 |
| 2+ | 原生运行时 | 对抗式优化(两轮) |
bash
/council --quick validate recent # 内联单Agent检查,不生成新Agent
/council recent # 2个原生运行时评审Agent
/council --deep recent # 3个原生运行时评审Agent
/council --mixed recent # 原生运行时 + Codex CLISpawn Backend Selection (MANDATORY)
生成后端选择(必填)
Council must auto-select backend using capability detection:
- If is available, use Codex experimental sub-agents
spawn_agent - Else if is available, use Claude native teams
TeamCreate - Else use Task(run_in_background=true) fallback
This keeps universal across Claude and Codex sessions.
/councilCouncil必须通过能力检测自动选择后端:
- 若可用,使用Codex实验性子Agent
spawn_agent - 若可用,使用Claude原生团队
TeamCreate - 否则使用**Task(run_in_background=true)**作为回退方案
这确保可在Claude和Codex会话中通用。
/councilWhen to Use --debate
--debate何时使用--debate
模式
--debateUse for high-stakes or ambiguous reviews where judges are likely to disagree:
--debate- Security audits, architecture decisions, migration plans
- Reviews where multiple valid perspectives exist
- Cases where a missed finding has real consequences
Skip for routine validation where consensus is expected. Debate adds R2 latency (judges stay alive and process a second round via backend messaging).
--debateIncompatibilities:
- and
--quickcannot be combined.--debateruns inline with no spawning;--quickrequires multi-agent rounds. If both are passed, exit with error: "Error: --quick and --debate are incompatible."--debate - is only supported with validate mode. Brainstorm and research do not produce PASS/WARN/FAIL verdicts. If combined, exit with error: "Error: --debate is only supported with validate mode."
--debate
在高风险或模糊的评审场景中,当评审Agent可能存在分歧时,使用:
--debate- 安全审计、架构决策、迁移计划
- 存在多种合理视角的评审
- 遗漏结论会产生实际影响的场景
在预期能达成共识的常规验证场景中,跳过。辩论模式会增加R2延迟(评审Agent会保持活跃,并通过后端消息处理第二轮评审)。
--debate不兼容情况:
- 和
--quick无法组合使用。--debate以内联方式运行,不生成新Agent;--quick需要多Agent轮次。若同时传入,将返回错误:"Error: --quick and --debate are incompatible."--debate - 仅支持验证模式。头脑风暴和研究模式不会生成PASS/WARN/FAIL结论。若组合使用,将返回错误:"Error: --debate is only supported with validate mode."
--debate
Task Types
任务类型
| Type | Trigger Words | Perspective Focus |
|---|---|---|
| validate | validate, check, review, assess, critique, feedback, improve | Is this correct? What's wrong? What could be better? |
| brainstorm | brainstorm, explore, options, approaches | What are the alternatives? Pros/cons? |
| research | research, investigate, deep dive, explore deeply, analyze, examine, evaluate, compare | What can we discover? What are the properties, trade-offs, and structure? |
Natural language works — the skill infers task type from your prompt.
| 类型 | 触发词 | 视角重点 |
|---|---|---|
| 验证 | validate, check, review, assess, critique, feedback, improve | 内容是否正确?存在哪些问题?如何优化? |
| 头脑风暴 | brainstorm, explore, options, approaches | 有哪些替代方案?优缺点分别是什么? |
| 研究 | research, investigate, deep dive, explore deeply, analyze, examine, evaluate, compare | 我们能发现什么?其特性、权衡和结构是什么? |
支持自然语言触发——工具会从你的提示中推断任务类型。
Architecture
架构
Execution Flow
执行流程
┌─────────────────────────────────────────────────────────────────┐
│ Phase 1: Build Packet (JSON) │
│ - Task type (validate/brainstorm/research) │
│ - Target description │
│ - Context (files, diffs, prior decisions) │
│ - Perspectives to assign │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Phase 1a: Select spawn backend │
│ codex_subagents | claude_teams | background_fallback │
│ Team lead = spawner (this agent) │
└─────────────────────────────────────────────────────────────────┘
│
┌─────────────────┴─────────────────┐
▼ ▼
┌───────────────────────┐ ┌───────────────────────┐
│ RUNTIME-NATIVE JUDGES│ │ CODEX AGENTS │
│ (spawn_agent or teams)│ │ (Bash tool, parallel)│
│ │ │ Agent 1 (independent │
│ Agent 1 (independent │ │ or with preset) │
│ or with preset) │ │ Agent 2 │
│ Agent 2 │ │ Agent 3 │
│ Agent 3 (--deep only)│ │ (--mixed only) │
│ (--deep/--mixed only)│ │ │
│ │ │ Output: JSON + MD │
│ Write files, then │ │ Files: .agents/ │
│ wait()/SendMessage to │ │ council/codex-* │
│ lead │ │ │
│ Files: .agents/ │ └───────────────────────┘
│ council/claude-* │ │
└───────────────────────┘ │
│ │
└─────────────────┬─────────────────┘
▼
┌─────────────────────────────────────────────────────────────────┐
│ Phase 2: Consolidation (Team Lead) │
│ - Receive completion from backend channel (wait/SendMessage) │
│ - Read all agent output files │
│ - If schema_version is missing from a judge's output, treat │
│ as version 0 (backward compatibility) │
│ - Compute consensus verdict │
│ - Identify shared findings │
│ - Surface disagreements with attribution │
│ - Generate Markdown report for human │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Phase 3: Cleanup │
│ - Cleanup backend resources (close_agent / TeamDelete / none) │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ Output: Markdown Council Report │
│ - Consensus: PASS/WARN/FAIL │
│ - Shared findings │
│ - Disagreements (if any) │
│ - Recommendations │
└─────────────────────────────────────────────────────────────────┘┌─────────────────────────────────────────────────────────────────┐
│ 阶段1:构建数据包(JSON) │
│ - 任务类型(validate/brainstorm/research) │
│ - 目标描述 │
│ - 上下文(文件、差异、先前决策) │
│ - 要分配的视角 │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ 阶段1a:选择生成后端 │
│ codex_subagents | claude_teams | background_fallback │
│ 团队负责人 = 生成器(当前Agent) │
└─────────────────────────────────────────────────────────────────┘
│
┌─────────────────┴─────────────────┐
▼ ▼
┌───────────────────────┐ ┌───────────────────────┐
│ 原生运行时评审Agent │ │ CODEX AGENTS │
│ (spawn_agent或teams) │ │ (Bash工具,并行执行) │
│ │ │ Agent 1(独立或使用预设) │
│ Agent 1(独立或使用预设) │ │ │
│ Agent 2 │ │ Agent 2 │
│ Agent 3(仅--deep模式) │ │ Agent 3 │
│ (仅--deep/--mixed模式) │ │ (仅--mixed模式) │
│ │ │ │
│ 写入文件,然后通过wait()/SendMessage发送给负责人 │ │ 输出:JSON + MD │
│ 文件路径:.agents/ │ │ 文件路径:.agents/ │
│ council/claude-* │ │ council/codex-* │
└───────────────────────┘ │
│ │
└─────────────────┬─────────────────┘
▼
┌─────────────────────────────────────────────────────────────────┐
│ 阶段2:整合(团队负责人) │
│ - 从后端通道接收完成结果(wait/SendMessage) │
│ - 读取所有Agent的输出文件 │
│ - 若评审Agent的输出中缺少schema_version,视为版本0(向后兼容) │
│ - 计算共识结论 │
│ - 识别共同发现 │
│ - 标注分歧点及对应Agent │
│ - 生成供人类阅读的Markdown报告 │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ 阶段3:清理 │
│ - 清理后端资源(close_agent / TeamDelete / 无操作) │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────────┐
│ 输出:Markdown格式的委员会报告 │
│ - 共识结论:PASS/WARN/FAIL │
│ - 共同发现 │
│ - 分歧点(若存在) │
│ - 建议 │
└─────────────────────────────────────────────────────────────────┘Graceful Degradation
优雅降级
| Failure | Behavior |
|---|---|
| 1 of N agents times out | Proceed with N-1, note in report |
| All Codex CLI agents fail | Proceed with runtime-native judges only, note degradation |
| All agents fail | Return error, suggest retry |
| Codex CLI not installed | Skip Codex CLI judges, continue runtime-native mode (warn user) |
| Codex sub-agents unavailable | Fall back to Claude teams |
| Native teams unavailable | Fall back to |
| Output dir missing | Create |
Timeout: 120s per agent (configurable via in seconds).
--timeout=NMinimum quorum: At least 1 agent must respond for a valid council. If 0 agents respond, return error.
| 故障场景 | 行为 |
|---|---|
| N个Agent中有1个超时 | 继续使用N-1个Agent的结果,并在报告中注明 |
| 所有Codex CLI Agent失败 | 仅使用原生运行时评审Agent继续执行,并注明降级情况 |
| 所有Agent失败 | 返回错误,建议重试 |
| 未安装Codex CLI | 跳过Codex CLI评审Agent,继续使用原生运行时模式(向用户发出警告) |
| Codex子Agent不可用 | 回退至Claude团队 |
| 原生团队不可用 | 回退至 |
| 输出目录不存在 | 自动创建 |
超时设置:每个Agent超时时间为120秒(可通过参数以秒为单位配置)。
--timeout=N最低法定人数: 至少需要1个Agent响应才能生成有效结论。若0个Agent响应,返回错误。
Pre-Flight Checks
预检查
- Runtime-native backend: Select via capability detection (->
spawn_agent->TeamCreate).Task(run_in_background=true) - Codex CLI judges (--mixed only): Check , test model availability, test
which codexsupport. Downgrade mixed mode when unavailable.--output-schema - Agent count: Verify
judges * (1 + explorers) <= MAX_AGENTS (12) - Output dir:
mkdir -p .agents/council
- 原生运行时后端: 通过能力检测选择(→
spawn_agent→TeamCreate)。Task(run_in_background=true) - Codex CLI评审Agent(仅--mixed模式): 检查、测试模型可用性、测试
which codex支持。不可用时降级混合模式。--output-schema - Agent数量: 验证
评审Agent数量 * (1 + 探索者数量) <= 最大Agent数(12) - 输出目录: 执行
mkdir -p .agents/council
Quick Mode (--quick
)
--quick快速模式(--quick
)
--quickSingle-agent inline validation. No subprocess spawning, no Task tool, no Codex. The current agent performs a structured self-review using the same output schema as a full council.
When to use: Routine checks, mid-implementation sanity checks, pre-commit quick scan.
Execution: Gather context (files, diffs) -> perform structured self-review inline using the council output_schema (verdict, confidence, findings, recommendation) -> write report to labeled as .
.agents/council/YYYY-MM-DD-quick-<target>.mdMode: quick (single-agent)Limitations: No cross-perspective disagreement, no cross-vendor insights, lower confidence ceiling. Not suitable for security audits or architecture decisions.
单Agent内联验证。不生成子进程、不使用Task工具、不调用Codex。当前Agent使用与完整委员会相同的输出 schema,执行结构化自我评审。
适用场景: 常规检查、实现过程中的 sanity check、提交前快速扫描。
执行流程: 收集上下文(文件、差异)→ 使用委员会输出 schema 内联执行结构化自我评审(结论、置信度、发现、建议)→ 将报告写入,标记为。
.agents/council/YYYY-MM-DD-quick-<target>.mdMode: quick (single-agent)局限性: 无跨视角分歧分析、无跨供应商见解、置信度上限较低。不适用于安全审计或架构决策。
Packet Format (JSON)
数据包格式(JSON)
The packet sent to each agent. File contents are included inline — agents receive the actual code/plan text in the packet, not just paths. This ensures both Claude and Codex agents can analyze without needing file access.
json
{
"council_packet": {
"version": "1.0",
"mode": "validate | brainstorm | research",
"target": "Implementation of user authentication system",
"context": {
"files": [
{
"path": "src/auth/jwt.py",
"content": "<file contents inlined here>"
},
{
"path": "src/auth/middleware.py",
"content": "<file contents inlined here>"
}
],
"diff": "git diff output if applicable",
"spec": {
"source": "bead na-0042 | plan doc | none",
"content": "The spec/bead description text (optional — included when wrapper provides it)"
},
"prior_decisions": [
"Using JWT, not sessions",
"Refresh tokens required"
]
},
"perspective": "skeptic (only when --preset or --perspectives used)",
"perspective_description": "What could go wrong? (only when --preset or --perspectives used)",
"output_schema": {
"verdict": "PASS | WARN | FAIL",
"confidence": "HIGH | MEDIUM | LOW",
"key_insight": "Single sentence summary",
"findings": [
{
"severity": "critical | significant | minor",
"category": "security | architecture | performance | style",
"description": "What was found",
"location": "file:line if applicable",
"recommendation": "How to address"
}
],
"recommendation": "Concrete next step",
"schema_version": 1
}
}
}发送给每个Agent的数据包。文件内容将内联包含——Agent会收到实际的代码/计划文本,而非仅路径。这确保Claude和Codex Agent无需文件访问即可进行分析。
json
{
"council_packet": {
"version": "1.0",
"mode": "validate | brainstorm | research",
"target": "用户认证系统实现方案",
"context": {
"files": [
{
"path": "src/auth/jwt.py",
"content": "<此处内联文件内容>"
},
{
"path": "src/auth/middleware.py",
"content": "<此处内联文件内容>"
}
],
"diff": "git diff输出(若适用)",
"spec": {
"source": "bead na-0042 | 计划文档 | 无",
"content": "规格/bead描述文本(可选——当包装器提供时包含)"
},
"prior_decisions": [
"使用JWT,不使用会话",
"需要刷新令牌"
]
},
"perspective": "skeptic(仅当使用--preset或--perspectives时)",
"perspective_description": "可能出现哪些问题?(仅当使用--preset或--perspectives时)",
"output_schema": {
"verdict": "PASS | WARN | FAIL",
"confidence": "HIGH | MEDIUM | LOW",
"key_insight": "单句摘要",
"findings": [
{
"severity": "critical | significant | minor",
"category": "security | architecture | performance | style",
"description": "发现的问题",
"location": "文件:行号(若适用)",
"recommendation": "解决建议"
}
],
"recommendation": "具体下一步操作",
"schema_version": 1
}
}
}Perspectives
视角
Perspectives & Presets: Usetool onReadfor persona definitions, preset configurations, and custom perspective details.skills/council/references/personas.md
Auto-Escalation: When or specifies more perspectives than the current judge count, automatically escalate judge count to match. The flag overrides auto-escalation.
--preset--perspectives--count视角与预设: 使用Read工具读取,获取角色定义、预设配置和自定义视角详情。skills/council/references/personas.md
自动升级: 当或指定的视角数量超过当前评审Agent数量时,自动将评审Agent数量升级至匹配视角数量。参数可覆盖自动升级。
--preset--perspectives--countExplorer Sub-Agents
探索者子Agent
Explorer Details: Usetool onReadfor explorer architecture, prompts, sub-question generation, and timeout configuration.skills/council/references/explorers.md
Summary: Judges can spawn explorer sub-agents (, max 5) for parallel deep-dive research. Total agents = , capped at MAX_AGENTS=12.
--explorers=Njudges * (1 + explorers)探索者详情: 使用Read工具读取,获取探索者架构、提示词、子问题生成和超时配置。skills/council/references/explorers.md
概述: 评审Agent可生成探索者子Agent(,最多5个)用于并行深度研究。总Agent数量 = ,上限为12个。
--explorers=N评审Agent数量 * (1 + 探索者数量)Debate Phase (--debate
)
--debate辩论阶段(--debate
)
--debateDebate Protocol: Usetool onReadfor full debate execution flow, R1-to-R2 verdict injection, timeout handling, and cost analysis.skills/council/references/debate-protocol.md
Summary: Two-round adversarial review. R1 produces independent verdicts. R2 sends other judges' verdicts via backend messaging ( or ) for steel-manning and revision. Only supported with validate mode.
send_inputSendMessage辩论协议: 使用Read工具读取,获取完整辩论执行流程、R1到R2结论注入、超时处理和成本分析。skills/council/references/debate-protocol.md
概述: 两轮对抗式评审。R1生成独立结论。R2通过后端消息(或)将其他评审Agent的结论发送给每个Agent,用于观点强化和修订。仅支持验证模式。
send_inputSendMessageAgent Prompts
Agent提示词
Agent Prompts: Usetool onReadfor judge prompts (default and perspective-based), consolidation prompt, and debate R2 message template.skills/council/references/agent-prompts.md
Agent提示词: 使用Read工具读取,获取评审Agent提示词(默认和基于视角的)、整合提示词和辩论R2消息模板。skills/council/references/agent-prompts.md
Consensus Rules
共识规则
| Condition | Verdict |
|---|---|
| All PASS | PASS |
| Any FAIL | FAIL |
| Mixed PASS/WARN | WARN |
| All WARN | WARN |
Disagreement handling:
- If Claude says PASS and Codex says FAIL → DISAGREE (surface both)
- Severity-weighted: Security FAIL outweighs style WARN
DISAGREE resolution: When vendors disagree, the spawner presents both positions with reasoning and defers to the user. No automatic tie-breaking — cross-vendor disagreement is a signal worth human attention.
| 条件 | 结论 |
|---|---|
| 所有Agent返回PASS | PASS |
| 任意Agent返回FAIL | FAIL |
| PASS/WARN混合 | WARN |
| 所有Agent返回WARN | WARN |
分歧处理:
- 若Claude返回PASS而Codex返回FAIL → 标记为DISAGREE(展示双方结论)
- 按严重程度加权:安全相关的FAIL优先级高于风格相关的WARN
DISAGREE解决方式: 当不同供应商的Agent存在分歧时,生成器会展示双方立场及理由,交由用户决策。不自动打破平局——跨供应商分歧值得人工关注。
Output Format
输出格式
Report Templates: Usetool onReadfor full report templates (validate, brainstorm, research) and debate report additions (verdict shifts, convergence detection).skills/council/references/output-format.md
All reports write to .
.agents/council/YYYY-MM-DD-<type>-<target>.md报告模板: 使用Read工具读取,获取完整报告模板(验证、头脑风暴、研究)和辩论报告附加内容(结论变化、收敛检测)。skills/council/references/output-format.md
所有报告均写入。
.agents/council/YYYY-MM-DD-<type>-<target>.mdConfiguration
配置
Partial Completion
部分完成
Minimum quorum: 1 agent. Recommended: 80% of judges. On timeout, proceed with remaining judges and note in report. On user cancellation, shutdown all judges and generate partial report with INCOMPLETE marker.
最低法定人数: 1个Agent。推荐: 80%的评审Agent响应。超时情况下,使用剩余Agent的结果并在报告中注明。用户取消时,关闭所有评审Agent并生成标记为INCOMPLETE的部分报告。
Environment Variables
环境变量
| Variable | Default | Description |
|---|---|---|
| 120 | Agent timeout in seconds |
| gpt-5.3-codex | Default Codex model for --mixed |
| opus | Claude model for agents |
| sonnet | Model for explorer sub-agents |
| 60 | Explorer timeout in seconds |
| 90 | Maximum wait time for R2 debate completion after sending debate messages. Shorter than R1 since judges already have context. |
| 变量 | 默认值 | 描述 |
|---|---|---|
| 120 | Agent超时时间(秒) |
| gpt-5.3-codex | --mixed模式默认使用的Codex模型 |
| opus | Agent使用的Claude模型 |
| sonnet | 探索者子Agent使用的模型 |
| 60 | 探索者超时时间(秒) |
| 90 | 发送辩论消息后,等待R2辩论完成的最长时间。由于评审Agent已有上下文,此时间比R1短。 |
Flags
命令行参数
| Flag | Description |
|---|---|
| 3 Claude agents instead of 2 |
| Add 3 Codex agents |
| Enable adversarial debate round (2 rounds via backend messaging, same agents). Incompatible with |
| Override timeout in seconds (default: 120) |
| Custom perspective names |
| Built-in persona preset (security-audit, architecture, research, ops, code-review, plan-review, retrospective) |
| Override agent count per vendor (e.g., |
| Explorer sub-agents per judge (default: 0, max: 5). Max effective value depends on judge count. Total agents capped at 12. |
| Override explorer model (default: sonnet) |
| 参数 | 描述 |
|---|---|
| 使用3个Claude Agent替代默认的2个 |
| 添加3个Codex Agent |
| 启用对抗式辩论轮次(通过后端消息进行2轮,使用相同Agent)。与 |
| 覆盖超时时间(秒),默认120 |
| 自定义视角名称 |
| 内置角色预设(security-audit、architecture、research、ops、code-review、plan-review、retrospective) |
| 覆盖每个供应商的Agent数量(例如 |
| 每个评审Agent对应的探索者子Agent数量(默认0,最大5)。实际最大值取决于评审Agent数量。总Agent数量上限为12。 |
| 覆盖探索者使用的模型(默认sonnet) |
CLI Spawning Commands
CLI生成命令
CLI Spawning: Usetool onReadfor team setup, Claude/Codex agent spawning, parallel execution, debate R2 commands, cleanup, and model selection.skills/council/references/cli-spawning.md
CLI生成: 使用Read工具读取,获取团队设置、Claude/Codex Agent生成、并行执行、辩论R2命令、清理和模型选择。skills/council/references/cli-spawning.md
Examples
示例
bash
/council validate recent # 2 judges, recent commits
/council --deep --preset=architecture research the auth system # 3 judges with architecture personas
/council --mixed validate this plan # 3 Claude + 3 Codex
/council --deep --explorers=3 research upgrade patterns # 12 agents (3 judges x 4)
/council --preset=security-audit --deep validate the API # attacker, defender, compliance
/council brainstorm caching strategies for the API # 2 judges explore options
/council research Redis vs Memcached for session storage # 2 judges assess trade-offs
/council validate the implementation plan in PLAN.md # structured plan feedbackbash
/council validate recent # 2个评审Agent,评审最近提交
/council --deep --preset=architecture research the auth system # 3个使用架构角色的评审Agent
/council --mixed validate this plan # 3个Claude + 3个Codex Agent
/council --deep --explorers=3 research upgrade patterns # 12个Agent(3个评审Agent ×4)
/council --preset=security-audit --deep validate the API # 攻击者、防御者、合规视角
/council brainstorm caching strategies for the API # 2个评审Agent探索方案
/council research Redis vs Memcached for session storage # 2个评审Agent评估权衡
/council validate the implementation plan in PLAN.md # 结构化计划反馈Migration from /judge
从/judge迁移
/council/judge| Old | New |
|---|---|
| |
| |
| |
The skill is deprecated. Use .
/judge/council/council/judge| 旧命令 | 新命令 |
|---|---|
| |
| |
| |
/judge/councilRuntime-Native Architecture
原生运行时架构
Council uses runtime-native spawning as primary:
- Codex sessions: experimental sub-agents (,
spawn_agent,wait,send_input)close_agent - Claude sessions: native teams (,
TeamCreate, sharedSendMessage)TaskList - Fallback:
Task(run_in_background=true)
Council将原生运行时生成为主要方式:
- Codex会话:实验性子Agent(、
spawn_agent、wait、send_input)close_agent - Claude会话:原生团队(、
TeamCreate、共享SendMessage)TaskList - 回退方案:
Task(run_in_background=true)
Deliberation Protocol
审议协议
The flag implements the deliberation protocol pattern:
--debateIndependent assessment → evidence exchange → position revision → convergence analysis
Runtime-native backends make this pattern first-class:
- R1: Judges spawn as sub-agents/teammates, assess independently, return verdicts to lead
- R2: Team lead sends other judges' verdicts via (Codex) or
send_input(Claude). Judges wake from idle with full R1 context.SendMessage - Consolidation: Team lead reads all output files, computes consensus
- Cleanup: (Codex) or
close_agent+shutdown_request(Claude)TeamDelete()
--debate独立评估 → 证据交换 → 立场修订 → 收敛分析
原生运行时后端使该模式成为一等公民:
- R1: 评审Agent作为子Agent/团队成员生成,独立评估,将结论返回给负责人
- R2: 团队负责人通过(Codex)或
send_input(Claude)将其他评审Agent的结论发送给每个Agent。评审Agent从空闲状态唤醒,拥有完整R1上下文。SendMessage - 整合: 团队负责人读取所有输出文件,计算共识结论
- 清理: 执行(Codex)或
close_agent+shutdown_request(Claude)TeamDelete()
Communication Rules
通信规则
- Judges → team lead only. Judges never message each other directly. This prevents anchoring.
- Team lead → judges. Only the team lead sends follow-ups (or
send_input).SendMessage - No shared task mutation by judges. Team lead manages coordination state.
- 仅评审Agent → 团队负责人。 评审Agent之间从不直接通信。这可避免锚定效应。
- 仅团队负责人 → 评审Agent。 仅团队负责人可发送后续消息(或
send_input)。SendMessage - 评审Agent不得修改共享任务。 团队负责人管理协调状态。
Ralph Wiggum Compliance
Ralph Wiggum合规性
Council maintains fresh-context isolation (Ralph Wiggum pattern) with one documented exception:
--debate- Judges benefit from their own R1 analytical context (reasoning chain, not just the verdict JSON) when evaluating other judges' positions in R2
- Re-spawning with only the verdict summary (~200 tokens) would lose the judge's working memory of WHY they reached their verdict
- The exception is bounded: max 2 rounds, within one invocation, with explicit cleanup (close_agent or shutdown_request + TeamDelete)
Without , council is fully Ralph-compliant: each judge is a fresh spawn, executes once, writes output, and terminates.
--debateCouncil保持新鲜上下文隔离(Ralph Wiggum模式),仅一个有文档记录的例外:
--debate- 在R2评估其他评审Agent的立场时,评审Agent可受益于自身R1的分析上下文(推理链,而非仅结论JSON)
- 仅重新生成包含结论摘要(约200个token)的Agent会丢失评审Agent得出结论的推理过程记忆
- 例外情况是有界的:最多2轮,在一次调用内,且有明确的清理步骤(close_agent或shutdown_request + TeamDelete)
不使用时,Council完全符合Ralph规范:每个评审Agent都是全新生成的,执行一次,写入输出,然后终止。
--debateFallback
回退方案
If runtime-native backend is unavailable, fall back to fire-and-forget. In fallback mode:
Task(run_in_background=true)- reverts to R2 re-spawning with truncated R1 verdicts
--debate - The debate report must include in the header so users know results may be lower fidelity
**Fidelity:** degraded (fallback — R1 verdicts truncated for R2 re-spawn) - Non-debate mode works identically (judges write files, team lead reads them)
若原生运行时后端不可用,回退至的即发即忘模式。在回退模式下:
Task(run_in_background=true)- 会回退为R2重新生成Agent,并传入截断的R1结论
--debate - 辩论报告必须在标题中包含,以便用户知晓结果可能保真度较低
**Fidelity:** degraded (fallback — R1 verdicts truncated for R2 re-spawn) - 非辩论模式工作方式与正常情况相同(评审Agent写入文件,团队负责人读取)
Judge Naming
评审Agent命名规范
Convention: (e.g., ).
council-YYYYMMDD-<target>council-20260206-auth-systemJudge names: for independent judges (e.g., , ), or when using presets/perspectives (e.g., , ). Use the same logical names across both Codex and Claude backends.
judge-{N}judge-1judge-2judge-{perspective}judge-error-pathsjudge-feasibility约定:(例如)。
council-YYYYMMDD-<target>council-20260206-auth-system评审Agent名称:独立评审Agent命名为(例如、),使用预设/视角时命名为(例如、)。在Codex和Claude后端使用相同的逻辑名称。
judge-{N}judge-1judge-2judge-{perspective}judge-error-pathsjudge-feasibilitySee Also
另请参阅
- — Complexity + council for code validation (uses
skills/vibe/SKILL.mdwhen spec found)--preset=code-review - — Plan validation (uses
skills/pre-mortem/SKILL.md, always 3 judges)--preset=plan-review - — Work wrap-up (uses
skills/post-mortem/SKILL.md, always 3 judges + retro)--preset=retrospective - — Multi-agent orchestration
skills/swarm/SKILL.md - — Language-specific coding standards
skills/standards/SKILL.md - — Codebase exploration (complementary to council research mode)
skills/research/SKILL.md
- — 结合复杂度分析与委员会进行代码验证(找到规格时使用
skills/vibe/SKILL.md)--preset=code-review - — 计划验证(使用
skills/pre-mortem/SKILL.md,固定3个评审Agent)--preset=plan-review - — 工作收尾(使用
skills/post-mortem/SKILL.md,固定3个评审Agent + 回顾)--preset=retrospective - — 多Agent编排
skills/swarm/SKILL.md - — 特定语言编码标准
skills/standards/SKILL.md - — 代码库探索(与委员会研究模式互补)
skills/research/SKILL.md