deepresearch

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

DeepResearch Methodology

DeepResearch方法论

A comprehensive methodology for conducting rigorous, traceable research projects that produce decision-ready deliverables with evidence-based analysis, structured reasoning, and quality gates.
一套用于开展严谨、可追溯研究项目的综合方法论,可产出基于证据分析、结构化推理且带有质量门控的可决策交付物。

Core Principles

核心原则

  1. Artifact-driven progress: Research is organized into 5 stages, each producing required deliverables
  2. Quality gates: Each stage has explicit quality criteria that must be met before proceeding
  3. Traceable evidence: Every claim must be traceable to sources with proper citation
  4. Structured analysis: Use structured analytic techniques to mitigate bias and improve rigor
  5. Uncertainty expression: Clearly distinguish facts, judgments, and speculation with likelihood and confidence
  6. Decision-ready outputs: Deliverables are structured for immediate use by decision-makers
  1. 工件驱动进度:研究分为5个阶段,每个阶段都需产出指定交付物
  2. 质量门控:每个阶段都有明确的质量标准,达标后方可进入下一阶段
  3. 可追溯证据:所有主张必须可追溯至来源,并配有恰当引用
  4. 结构化分析:使用结构化分析技术减少偏差,提升严谨性
  5. 不确定性表达:清晰区分事实、判断与推测,并标注可能性和置信度
  6. 可决策输出:交付物的结构需便于决策者直接使用

Recommended Tools

推荐工具

Cursor IDE Browser (cursor-ide-browser)

Cursor IDE Browser (cursor-ide-browser)

The cursor-ide-browser MCP server is highly recommended for DeepResearch projects. It enables browser automation directly within Cursor IDE, making information collection, verification, and OSINT work more efficient and traceable.
Key Use Cases:
  1. Information Collection (Stage 3):
    • Navigate to official sources, company websites, regulatory filings
    • Capture screenshots with timestamps for audit trail
    • Extract structured data from web pages
    • Archive web pages before they change or disappear
  2. OSINT Verification (Stage 4):
    • Reverse image search on multiple platforms (Google Images, TinEye, Yandex)
    • Verify social media posts and UGC authenticity
    • Check geolocation using map services
    • Capture verification evidence (screenshots, page snapshots)
  3. Source Archiving:
    • Take full-page screenshots of key sources
    • Capture page snapshots with accessibility tree for later analysis
    • Document page state at time of access (for reproducibility)
  4. Cross-Platform Verification:
    • Navigate between multiple sources to verify consistency
    • Check multiple language versions of same content
    • Verify across different platforms (official site, news, social media)
Best Practices:
  • Always capture screenshots/snapshots when accessing sources (for audit trail)
  • Use browser navigation to verify links are still active
  • Take snapshots before archiving (captures full page state)
  • Use browser console to check for dynamic content or hidden information
Integration with Workflow:
  • collection-strategist: Use browser to access and archive sources during collection
  • verification-expert: Use browser for reverse image search, geolocation verification, UGC checking
  • evidence-librarian: Use browser to verify citations and capture source snapshots
See OSINT_VERIFICATION.md for detailed browser-based verification techniques.
cursor-ide-browser MCP服务器是DeepResearch项目的高度推荐工具。它支持在Cursor IDE内直接实现浏览器自动化,让信息收集、验证和OSINT工作更高效、更具可追溯性。
核心使用场景:
  1. 信息收集(第3阶段):
    • 访问官方来源、企业网站、监管文件
    • 捕获带时间戳的截图,用于审计追踪
    • 从网页提取结构化数据
    • 在网页变更或消失前存档
  2. OSINT验证(第4阶段):
    • 在多平台进行反向图片搜索(Google Images、TinEye、Yandex)
    • 验证社交媒体帖子和UGC的真实性
    • 利用地图服务核查地理位置
    • 捕获验证证据(截图、页面快照)
  3. 来源存档:
    • 对关键来源进行全页截图
    • 捕获带可访问性树的页面快照,用于后续分析
    • 记录访问时的页面状态(确保可复现)
  4. 跨平台验证:
    • 在多个来源间导航,验证信息一致性
    • 核查同一内容的多语言版本
    • 在不同平台(官网、新闻、社交媒体)交叉验证
最佳实践:
  • 访问来源时始终捕获截图/快照(用于审计追踪)
  • 通过浏览器导航验证链接是否仍有效
  • 存档前先拍摄快照(完整捕获页面状态)
  • 使用浏览器控制台检查动态内容或隐藏信息
与工作流的集成:
  • collection-strategist:在收集阶段使用浏览器访问并存档来源
  • verification-expert:使用浏览器进行反向图片搜索、地理位置验证、UGC核查
  • evidence-librarian:使用浏览器验证引用并捕获来源快照
有关基于浏览器的验证技术详情,请参阅OSINT_VERIFICATION.md

Research Stages

研究阶段

Stage A: Task Contract (0→1)

阶段A:任务契约(0→1)

Deliverables:
  • Task Contract: Goal, audience, time window, scope, non-goals, deliverable format
  • KIQs (Key Intelligence Questions): 3-7 must-answer questions
  • Success criteria: Definition of "good enough" and "unobtainable"
Quality Gate 1: Research questions are answerable, falsifiable, with clear time windows. Non-goals are explicit. KIQs ≤ 7 and actionable.
交付物:
  • 任务契约:目标、受众、时间窗口、范围、非目标、交付物格式
  • KIQs(关键情报问题):3-7个必须解答的问题
  • 成功标准:定义“足够好”和“无法实现”的边界
质量门控1:研究问题需可解答、可证伪,并具备明确的时间窗口。非目标需清晰明确。KIQs数量≤7且具备可操作性。

Stage B: Decomposition & Planning (1→Plan)

阶段B:分解与规划(1→计划)

Deliverables:
  • Issue tree / Hypothesis set (MECE decomposition + initial hypotheses)
  • Collection plan: Source map, retrieval routes, priorities, verification strategy
  • Risk log: Data gaps, timeliness, compliance boundaries, conflicting evidence expectations
Quality Gate 2: Issue tree is MECE. Each sub-question has evidence requirements and source routes. Cross-validation design exists (at least two complementary source types).
交付物:
  • 问题树/假设集(MECE分解+初始假设)
  • 收集计划:来源地图、检索路径、优先级、验证策略
  • 风险日志:数据缺口、时效性、合规边界、预期冲突证据
质量门控2:问题树需符合MECE原则。每个子问题都有证据要求和来源路径。需设计交叉验证方案(至少两种互补来源类型)。

Stage C: Collection & Registration (Plan→Evidence)

阶段C:收集与登记(计划→证据)

Deliverables:
  • Source Register: Type, time, reliability, bias risk, usable scope for each source
  • Evidence Table: Claim→Evidence→Strength→Conflict→Notes
  • Collection log: Query strings, timestamps, exclusion reasons (for reproducibility and audit)
Quality Gate 3: Key claims coverage reaches threshold (e.g., 70% of core claims have usable evidence). Evidence is from traceable sources with timestamps/versions. Conflicts are explicitly recorded.
交付物:
  • 来源登记册:每个来源的类型、时间、可靠性、偏差风险、可用范围
  • 证据表:主张→证据→强度→冲突→备注
  • 收集日志:查询字符串、时间戳、排除原因(用于可复现性和审计)
质量门控3:核心主张的覆盖度达到阈值(例如70%的核心主张具备可用证据)。证据来自可追溯的来源,带有时间戳/版本信息。冲突需被明确记录。

Stage D: Analysis & Convergence (Evidence→Judgment)

阶段D:分析与收敛(证据→判断)

Deliverables:
  • Structured analysis workbook: ACH/Key Assumptions Check/Red Team (at least 1-2 techniques)
  • Key Judgments (3-7): Each with likelihood + confidence + evidence anchors
  • Alternative explanations and flip conditions: What would change my mind / signposts
Quality Gate 4: At least 1 alternative explanation exists and is evaluated. Key assumptions are explicit. Most vulnerable assumption is identified. Confidence matches evidence strength.
交付物:
  • 结构化分析工作簿:ACH/关键假设检查/红队法(至少1-2种技术)
  • 关键判断(3-7条):每条都包含可能性+置信度+证据锚点
  • 替代解释和翻转条件:哪些因素会改变结论/关键信号
质量门控4:至少存在1种替代解释并已被评估。关键假设需明确。已识别最脆弱的假设。置信度与证据强度匹配。

Stage E: Delivery & Review (Judgment→Product)

阶段E:交付与评审(判断→成果)

Deliverables:
  • Deliverable (brief/memo/table/appendices package)
  • QA checklist record: Fact-checking, traceable citations, consistent uncertainty expression
  • Follow-up actions: Gap list, next collection suggestions, monitoring indicators
Quality Gate 5: Key Judgments are conclusion-first, clear language, audience-appropriate. Each judgment has traceable citations. Uncertainty is expressed consistently. Inference is distinguished from fact.
交付物:
  • 交付物(简报/备忘录/表格/附录包)
  • QA检查表记录:事实核查、可追溯引用、一致性不确定性表达
  • 后续行动:缺口清单、下一步收集建议、监控指标
质量门控5:关键判断采用结论先行的方式,语言清晰,符合受众需求。每条判断都有可追溯的引用。不确定性表达一致。推理与事实明确区分。

Quick Start Workflow

快速开始工作流

⚠️ MANDATORY FIRST STEP: Before starting any DeepResearch project, create
AGENTS.md
in project root using
@cursor-agents-md
. This file defines project-specific instructions that all research work must follow.
  1. Create AGENTS.md (MANDATORY):
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
    • Must include project-specific research guidelines
    • Must include reminder to read
      cursor-agents-md
      before updates
    • Must be in project root directory
  2. Create Task Contract: Define research question, KIQs, scope, and success criteria
  3. Build Issue Tree: MECE decomposition with initial hypotheses
  4. Develop Collection Plan: Source map with cross-validation strategy
  5. Collect & Register: Build Source Register and Evidence Table as you collect
  6. Analyze: Apply structured analytic techniques (see STRUCTURED_ANALYSIS.md)
  7. Synthesize: Generate Key Judgments with likelihood and confidence
  8. Deliver: Create deliverable following REPORT_TEMPLATE.md and run QA_CHECKLIST.md
⚠️ 强制第一步:在启动任何DeepResearch项目前,使用
@cursor-agents-md
在项目根目录创建
AGENTS.md
文件。该文件定义了所有研究工作必须遵循的项目特定指令。
  1. 创建AGENTS.md(强制要求):
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
    • 必须包含项目特定的研究指南
    • 必须包含更新前阅读
      cursor-agents-md
      的提醒
    • 必须位于项目根目录
  2. 创建任务契约:定义研究问题、KIQs、范围和成功标准
  3. 构建问题树:基于MECE分解的初始假设
  4. 制定收集计划:带有交叉验证策略的来源地图
  5. 收集与登记:在收集过程中构建来源登记册和证据表
  6. 分析:应用结构化分析技术(参阅STRUCTURED_ANALYSIS.md
  7. 综合:生成带有可能性和置信度的关键判断
  8. 交付:遵循REPORT_TEMPLATE.md创建交付物,并运行QA_CHECKLIST.md

Evidence-Chain Production Line (12-Stage Workflow)

证据链生产线(12阶段工作流)

The complete workflow from problem to usable conclusions, with subagent assignments:
从问题到可用结论的完整工作流,包含子代理分配:

Stage 0: Task Contract Definition

阶段0:任务契约定义

Subagent:
research-lead
InputActionsOutputGate
Decision-maker's vague questionDefine decision goal, boundaries, time window, success criteriaTask Contract v1Does "falsifiable judgment" exist?
⚠️ MANDATORY PREREQUISITE: Before starting Stage 0, ensure
AGENTS.md
exists in project root. If not, create it using
@cursor-agents-md
.
Checklist:
  • AGENTS.md exists (created using
    @cursor-agents-md
    )
  • Research question is falsifiable and testable
  • Non-goals are explicit
  • Time window defined
  • Success criteria clear
  • Deliverable format specified
Handoff to:
methodologist
(for KIQ decomposition)

子代理
research-lead
输入操作输出门控
决策者的模糊问题定义决策目标、边界、时间窗口、成功标准任务契约v1是否存在“可证伪的判断”?
⚠️ 强制前提:启动阶段0前,确保项目根目录存在
AGENTS.md
。若不存在,使用
@cursor-agents-md
创建。
检查表:
  • AGENTS.md已存在(使用
    @cursor-agents-md
    创建)
  • 研究问题可证伪、可测试
  • 非目标明确
  • 时间窗口已定义
  • 成功标准清晰
  • 交付物格式已指定
交接至
methodologist
(用于KIQ分解)

Stage 1: KIQ & Claim Draft

阶段1:KIQ与主张草拟

Subagents:
research-lead
+
methodologist
InputActionsOutputGate
Task ContractBreak into KIQs; form initial Key ClaimsClaim Tree v1Do "falsifiable" claims exist?
Collaboration:
  • research-lead
    : Defines KIQs and priority
  • methodologist
    : Ensures claims are falsifiable, suggests hypothesis structure
Handoff to:
collection-strategist
(for source map design)
Checklist:
  • KIQs ≤ 7
  • Each KIQ has corresponding claims
  • Claims are falsifiable
  • Priority order established

子代理
research-lead
+
methodologist
输入操作输出门控
任务契约分解为KIQs;形成初始关键主张主张树v1是否存在“可证伪的”主张?
协作:
  • research-lead
    :定义KIQs和优先级
  • methodologist
    :确保主张可证伪,建议假设结构
交接至
collection-strategist
(用于来源地图设计)
检查表:
  • KIQs数量≤7
  • 每个KIQ都有对应的主张
  • 主张可证伪
  • 已确定优先级顺序

Stage 2: Source Map Design

阶段2:来源地图设计

Subagent:
collection-strategist
InputActionsOutputGate
Claim TreeDesign source type combinations for each claimSource MapEach claim has ≥2 independent source types?
Handoff to: Collectors (Stage 3) +
verification-expert
(for verification strategy)
Checklist:
  • Each claim mapped to ≥2 source types
  • Cross-validation design exists
  • Retrieval routes specified
  • Archiving strategy defined
  • Language variants considered

子代理
collection-strategist
输入操作输出门控
主张树为每个主张设计来源类型组合来源地图每个主张是否对应≥2种独立来源类型?
交接至:收集人员(阶段3) +
verification-expert
(用于验证策略)
检查表:
  • 每个主张映射到≥2种来源类型
  • 存在交叉验证设计
  • 已指定检索路径
  • 已定义存档策略
  • 已考虑语言变体

Stage 3: Collection & Archiving

阶段3:收集与存档

Subagents:
collection-strategist
+ Collectors
InputActionsOutputGate
Source MapSearch, download, archive, tagEvidence PoolIs it auditable? Are original copies available?
Collaboration:
  • collection-strategist
    : Monitors coverage, adjusts strategy
  • Collectors: Execute retrieval, archive materials
Recommended Tool: Use cursor-ide-browser to:
  • Navigate to sources and capture screenshots/snapshots immediately
  • Archive web pages before they change or disappear
  • Verify links are still active
  • Extract structured data from web pages
Handoff to:
verification-expert
(Stage 4)
Checklist:
  • Original sources archived
  • Screenshots/snapshots captured (browser evidence)
  • Timestamps recorded
  • Archive links created
  • Metadata captured
  • Coverage threshold met (e.g., 70% of core claims)

子代理
collection-strategist
+ 收集人员
输入操作输出门控
来源地图搜索、下载、存档、标记证据池是否可审计?是否有原始副本?
协作:
  • collection-strategist
    :监控覆盖度,调整策略
  • 收集人员:执行检索,存档材料
推荐工具:使用cursor-ide-browser来:
  • 导航至来源并立即捕获截图/快照
  • 在网页变更或消失前存档
  • 验证链接是否仍有效
  • 从网页提取结构化数据
交接至
verification-expert
(阶段4)
检查表:
  • 原始来源已存档
  • 已捕获截图/快照(浏览器证据)
  • 已记录时间戳
  • 已创建存档链接
  • 已捕获元数据
  • 覆盖度达到阈值(例如70%的核心主张)

Stage 4: OSINT Verification

阶段4:OSINT验证

Subagent:
verification-expert
InputActionsOutputGate
Evidence PoolDeception removal, time-geography consistency checkVerified EvidenceDoes it pass "falsify first" test?
Recommended Tool: Use cursor-ide-browser to:
  • Perform reverse image/video search across multiple platforms
  • Verify geolocation using map services (Google Maps/Earth)
  • Check social media accounts and capture snapshots
  • Verify chronolocation using weather/timezone services
  • Capture verification evidence (screenshots, snapshots) at each step
Handoff to:
evidence-librarian
(Stage 5)
Checklist:
  • UGC verified (source, time, location, originality)
  • Images/videos geolocated (if applicable)
  • Chronolocation verified
  • Consistency checks passed
  • Verification log complete
  • Browser-captured evidence included (screenshots, snapshots)

子代理
verification-expert
输入操作输出门控
证据池去除误导信息,检查时间-地理一致性已验证证据是否通过“先证伪”测试?
推荐工具:使用cursor-ide-browser来:
  • 在多平台执行反向图片/视频搜索
  • 使用地图服务(Google Maps/Earth)验证地理位置
  • 检查社交媒体账户并捕获快照
  • 使用天气/时区服务验证时间定位
  • 在每个步骤捕获验证证据(截图、快照)
交接至
evidence-librarian
(阶段5)
检查表:
  • UGC已验证(来源、时间、位置、原创性)
  • 图片/视频已定位(如适用)
  • 时间定位已验证
  • 一致性检查通过
  • 验证日志完整
  • 包含浏览器捕获的证据(截图、快照)

Stage 5: Evidence Registration

阶段5:证据登记

Subagent:
evidence-librarian
InputActionsOutputGate
Verified EvidenceBuild Claim–Evidence TableEvidence RegisterDoes each key judgment have evidence?
Handoff to:
methodologist
(Stage 6) +
quant-analyst
(for data consistency)
Checklist:
  • Source Register complete
  • Evidence Table built
  • Each claim linked to evidence
  • Conflicts explicitly recorded
  • Citations traceable

子代理
evidence-librarian
输入操作输出门控
已验证证据构建主张-证据表证据登记册每个关键判断是否有对应证据?
交接至
methodologist
(阶段6) +
quant-analyst
(用于数据一致性)
检查表:
  • 来源登记册完整
  • 证据表已构建
  • 每个主张都链接到证据
  • 冲突已明确记录
  • 引用可追溯

Stage 6: Structured Reasoning

阶段6:结构化推理

Subagent:
methodologist
InputActionsOutputGate
Evidence RegisterACH, hypothesis competition, discriminating evidenceHypothesis MatrixDo competing worlds exist?
Collaboration:
  • May consult
    domain-expert
    for mechanism plausibility
  • May consult
    quant-analyst
    for data consistency
Handoff to:
quant-analyst
(Stage 7) +
devils-advocate
(Stage 8)
Checklist:
  • ACH matrix complete
  • At least 2 competing hypotheses
  • Discriminating evidence identified
  • Key assumptions checked
  • Alternative explanations evaluated

子代理
methodologist
输入操作输出门控
证据登记册ACH、假设竞争、区分性证据假设矩阵是否存在竞争性假设?
协作:
  • 可咨询
    domain-expert
    确认机制合理性
  • 可咨询
    quant-analyst
    检查数据一致性
交接至
quant-analyst
(阶段7) +
devils-advocate
(阶段8)
检查表:
  • ACH矩阵完整
  • 至少存在2种竞争性假设
  • 已识别区分性证据
  • 已检查关键假设
  • 已评估替代解释

Stage 7: Data Consistency & Sensitivity

阶段7:数据一致性与敏感性

Subagent:
quant-analyst
InputActionsOutputGate
Hypothesis MatrixMetric unification, sensitivity analysisConsistency PackAre conclusions sensitive to assumptions?
Handoff to:
devils-advocate
(Stage 8) +
domain-expert
(Stage 9)
Checklist:
  • Metrics unified
  • Consistency checks passed
  • Sensitivity analysis complete
  • Error ranges specified
  • Timeline closed (if applicable)

子代理
quant-analyst
输入操作输出门控
假设矩阵指标统一、敏感性分析一致性包结论是否对假设敏感?
交接至
devils-advocate
(阶段8) +
domain-expert
(阶段9)
检查表:
  • 指标已统一
  • 一致性检查通过
  • 敏感性分析完整
  • 已指定误差范围
  • 时间线已闭合(如适用)

Stage 8: Counter-World Attack

阶段8:反事实攻击

Subagent:
devils-advocate
InputActionsOutputGate
Consistency PackConstruct counter-worlds, Kill PointsAdversarial ReviewDo single-point failures exist?
Key Activities:
  1. Construct 2-4 counterfactual worlds
  2. Identify Kill Points (evidence that if falsified, conclusion fails)
  3. Create Fragility Map (which judgments sensitive to which assumptions)
  4. Write Adversarial Review Memo
  5. Design Decision-Failure Simulation
Checklist:
  • At least 2 counterfactual worlds
  • Kill points identified
  • Fragility map complete
  • Adversarial review challenges main conclusion
  • Failure scenarios designed
Handoff to:
domain-expert
(Stage 9) +
methodologist
(if re-analysis needed)

子代理
devils-advocate
输入操作输出门控
一致性包构建反事实场景、致命点对抗性评审是否存在单点故障?
核心活动:
  1. 构建2-4种反事实场景
  2. 识别致命点(若该证据被证伪,结论将失效)
  3. 创建脆弱性地图(哪些判断对哪些假设敏感)
  4. 撰写对抗性评审备忘录
  5. 设计决策失败模拟
检查表:
  • 至少存在2种反事实场景
  • 已识别致命点
  • 脆弱性地图完整
  • 对抗性评审挑战主结论
  • 已设计失败场景
交接至
domain-expert
(阶段9) +
methodologist
(如需重新分析)

Stage 9: Domain Mechanism Validation

阶段9:领域机制验证

Subagent:
domain-expert
InputActionsOutputGate
Adversarial ReviewMechanism plausibility checkMechanism MemoDoes it violate industry common sense?
Handoff to:
editor
(Stage 10)
Checklist:
  • Mechanisms are plausible
  • Industry patterns respected
  • Anomalies flagged
  • Context provided
  • Common sense boundaries checked

子代理
domain-expert
输入操作输出门控
对抗性评审检查机制合理性机制备忘录是否违反行业常识?
交接至
editor
(阶段10)
检查表:
  • 机制合理
  • 符合行业模式
  • 异常已标记
  • 已提供上下文
  • 已检查常识边界

Stage 10: Conclusion Packaging

阶段10:结论包装

Subagent:
editor
InputActionsOutputGate
Mechanism MemoPyramid structure, risk gradingDraft ReportCan conclusions be grasped in 3 minutes?
Key Activities:
  1. Structure: Conclusion-first pyramid
  2. Language: Precise uncertainty expression
  3. Format: Scannable, forwardable
  4. Risk narrative: Clear and actionable
Handoff to:
qa-gatekeeper
(Stage 11)
Checklist:
  • Conclusion-first structure
  • Key Judgments clear
  • Uncertainty expressed consistently
  • 3-minute grasp test passed
  • Evidence anchors present

子代理
editor
输入操作输出门控
机制备忘录金字塔结构、风险分级报告草稿能否在3分钟内掌握结论?
核心活动:
  1. 结构:结论先行的金字塔结构
  2. 语言:精准的不确定性表达
  3. 格式:易扫描、可转发
  4. 风险叙事:清晰且可操作
交接至
qa-gatekeeper
(阶段11)
检查表:
  • 结论先行结构
  • 关键判断清晰
  • 不确定性表达一致
  • 通过3分钟掌握测试
  • 存在证据锚点

Stage 11: QA Gate

阶段11:QA门控

Subagent:
qa-gatekeeper
InputActionsOutputGate
Draft ReportMethod audit, compliance checkGo / No-GoIs release permitted?
Key Activities:
  1. Tradecraft QA (all 8 dimensions from rubric)
  2. Compliance check (ethics, privacy, permissions)
  3. Risk assessment
  4. Final Go/No-Go decision
Checklist:
  • All quality gates passed
  • Compliance verified
  • Ethics boundaries respected
  • Risk acceptable
  • Ready for delivery
If No-Go: Return to appropriate stage with specific feedback
子代理
qa-gatekeeper
输入操作输出门控
报告草稿方法论审计、合规检查通过/不通过是否允许发布?
核心活动:
  1. 行业标准QA(评估表的8个维度)
  2. 合规检查(伦理、隐私、权限)
  3. 风险评估
  4. 最终通过/不通过决策
检查表:
  • 所有质量门控已通过
  • 合规性已验证
  • 伦理边界已遵守
  • 风险可接受
  • 已准备好交付
若不通过:返回对应阶段并提供具体反馈

Key Resources

核心资源

Templates & Checklists

模板与检查表

  • PROJECT_MANAGEMENT.md: Complete project management template with all stages, gates, and tracking tables
  • REPORT_TEMPLATE.md: Standard report template with all required sections
  • QA_CHECKLIST.md: Pre-delivery quality assurance checklist
  • PROJECT_MANAGEMENT.md:包含所有阶段、门控和追踪表格的完整项目管理模板
  • REPORT_TEMPLATE.md:包含所有必填章节的标准报告模板
  • QA_CHECKLIST.md:交付前的质量保证检查表

Methodology Guides

方法论指南

  • STRUCTURED_ANALYSIS.md: Structured analytic techniques (ACH, Key Assumptions Check, Red Team, etc.)
  • OSINT_VERIFICATION.md: OSINT verification techniques for UGC, images, videos, geolocation
  • UNCERTAINTY_EXPRESSION.md: How to express likelihood, confidence, and distinguish facts/judgments/speculation
  • STRUCTURED_ANALYSIS.md:结构化分析技术(ACH、关键假设检查、红队法等)
  • OSINT_VERIFICATION.md:针对UGC、图片、视频、地理位置的OSINT验证技术
  • UNCERTAINTY_EXPRESSION.md:如何表达可能性、置信度,以及区分事实/判断/推测

Quality Standards

质量标准

  • RUBRIC.md: Evaluation rubric with 8 dimensions and scoring criteria
  • ETHICS_GUARDRAILS.md: Compliance and ethics boundaries
  • RUBRIC.md:包含8个维度和评分标准的评估表
  • ETHICS_GUARDRAILS.md:合规与伦理边界

Team & Subagents System

团队与子代理系统

All subagent system documentation is integrated into this SKILL.md file. See sections:
  • Subagents System Setup: Initialization and setup instructions
  • Evidence-Chain Production Line: Complete 12-stage workflow with subagent assignments
  • Weekly Research Rituals: Fixed weekly ceremonies
  • Subagent Role Definitions: Detailed role descriptions for all 10 subagents
所有子代理系统文档已集成到本SKILL.md文件中,请参阅以下章节:
  • 子代理系统设置:初始化和设置说明
  • 证据链生产线:包含子代理分配的完整12阶段工作流
  • 每周研究仪式:固定的每周流程
  • 子代理角色定义:所有10个子代理的详细角色描述

Evidence & Citation Standards

证据与引用标准

Source Register Minimum Fields

来源登记册必填字段

  • Type (official/company/media/academic/UGC/database)
  • Provenance (who produced, when, version)
  • Access path (how obtained/paid/scraped)
  • Bias risks (stakeholder interests, propaganda tendency, method limitations)
  • Reliability (High/Medium/Low + rationale)
  • Use limits (what it can prove)
  • 类型(官方/企业/媒体/学术/UGC/数据库)
  • 来源(制作者、时间、版本)
  • 获取路径(如何获取/付费/抓取)
  • 偏差风险(利益相关方、宣传倾向、方法局限性)
  • 可靠性(高/中/低 + 理由)
  • 使用限制(可证明的内容)

Evidence Table Structure

证据表结构

  • Claim (falsifiable assertion)
  • Evidence (citation + summary)
  • Supports/Contradicts (which hypothesis)
  • Strength (Strong/Medium/Weak: based on method and independence)
  • Alternative explanations
  • Notes (gaps, next verification steps)
  • 主张(可证伪的断言)
  • 证据(引用 + 摘要)
  • 支持/反驳(对应假设)
  • 强度(强/中/弱:基于方法和独立性)
  • 替代解释
  • 备注(缺口、下一步验证步骤)

Citation Requirements

引用要求

  • Each Key Judgment: At least 2 independent sources (or 1 primary authoritative + explanation why sufficient)
  • Each key number/timeline node: Must be traceable to original source or clear derivation chain
  • Conflicting evidence: Must be explicitly presented with explanation of choice and remaining uncertainty
  • 每个关键判断:至少2种独立来源(或1种权威主来源 + 充分性解释)
  • 每个关键数据/时间节点:必须可追溯至原始来源或清晰的推导链
  • 冲突证据:必须明确呈现,并解释选择理由和剩余不确定性

Structured Analytic Techniques

结构化分析技术

Select technique based on problem type:
  • Causal attribution / Who did it → ACH + prioritize disconfirming evidence
  • Future prediction / Risk → Scenario planning + indicator framework
  • Strong consensus → Devil's Advocacy / Team A-Team B
  • Unstable key premises → Key Assumptions Check
  • Adversary intent/behavior → Red Team (avoid mirror imaging)
See STRUCTURED_ANALYSIS.md for detailed procedures.
根据问题类型选择技术:
  • 因果归因 / 谁是责任人 → ACH + 优先考虑反证证据
  • 未来预测 / 风险 → 场景规划 + 指标框架
  • 强共识场景 → 魔鬼代言人 / A队-B队法
  • 不稳定核心前提 → 关键假设检查
  • 对手意图/行为 → 红队法(避免镜像思维)
有关详细流程,请参阅STRUCTURED_ANALYSIS.md

Uncertainty Expression

不确定性表达

Three Categories (Kent)

三类信息(Kent模型)

  • Fact: Observable, verifiable with high certainty
  • Judgment/Estimate: Evidence sufficient but still probabilistic
  • Inference/Speculation: Limited evidence, more logic-based
  • 事实:可观察、高确定性验证的信息
  • 判断/估算:有充分证据但仍具概率性的结论
  • 推理/推测:证据有限、更多基于逻辑的推断

Probability Words (5-tier)

概率词汇(5级)

  • Almost impossible
  • Unlikely
  • Possible
  • Likely
  • Almost certain
  • 几乎不可能
  • 不太可能
  • 可能
  • 很可能
  • 几乎肯定

Confidence Levels

置信度等级

  • High: Multiple independent sources, consistent, high-quality methods
  • Medium: Some independent verification, partial consistency
  • Low: Single source, high uncertainty, limited verification
Each Key Judgment must include: Likelihood (probability word) + Confidence (High/Medium/Low) + Why (evidence and method rationale)
See UNCERTAINTY_EXPRESSION.md for detailed guidance.
  • :多独立来源,一致且方法高质量
  • :部分独立验证,部分一致
  • :单一来源,高不确定性,验证有限
每个关键判断必须包含:可能性(概率词汇) + 置信度(高/中/低) + 理由(证据和方法)
有关详细指南,请参阅UNCERTAINTY_EXPRESSION.md

Common Failure Modes

常见失败模式

Scope Creep

范围蔓延

Symptom: Delivery date approaching but questions multiplying, conclusions becoming vague Solution: Force return to Task Contract. New questions must answer:
  1. Will not doing it affect the decision?
  2. Is there an evidence path? If unobtainable, move to "future work", not current scope
症状:交付日期临近但问题增多,结论模糊 解决方案:强制回归任务契约。新问题必须回答:
  1. 不做这项研究是否会影响决策?
  2. 是否有证据路径?若无法获取,移至“未来工作”,而非当前范围

Last-Minute Citation & Verification

最后一刻的引用与验证

Symptom: Report finished but evidence doesn't match/links broken/inconsistent metrics Solution: Front-load citation and evidence registration:
  • Register sources during collection (Source Register)
  • Pull evidence from Evidence Table when writing conclusions (not from memory)
症状:报告完成但证据不匹配/链接失效/指标不一致 解决方案:前置引用和证据登记:
  • 收集过程中登记来源(来源登记册)
  • 撰写结论时从证据表提取证据(而非凭记忆)

Project Rhythm (1-2 week research)

项目节奏(1-2周研究)

  • Day 1: Task Contract + Issue tree + Collection plan (Gate 1/2)
  • Day 2-4: Collection + registration + initial Evidence Table (daily Gate 3)
  • Day 5: Structured analysis (ACH/Key Assumptions Check) + initial Key Judgments (Gate 4)
  • Day 6: Fill gaps, handle conflicts, update confidence
  • Day 7: Deliverable writing + Red Team + QA (Gate 5)
  • 第1天:任务契约 + 问题树 + 收集计划(门控1/2)
  • 第2-4天:收集 + 登记 + 初始证据表(每日门控3)
  • 第5天:结构化分析(ACH/关键假设检查) + 初始关键判断(门控4)
  • 第6天:填补缺口、处理冲突、更新置信度
  • 第7天:交付物撰写 + 红队法 + QA(门控5)

Daily Standup (10 minutes)

每日站会(10分钟)

  • What new "usable evidence" was added yesterday (not "what was read")
  • What hypothesis/gap will be verified today
  • Blockers: Can't get data? Conflicting evidence? Scope change?
  • 昨日新增了哪些“可用证据”(而非“读了什么”)
  • 今日将验证哪些假设/缺口
  • 障碍:无法获取数据?冲突证据?范围变更?

Tradecraft Review (every 2-3 days)

行业标准评审(每2-3天)

  • Are claims covered? Are conflicts recorded?
  • Is there a tendency toward "feeling-based convergence"?
  • Does collection strategy need adjustment (change source types, languages, timeline)?
  • 主张是否被覆盖?冲突是否被记录?
  • 是否存在“基于感觉的收敛”倾向?
  • 收集策略是否需要调整(变更来源类型、语言、时间线)?

Red Team / Devil's Advocate (24 hours before delivery)

红队法 / 魔鬼代言人(交付前24小时)

  • What is the most vulnerable point of your conclusion?
  • Which evidence is weakest? What if it's wrong?
  • Have you "missed alternative explanations"?
  • 结论最脆弱的点是什么?
  • 哪项证据最弱?若它是错的会怎样?
  • 是否“遗漏了替代解释”?

Weekly Research Rituals

每周研究仪式

Fixed weekly ceremonies that support the evidence-chain production line:
支持证据链生产线的固定每周流程:

Ritual Overview

仪式概述

RitualFrequencyDurationParticipantsPurpose
Claim ReviewWeekly30 minresearch-lead, methodologistPrevent scope drift
Kill Point ReviewWeekly30 mindevils-advocate, research-leadIdentify single-point failures
Conflict Evidence Stand-upWeekly15 minevidence-librarian, allMake conflicting evidence explicit
Devil's DayBi-weekly2 hoursdevils-advocate, allCounter-world attack session
QA Pre-GateBefore delivery1 hourqa-gatekeeper, editorPre-release failure simulation

仪式频率时长参与者目的
主张评审每周30分钟research-lead, methodologist防止范围蔓延
致命点评审每周30分钟devils-advocate, research-lead识别单点故障
冲突证据站会每周15分钟evidence-librarian, 全体明确呈现冲突证据
魔鬼日每两周2小时devils-advocate, 全体反事实攻击会议
QA预门控交付前1小时qa-gatekeeper, editor发布前失败模拟

1. Claim Review (Monday, 30 min)

1. 主张评审(周一,30分钟)

Participants:
research-lead
(facilitator),
methodologist

Optional:
collection-strategist
,
domain-expert
Agenda:
  1. Review Current Claims (10 min)
    • List all active claims from Claim Tree
    • Check: Are they still falsifiable?
    • Check: Do they still answer KIQs?
  2. Scope Check (10 min)
    • Compare claims to Task Contract
    • Identify scope drift
    • Decide: Keep, modify, or remove claims
  3. Priority Update (10 min)
    • Re-rank claims by decision impact
    • Identify which claims need evidence first
    • Update collection priorities
Outputs:
  • Updated Claim Tree
  • Priority Matrix
  • Scope Change Log (if any)
Success Criteria:
  • All claims traceable to KIQs
  • No scope drift beyond Task Contract
  • Priorities reflect decision needs

参与者
research-lead
(主持人),
methodologist

可选
collection-strategist
,
domain-expert
议程:
  1. 评审当前主张(10分钟)
    • 列出主张树中的所有活跃主张
    • 检查:是否仍可证伪?
    • 检查:是否仍解答KIQs?
  2. 范围检查(10分钟)
    • 将主张与任务契约对比
    • 识别范围蔓延
    • 决定:保留、修改或移除主张
  3. 优先级更新(10分钟)
    • 根据决策影响重新排序主张
    • 识别哪些主张需优先获取证据
    • 更新收集优先级
输出:
  • 更新后的主张树
  • 优先级矩阵
  • 范围变更日志(如有)
成功标准:
  • 所有主张可追溯至KIQs
  • 无超出任务契约的范围蔓延
  • 优先级反映决策需求

2. Kill Point Review (Wednesday, 30 min)

2. 致命点评审(周三,30分钟)

Participants:
devils-advocate
(facilitator),
research-lead
,
methodologist
,
evidence-librarian
Agenda:
  1. Identify Kill Points (15 min)
    • Review current Key Judgments
    • For each judgment: What evidence, if falsified, would kill it?
    • List all Kill Points
  2. Assess Fragility (10 min)
    • Which judgments have single-point failures?
    • Which assumptions are most vulnerable?
    • Create Fragility Map
  3. Action Plan (5 min)
    • Which Kill Points need additional evidence?
    • Which need re-verification?
    • Assign follow-up tasks
Outputs:
  • Kill-Point List (updated)
  • Fragility Map
  • Action Items
Success Criteria:
  • All Key Judgments have identified Kill Points
  • Single-point failures flagged
  • Action plan for strengthening weak points

参与者
devils-advocate
(主持人),
research-lead
,
methodologist
,
evidence-librarian
议程:
  1. 识别致命点(15分钟)
    • 评审当前关键判断
    • 每个判断:若哪项证据被证伪,结论会失效?
    • 列出所有致命点
  2. 评估脆弱性(10分钟)
    • 哪些判断存在单点故障?
    • 哪些假设最脆弱?
    • 创建脆弱性地图
  3. 行动计划(5分钟)
    • 哪些致命点需要补充证据?
    • 哪些需要重新验证?
    • 分配后续任务
输出:
  • 更新后的致命点列表
  • 脆弱性地图
  • 行动项
成功标准:
  • 所有关键判断都已识别致命点
  • 单点故障已标记
  • 已制定强化弱点的行动计划

3. Conflict Evidence Stand-up (Friday, 15 min)

3. 冲突证据站会(周五,15分钟)

Participants:
evidence-librarian
(facilitator), all subagents (brief check-in)
Agenda:
  1. New Conflicts (5 min)
    • evidence-librarian
      reports new conflicting evidence
    • Brief description: What conflicts, why
  2. Status Update (5 min)
    • Each subagent: Any conflicts discovered in their work?
    • Quick round: "I found X conflicting with Y"
  3. Next Steps (5 min)
    • Which conflicts need investigation?
    • Assign to appropriate subagent
    • Update Conflict Evidence Log
Outputs:
  • Updated Conflict Evidence Log
  • Action Items for conflict resolution
Success Criteria:
  • All conflicts explicitly recorded
  • No conflicts "hidden" or ignored
  • Action plan for each conflict

参与者
evidence-librarian
(主持人), 所有子代理(简短签到)
议程:
  1. 新冲突(5分钟)
    • evidence-librarian
      报告新的冲突证据
    • 简要描述:冲突内容、原因
  2. 状态更新(5分钟)
    • 每个子代理:工作中是否发现冲突?
    • 快速轮询:“我发现X与Y冲突”
  3. 下一步计划(5分钟)
    • 哪些冲突需要调查?
    • 分配给对应子代理
    • 更新冲突证据日志
输出:
  • 更新后的冲突证据日志
  • 冲突解决行动项
成功标准:
  • 所有冲突已明确记录
  • 无冲突被“隐藏”或忽略
  • 每个冲突都有行动计划

4. Devil's Day (Bi-weekly, 2 hours)

4. 魔鬼日(每两周,2小时)

Participants:
devils-advocate
(facilitator), all subagents (full participation)
Agenda:
  1. Current State Review (20 min)
    • research-lead
      : Current Key Judgments
    • methodologist
      : Current hypothesis status
    • evidence-librarian
      : Evidence summary
  2. Counter-World Construction (40 min)
    • devils-advocate
      presents 2-3 alternative explanations
    • Group discussion: Can these worlds explain the evidence?
    • Identify what evidence would distinguish worlds
  3. Adversarial Attack (40 min)
    • devils-advocate
      attacks evidence chain
    • Each subagent defends their work
    • Identify weaknesses and gaps
  4. Action Plan (20 min)
    • What needs re-verification?
    • What evidence is missing?
    • Update Fragility Map and Kill Points
Outputs:
  • Adversarial Review Memo
  • Updated Kill-Point List
  • Re-verification Plan
  • Updated Fragility Map
Success Criteria:
  • At least 2 counterfactual worlds constructed
  • Evidence chain weaknesses identified
  • Action plan for strengthening conclusions
Rules:
  • devils-advocate
    must be systematic, not emotional
  • All participants must engage, not just listen
  • Focus on structure, not personalities

参与者
devils-advocate
(主持人), 所有子代理(全员参与)
议程:
  1. 当前状态评审(20分钟)
    • research-lead
      :当前关键判断
    • methodologist
      :当前假设状态
    • evidence-librarian
      :证据摘要
  2. 反事实场景构建(40分钟)
    • devils-advocate
      提出2-3种替代解释
    • 小组讨论:这些场景能否解释现有证据?
    • 识别区分场景的证据
  3. 对抗性攻击(40分钟)
    • devils-advocate
      攻击证据链
    • 每个子代理为自己的工作辩护
    • 识别弱点和缺口
  4. 行动计划(20分钟)
    • 哪些内容需要重新验证?
    • 缺少哪些证据?
    • 更新脆弱性地图和致命点
输出:
  • 对抗性评审备忘录
  • 更新后的致命点列表
  • 重新验证计划
  • 更新后的脆弱性地图
成功标准:
  • 至少构建2种反事实场景
  • 已识别证据链弱点
  • 已制定强化结论的行动计划
规则:
  • devils-advocate
    必须系统化,而非情绪化
  • 所有参与者必须参与,而非仅旁听
  • 聚焦结构,而非个人

5. QA Pre-Gate (Before delivery, 1 hour)

5. QA预门控(交付前,1小时)

Participants:
qa-gatekeeper
(facilitator),
editor
,
research-lead
, optional
devils-advocate
Agenda:
  1. Pre-Flight Check (15 min)
    • editor
      : Deliverable status
    • research-lead
      : Key Judgments summary
    • qa-gatekeeper
      : QA checklist preview
  2. Rubric Review (30 min)
    • Go through 8-dimension rubric
    • Score each dimension (1-5)
    • Identify any dimensions ≤2 (must fix)
  3. Failure Simulation (10 min)
    • qa-gatekeeper
      : "What if this is wrong?"
    • Identify worst-case scenarios
    • Check: Are risks acceptable?
  4. Go/No-Go Decision (5 min)
    • qa-gatekeeper
      : Final decision
    • If Go: Approval for delivery
    • If No-Go: Specific feedback and return stage
Outputs:
  • QA Report
  • Rubric Scores
  • Go/No-Go Decision
  • Action Items (if No-Go)
Success Criteria:
  • All dimensions ≥3 (ideally ≥4)
  • No critical issues
  • Risks acceptable
  • Ready for delivery

参与者
qa-gatekeeper
(主持人),
editor
,
research-lead
, 可选
devils-advocate
议程:
  1. 飞行前检查(15分钟)
    • editor
      :交付物状态
    • research-lead
      :关键判断摘要
    • qa-gatekeeper
      :QA检查表预览
  2. 评估表评审(30分钟)
    • 逐一检查8个维度的评估表
    • 为每个维度打分(1-5)
    • 识别得分≤2的维度(必须修复)
  3. 失败模拟(10分钟)
    • qa-gatekeeper
      :“若结论错误会怎样?”
    • 识别最坏场景
    • 检查:风险是否可接受?
  4. 通过/不通过决策(5分钟)
    • qa-gatekeeper
      :最终决策
    • 若通过:批准交付
    • 若不通过:提供具体反馈并返回对应阶段
输出:
  • QA报告
  • 评估表得分
  • 通过/不通过决策
  • 行动项(若不通过)
成功标准:
  • 所有维度得分≥3(理想≥4)
  • 无关键问题
  • 风险可接受
  • 已准备好交付

Ritual Integration with Workflow

仪式与工作流的集成

Weekly Schedule Example:
  • Monday: Morning: Claim Review (30 min); Rest of day: Normal workflow
  • Wednesday: Morning: Kill Point Review (30 min); Rest of day: Normal workflow
  • Friday: Morning: Conflict Evidence Stand-up (15 min); Afternoon (bi-weekly): Devil's Day (2 hours)
  • Before Delivery: 24-48 hours: QA Pre-Gate (1 hour)
Ritual Outputs Feed Workflow:
  • Claim Review → Updates Stage 1 (KIQ & Claim Draft)
  • Kill Point Review → Updates Stage 8 (Counter-World Attack)
  • Conflict Evidence Stand-up → Updates Stage 5 (Evidence Registration)
  • Devil's Day → May trigger return to Stage 6 (Structured Reasoning) or Stage 2 (Source Map)
  • QA Pre-Gate → Final check before Stage 11 (QA Gate)
Adapting Rituals:
  • Smaller teams: Combine roles, reduce frequency
  • Tighter timelines: Shorten durations, focus on critical rituals
  • Larger teams: Add sub-rituals, more detailed agendas
Common Pitfalls:
  1. Skipping rituals: "We don't have time" → Leads to scope drift, missed conflicts
  2. Rituals become formalities: No real engagement → No value
  3. Wrong participants: Missing key roles → Incomplete reviews
  4. No follow-up: Rituals identify issues but no action → Problems persist
Solution: Treat rituals as essential quality gates, not optional meetings.

每周日程示例:
  • 周一:上午:主张评审(30分钟);其余时间:正常工作流
  • 周三:上午:致命点评审(30分钟);其余时间:正常工作流
  • 周五:上午:冲突证据站会(15分钟);下午(每两周):魔鬼日(2小时)
  • 交付前:24-48小时:QA预门控(1小时)
仪式输出反馈工作流:
  • 主张评审 → 更新阶段1(KIQ与主张草拟)
  • 致命点评审 → 更新阶段8(反事实攻击)
  • 冲突证据站会 → 更新阶段5(证据登记)
  • 魔鬼日 → 可能触发返回阶段6(结构化推理)或阶段2(来源地图)
  • QA预门控 → 阶段11(QA门控)前的最终检查
仪式调整:
  • 小型团队:合并角色,降低频率
  • 紧张时间线:缩短时长,聚焦关键仪式
  • 大型团队:增加子仪式,细化议程
常见陷阱:
  1. 跳过仪式:“没时间” → 导致范围蔓延、遗漏冲突
  2. 仪式流于形式:无真正参与 → 无价值
  3. 错误参与者:缺少关键角色 → 评审不完整
  4. 无后续行动:仪式识别问题但无行动 → 问题持续
解决方案:将仪式视为核心质量门控,而非可选会议。

Subagents System Setup

子代理系统设置

Initializing Subagents in Your Workspace

在工作区初始化子代理

To use the DeepResearch subagents system, you need to copy the subagent definitions from the skill to your workspace:
Source:
@deepresearch/subagents/.cursor/agents/

Destination: Your workspace
.cursor/agents/
directory
要使用DeepResearch子代理系统,需将子代理定义从技能复制到你的工作区:
来源路径
@deepresearch/subagents/.cursor/agents/

目标路径:你的工作区
.cursor/agents/
目录

Setup Steps

设置步骤

⚠️ MANDATORY FIRST STEP: Before initializing subagents, you MUST create an
AGENTS.md
file in your project root using the
cursor-agents-md
skill. This file defines project-specific instructions for all DeepResearch work.
  1. Create AGENTS.md (MANDATORY):
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
    The AGENTS.md file must include:
    • Project-specific research guidelines
    • Code style and documentation standards (if applicable)
    • File structure and organization rules
    • Boundaries and constraints
    • Important: Add a reminder to read
      cursor-agents-md
      skill before updating AGENTS.md
    Required content in AGENTS.md:
    markdown
    > **⚠️ Important:** You must read `cursor-agents-md` skills every time before write or update this `AGENTS.md`.
    
    # DeepResearch Project Instructions
    
    [Project-specific guidelines for DeepResearch work]
  2. Create workspace agents directory (if it doesn't exist):
    mkdir -p .cursor/agents
  3. Copy subagent definitions from the skill:
    • Copy all
      .md
      files from
      @deepresearch/subagents/.cursor/agents/
      to your workspace
      .cursor/agents/
    • Required files:
      • research-lead.md
      • collection-strategist.md
      • verification-expert.md
      • evidence-librarian.md
      • methodologist.md
      • quant-analyst.md
      • domain-expert.md
      • editor.md
      • qa-gatekeeper.md
      • devils-advocate.md
  4. Verify Cursor settings:
    • Ensure Cursor is in Nightly mode (Settings > Cursor Settings > Beta > Nightly)
    • Subagents feature should be enabled (Settings > Cursor Settings > Subagents)
  5. Test subagent invocation:
    @research-lead Create a Task Contract for researching [topic]
⚠️ 强制第一步:初始化子代理前,必须使用
cursor-agents-md
技能在项目根目录创建
AGENTS.md
文件。该文件定义了所有DeepResearch工作必须遵循的项目特定指令。
  1. 创建AGENTS.md(强制要求):
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
    AGENTS.md文件必须包含:
    • 项目特定的研究指南
    • 代码风格和文档标准(如适用)
    • 文件结构和组织规则
    • 边界和约束
    • 重要提示:添加更新
      AGENTS.md
      前需阅读
      cursor-agents-md
      技能的提醒
    AGENTS.md必填内容:
    markdown
    > **⚠️ 重要提示:** 每次编写或更新此`AGENTS.md`前,你必须阅读`cursor-agents-md`技能。
    
    # DeepResearch项目指令
    
    [DeepResearch工作的项目特定指南]
  2. 创建工作区agents目录(若不存在):
    mkdir -p .cursor/agents
  3. 复制子代理定义从技能到工作区:
    • @deepresearch/subagents/.cursor/agents/
      下的所有
      .md
      文件复制到工作区
      .cursor/agents/
    • 必填文件:
      • research-lead.md
      • collection-strategist.md
      • verification-expert.md
      • evidence-librarian.md
      • methodologist.md
      • quant-analyst.md
      • domain-expert.md
      • editor.md
      • qa-gatekeeper.md
      • devils-advocate.md
  4. 验证Cursor设置:
    • 确保Cursor处于Nightly模式(设置 > Cursor设置 > Beta > Nightly)
    • 子代理功能应已启用(设置 > Cursor设置 > Subagents)
  5. 测试子代理调用:
    @research-lead Create a Task Contract for researching [topic]

Subagent Roles

子代理角色

SubagentRoleKey Responsibility
research-lead
Research LeadTask Contract, Priority, Final Judgments
collection-strategist
Collection StrategistSource Map Design, Retrieval Routes
verification-expert
OSINT Verification ExpertTruth Testing, Evidence Chain
evidence-librarian
Evidence LibrarianSource Register, Evidence Table
methodologist
Analytic MethodologistStructured Analysis, Hypothesis Competition
quant-analyst
Data AnalystData Consistency, Sensitivity Analysis
domain-expert
Domain ExpertMechanism Validation, Plausibility Check
editor
Editor/StorylinerDecision Packaging, Report Writing
qa-gatekeeper
QA/Ethics GatekeeperQuality Gates, Compliance Check
devils-advocate
Devil's AdvocateAdversarial Review, Counter-Hypotheses
子代理角色核心职责
research-lead
研究负责人任务契约、优先级、最终判断
collection-strategist
收集策略师来源地图设计、检索路径
verification-expert
OSINT验证专家真实性测试、证据链
evidence-librarian
证据管理员来源登记册、证据表
methodologist
分析方法论专家结构化分析、假设竞争
quant-analyst
数据分析师数据一致性、敏感性分析
domain-expert
领域专家机制验证、合理性检查
editor
编辑/叙事师决策包装、报告撰写
qa-gatekeeper
QA/伦理把关人质量门控、合规检查
devils-advocate
魔鬼代言人对抗性评审、反假设

Using Subagents

使用子代理

Invoke subagents using the Task tool in Cursor:
@research-lead Create a Task Contract for researching [topic]
@collection-strategist Design a Source Map for these claims: [claims]
@verification-expert Verify this image: [image URL]
@evidence-librarian Register this evidence: [evidence details]
@methodologist Apply ACH to these hypotheses: [hypotheses]
@quant-analyst Check consistency of these numbers: [numbers]
@domain-expert Validate this mechanism: [mechanism description]
@editor Package this research into a report: [research content]
@qa-gatekeeper Perform QA on this report: [report]
@devils-advocate Challenge this conclusion: [conclusion]
在Cursor中使用任务工具调用子代理:
@research-lead Create a Task Contract for researching [topic]
@collection-strategist Design a Source Map for these claims: [claims]
@verification-expert Verify this image: [image URL]
@evidence-librarian Register this evidence: [evidence details]
@methodologist Apply ACH to these hypotheses: [hypotheses]
@quant-analyst Check consistency of these numbers: [numbers]
@domain-expert Validate this mechanism: [mechanism description]
@editor Package this research into a report: [research content]
@qa-gatekeeper Perform QA on this report: [report]
@devils-advocate Challenge this conclusion: [conclusion]

Subagent Workflow Integration

子代理与工作流的集成

Each subagent is assigned to specific workflow stages:
  • Stage 0-1:
    research-lead
    +
    methodologist
  • Stage 2-3:
    collection-strategist
  • Stage 4:
    verification-expert
  • Stage 5:
    evidence-librarian
  • Stage 6:
    methodologist
  • Stage 7:
    quant-analyst
  • Stage 8:
    devils-advocate
  • Stage 9:
    domain-expert
  • Stage 10:
    editor
  • Stage 11:
    qa-gatekeeper
See "Evidence-Chain Production Line" section above for detailed stage-by-stage SOP and "Weekly Research Rituals" section for weekly ceremony procedures.
每个子代理被分配到特定的工作流阶段:
  • 阶段0-1
    research-lead
    +
    methodologist
  • 阶段2-3
    collection-strategist
  • 阶段4
    verification-expert
  • 阶段5
    evidence-librarian
  • 阶段6
    methodologist
  • 阶段7
    quant-analyst
  • 阶段8
    devils-advocate
  • 阶段9
    domain-expert
  • 阶段10
    editor
  • 阶段11
    qa-gatekeeper
有关详细的阶段SOP,请参阅上文“证据链生产线”章节;有关每周流程,请参阅“每周研究仪式”章节。

Complete Subagent Initialization Guide

完整子代理初始化指南

Prerequisites

前提条件

⚠️ MANDATORY FIRST STEP: Before initializing subagents, you MUST create an
AGENTS.md
file in your project root using the
cursor-agents-md
skill.
Step 0: Create AGENTS.md (MANDATORY):
  1. Invoke cursor-agents-md skill:
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
  2. Required content in AGENTS.md:
    markdown
    > **⚠️ Important:** You must read `cursor-agents-md` skills every time before write or update this `AGENTS.md`.
    
    # DeepResearch Project Instructions
    
    ## Project Overview
    [Describe your research project]
    
    ## Research Guidelines
    [Project-specific research guidelines]
    
    ## File Structure
    [How research files should be organized]
    
    ## Boundaries
    [What should never be done]
  3. Verify AGENTS.md exists:
    bash
    ls AGENTS.md  # Should exist in project root
Why this is mandatory: AGENTS.md provides project-specific context that all subagents need to work effectively. Without it, subagents lack project-specific guidelines and may make incorrect assumptions.
⚠️ 强制第一步:初始化子代理前,必须使用
cursor-agents-md
技能在项目根目录创建
AGENTS.md
文件。
步骤0:创建AGENTS.md(强制要求):
  1. 调用cursor-agents-md技能:
    @cursor-agents-md Create an AGENTS.md file for this DeepResearch project
  2. AGENTS.md必填内容:
    markdown
    > **⚠️ 重要提示:** 每次编写或更新此`AGENTS.md`前,你必须阅读`cursor-agents-md`技能。
    
    # DeepResearch项目指令
    
    ## 项目概述
    [描述你的研究项目]
    
    ## 研究指南
    [项目特定的研究指南]
    
    ## 文件结构
    [研究文件的组织规则]
    
    ## 边界
    [禁止事项]
  3. 验证AGENTS.md存在:
    bash
    ls AGENTS.md  # 应存在于项目根目录
为什么这是强制要求:AGENTS.md提供了所有子代理有效工作所需的项目特定上下文。若无此文件,子代理缺少项目指南,可能做出错误假设。

Copying Subagent Files

复制子代理文件

Option 1: Manual Copy (Recommended)
  1. Locate the skill's subagent directory:
    • Path:
      @deepresearch/subagents/.cursor/agents/
    • Or:
      [skill-install-path]/deepresearch/subagents/.cursor/agents/
  2. Create workspace agents directory (if it doesn't exist):
    bash
    mkdir -p .cursor/agents
  3. Copy all subagent files:
    bash
    # Windows (PowerShell)
    Copy-Item "@deepresearch/subagents/.cursor/agents/*.md" -Destination ".cursor/agents/" -Exclude "README.md"
    
    # Linux/Mac
    cp @deepresearch/subagents/.cursor/agents/*.md .cursor/agents/ --exclude README.md
  4. Required files to copy:
    • research-lead.md
    • collection-strategist.md
    • verification-expert.md
    • evidence-librarian.md
    • methodologist.md
    • quant-analyst.md
    • domain-expert.md
    • editor.md
    • qa-gatekeeper.md
    • devils-advocate.md
Option 2: Using Cursor's File Operations
  1. Open Cursor IDE
  2. Navigate to
    @deepresearch/subagents/.cursor/agents/
  3. Select all
    .md
    files (except
    README.md
    )
  4. Copy to your workspace
    .cursor/agents/
    directory
选项1:手动复制(推荐)
  1. 定位技能的子代理目录:
    • 路径:
      @deepresearch/subagents/.cursor/agents/
    • 或:
      [技能安装路径]/deepresearch/subagents/.cursor/agents/
  2. 创建工作区agents目录(若不存在):
    bash
    mkdir -p .cursor/agents
  3. 复制所有子代理文件:
    bash
    # Windows (PowerShell)
    Copy-Item "@deepresearch/subagents/.cursor/agents/*.md" -Destination ".cursor/agents/" -Exclude "README.md"
    
    # Linux/Mac
    cp @deepresearch/subagents/.cursor/agents/*.md .cursor/agents/ --exclude README.md
  4. 必须复制的文件:
    • research-lead.md
    • collection-strategist.md
    • verification-expert.md
    • evidence-librarian.md
    • methodologist.md
    • quant-analyst.md
    • domain-expert.md
    • editor.md
    • qa-gatekeeper.md
    • devils-advocate.md
选项2:使用Cursor的文件操作
  1. 打开Cursor IDE
  2. 导航到
    @deepresearch/subagents/.cursor/agents/
  3. 选择所有
    .md
    文件(除
    README.md
    外)
  4. 复制到工作区
    .cursor/agents/
    目录

Verification

验证

After copying, verify the setup:
  1. Check AGENTS.md exists (MANDATORY):
    bash
    ls AGENTS.md  # Must exist in project root
  2. Check files exist:
    bash
    ls .cursor/agents/*.md
  3. Test subagent invocation:
    @research-lead Create a Task Contract for researching [your topic]
  4. Verify Cursor settings:
    • Settings > Cursor Settings > Beta > Nightly (enabled)
    • Settings > Cursor Settings > Subagents (should show your subagents)
复制完成后,验证设置:
  1. 检查AGENTS.md存在(强制要求):
    bash
    ls AGENTS.md  # 必须存在于项目根目录
  2. 检查文件存在:
    bash
    ls .cursor/agents/*.md
  3. 测试子代理调用:
    @research-lead Create a Task Contract for researching [your topic]
  4. 验证Cursor设置:
    • 设置 > Cursor设置 > Beta > Nightly(已启用)
    • 设置 > Cursor设置 > Subagents(应显示你的子代理)

Troubleshooting

故障排除

Subagents not appearing:
  • Check Cursor version: Must be Nightly build
  • Check file location: Must be in
    .cursor/agents/
    in workspace root
  • Check file format: Each file must have valid YAML frontmatter
  • Restart Cursor: After copying files, restart Cursor IDE
Subagent not responding:
  • Check file name: Must match subagent name exactly (e.g.,
    research-lead.md
    )
  • Check YAML frontmatter: Must have
    name
    ,
    description
    ,
    model
    fields
  • Check file encoding: Must be UTF-8
子代理未显示:
  • 检查Cursor版本:必须是Nightly版本
  • 检查文件位置:必须在工作区根目录的
    .cursor/agents/
  • 检查文件格式:每个文件必须有有效的YAML前置元数据
  • 重启Cursor:复制文件后重启Cursor IDE
子代理无响应:
  • 检查文件名:必须与子代理名称完全匹配(如
    research-lead.md
  • 检查YAML前置元数据:必须包含
    name
    description
    model
    字段
  • 检查文件编码:必须是UTF-8

File Structure

文件结构

After initialization, your workspace should have:
your-workspace/
├── AGENTS.md (project instructions - MANDATORY)
├── .cursor/
│   └── agents/
│       ├── research-lead.md
│       ├── collection-strategist.md
│       ├── verification-expert.md
│       ├── evidence-librarian.md
│       ├── methodologist.md
│       ├── quant-analyst.md
│       ├── domain-expert.md
│       ├── editor.md
│       ├── qa-gatekeeper.md
│       └── devils-advocate.md
└── [your project files]
初始化后,你的工作区应具有以下结构:
your-workspace/
├── AGENTS.md (项目指令 - 强制要求)
├── .cursor/
│   └── agents/
│       ├── research-lead.md
│       ├── collection-strategist.md
│       ├── verification-expert.md
│       ├── evidence-librarian.md
│       ├── methodologist.md
│       ├── quant-analyst.md
│       ├── domain-expert.md
│       ├── editor.md
│       ├── qa-gatekeeper.md
│       └── devils-advocate.md
└── [你的项目文件]

Customization

自定义

After copying, you can customize subagents for your specific needs:
  1. Edit subagent files in
    .cursor/agents/
  2. Modify descriptions to match your workflow
  3. Add project-specific instructions to system prompts
  4. Restart Cursor to load changes
Note: Customizations are local to your workspace and won't affect the skill definition.
复制完成后,可根据特定需求自定义子代理:
  1. 编辑
    .cursor/agents/
    下的子代理文件
  2. 修改描述以匹配你的工作流
  3. 添加项目特定指令到系统提示
  4. 重启Cursor加载变更
注意:自定义仅针对你的工作区,不会影响技能定义。

Updating Subagents

更新子代理

When the skill is updated:
  1. Compare versions: Check if skill subagents have changed
  2. Backup customizations: Save your custom changes
  3. Re-copy files: Copy updated files from skill
  4. Re-apply customizations: Merge your custom changes back
当技能更新时:
  1. 对比版本:检查技能子代理是否有变更
  2. 备份自定义内容:保存你的自定义修改
  3. 重新复制文件:从技能复制更新后的文件
  4. 重新应用自定义:合并你的自定义修改

Detailed Subagent Role Definitions

详细子代理角色定义

1. research-lead (Research Lead / Owner)

1. research-lead(研究负责人 / 所有者)

Core Responsibilities:
  • Define Task Contract: research question, boundaries, time window, deliverable format, success criteria
  • Set priorities and rhythm: prioritize KIQs that most impact decisions
  • Make final judgments: Key Judgments, confidence levels, risk narrative
Mental Model:
  • "Decision-backward": Ask "what will readers decide" first, then determine what to research
  • "Claim-first": Write research as falsifiable claims, not topic summaries
  • "Stop rules": Define what evidence is sufficient, when to stop
Key Outputs: Task Contract v1, Claim Tree v1, Key Judgments (final), Priority Matrix
Quality Gates: Research question is falsifiable and testable; Non-goals are explicit; KIQs ≤ 7 and actionable; Success criteria defined
Common Pitfalls: Unclear goals leading to scope creep; Premature convergence using intuition over evidence

核心职责:
  • 定义任务契约:研究问题、边界、时间窗口、交付物格式、成功标准
  • 设置优先级和节奏:优先处理对决策影响最大的KIQs
  • 做出最终判断:关键判断、置信度、风险叙事
思维模型:
  • “从决策倒推”:先问“读者会做出什么决策”,再确定研究内容
  • “主张先行”:将研究写成可证伪的主张,而非主题摘要
  • “停止规则”:定义足够的证据标准和停止时机
核心输出:任务契约v1、主张树v1、最终关键判断、优先级矩阵
质量门控:研究问题可证伪、可测试;非目标明确;KIQs数量≤7且可操作;成功标准已定义
常见陷阱:目标模糊导致范围蔓延;凭直觉而非证据过早收敛

2. collection-strategist (Collection Strategist / Source Map Designer)

2. collection-strategist(收集策略师 / 来源地图设计师)

Core Responsibilities:
  • Create Source Map: source type combinations for each sub-question (official/academic/industry/primary data/media/UGC)
  • Design retrieval routes: keywords, language variants, exclusion terms, archiving strategy
  • Control costs: prioritize "most discriminating" information
Mental Model:
  • "Source type complementarity": Each key judgment has at least 2 independent source types for cross-validation
  • "High signal first": Get high-credibility/auditable primary materials first, then supplement with secondary
  • "Coverage thinking": Focus on claims coverage, not link count
Key Outputs: Source Map, Collection Plan, Retrieval Route Specifications, Archiving Strategy
Quality Gates: Each claim has ≥2 independent source types; Cross-validation design exists; Archiving strategy defined
Common Pitfalls: Single-source dependency (only news/only company PR); No archiving leading to non-auditable results

核心职责:
  • 创建来源地图:每个子问题的来源类型组合(官方/学术/行业/原始数据/媒体/UGC)
  • 设计检索路径:关键词、语言变体、排除词、存档策略
  • 控制成本:优先获取“区分度最高”的信息
思维模型:
  • “来源类型互补”:每个关键判断至少有2种独立来源类型用于交叉验证
  • “高信号优先”:先获取高可信度/可审计的原始材料,再补充次级来源
  • “覆盖思维”:聚焦主张覆盖度,而非链接数量
核心输出:来源地图、收集计划、检索路径规范、存档策略
质量门控:每个主张对应≥2种独立来源类型;存在交叉验证设计;已定义存档策略
常见陷阱:单一来源依赖(仅新闻/仅企业PR);未存档导致结果不可审计

3. verification-expert (OSINT Verification Expert / Truth Tester)

3. verification-expert(OSINT验证专家 / 真实性测试员)

Core Responsibilities:
  • Verify UGC, event materials, images, videos: source/location/time/editing traces/reuse
  • Output auditable evidence chains: coordinates, screenshot comparisons, exclusion rationale, timelines
Mental Model:
  • "Falsify first, then verify": Try to prove it's fake first (reused old images, out-of-context, fake accounts)
  • "Chain inference": Only advance to "auditable" degree at each step, no jumps
  • "Consistency constraints": Time, geography, physical details must be mutually consistent
Key Outputs: Verified Evidence Pool, Geolocation Reports, Chronolocation Analysis, Verification Log
Quality Gates: Passes "falsify first" test; Evidence chain is auditable; Consistency checks passed
Common Pitfalls: Only giving conclusions without chain; Treating "looks like" as evidence

核心职责:
  • 验证UGC、事件材料、图片、视频:来源/位置/时间/编辑痕迹/复用情况
  • 输出可审计的证据链:坐标、截图对比、排除理由、时间线
思维模型:
  • “先证伪,再验证”:先尝试证明其为伪造(复用旧图、断章取义、虚假账户)
  • “链式推理”:每一步仅推进到“可审计”程度,不跳跃
  • “一致性约束”:时间、地理、物理细节必须相互一致
核心输出:已验证证据池、地理位置报告、时间定位分析、验证日志
质量门控:通过“先证伪”测试;证据链可审计;一致性检查通过
常见陷阱:仅给出结论而无证据链;将“看起来像”视为证据

4. evidence-librarian (Evidence Librarian / Sourcing & Traceability)

4. evidence-librarian(证据管理员 / 来源与可追溯性)

Core Responsibilities:
  • Source Register (source registry), Evidence Table (claim-evidence table)
  • Citation standards: each key judgment traceable to sources (with version/timestamp/location)
  • Conflict evidence management: conflict points, methods, bias risks, why chosen
Mental Model:
  • "Audit perspective": Assume someone will verify line by line, can you reproduce?
  • "Claim-evidence alignment": Each key sentence points to evidence; without evidence, downgrade to hypothesis
  • "Conflicts don't disappear": Conflicting evidence is an asset, must be explicitly presented
Key Outputs: Source Register, Evidence Table, Citation Index, Conflict Evidence Log
Quality Gates: Each key judgment has traceable citations; Evidence Table is complete; Conflicts are explicitly recorded
Common Pitfalls: Citations stack links but don't correspond to claims; Adding citations on last day causing full rework

核心职责:
  • 来源登记册、主张-证据表
  • 引用标准:每个关键判断可追溯到来源(含版本/时间戳/位置)
  • 冲突证据管理:冲突点、方法、偏差风险、选择理由
思维模型:
  • “审计视角”:假设有人会逐行验证,能否复现结果?
  • “主张-证据对齐”:每个关键句子都指向证据;无证据则降级为假设
  • “冲突不会消失”:冲突证据是资产,必须明确呈现
核心输出:来源登记册、证据表、引用索引、冲突证据日志
质量门控:每个关键判断都有可追溯引用;证据表完整;冲突已明确记录
常见陷阱:引用堆叠链接但与主张不对应;最后一天添加引用导致返工

5. methodologist (Analytic Methodologist / Reasoning Engineer)

5. methodologist(分析方法论专家 / 推理工程师)

Core Responsibilities:
  • Select and facilitate structured analysis: ACH, Key Assumptions Check, Red Team, scenario planning, indicator framework
  • Make reasoning process "replayable": why this explanation is better, what are key discriminating evidence
Mental Model:
  • "Competing explanations": Default at least 2 sets of hypotheses explaining the world in parallel
  • "Maximize discrimination": Prioritize evidence that best distinguishes hypotheses, not most evidence
  • "Bias immunity": Design processes to counter confirmation bias, premature convergence, mirror imaging
Key Outputs: ACH Matrix, Hypothesis Analysis, Key Assumptions Check, Alternative Explanations
Quality Gates: At least 2 competing hypotheses exist; Discriminating evidence identified; Bias mitigation techniques applied
Common Pitfalls: Only writing narrative; Not doing alternative explanations; Writing logical deductions as fact statements

核心职责:
  • 选择并推动结构化分析:ACH、关键假设检查、红队法、场景规划、指标框架
  • 使推理过程“可回放”:为什么该解释更优,关键区分性证据是什么
思维模型:
  • “竞争性解释”:默认至少存在2种并行解释世界的假设
  • “最大化区分度”:优先选择最能区分假设的证据,而非更多证据
  • “抗偏差”:设计流程抵消确认偏差、过早收敛、镜像思维
核心输出:ACH矩阵、假设分析、关键假设检查、替代解释
质量门控:至少存在2种竞争性假设;已识别区分性证据;已应用偏差缓解技术
常见陷阱:仅撰写叙事;未考虑替代解释;将逻辑推导写成事实陈述

6. quant-analyst (Data Analyst / Numbers & Consistency)

6. quant-analyst(数据分析师 / 数据与一致性)

Core Responsibilities:
  • Data cleaning, metric unification, comparable system building (e.g., market size, share, financial metrics)
  • Sensitivity analysis/scenario analysis: how sensitive conclusions are to assumption changes
  • Consistency checks: do numbers conflict, are timelines closed
Mental Model:
  • "Metrics before numbers": Unify definitions first, then discuss conclusions
  • "Range and error": Output intervals, confidence and error sources
  • "Explainable modeling": Models reveal driving factors, not "calculate a precise number"
Key Outputs: Consistency Report, Sensitivity Analysis, Data Cleaning Log, Metric Unification Guide
Quality Gates: Metrics are unified; Consistency checks passed; Sensitivity analysis complete
Common Pitfalls: Using precise numbers to mask uncertainty; Ignoring metric differences leading to wrong comparisons

核心职责:
  • 数据清洗、指标统一、可比系统构建(如市场规模、份额、财务指标)
  • 敏感性分析/场景分析:结论对假设变更的敏感度
  • 一致性检查:数据是否冲突,时间线是否闭合
思维模型:
  • “先指标后数据”:先统一定义,再讨论结论
  • “范围与误差”:输出区间、置信度和误差来源
  • “可解释建模”:模型揭示驱动因素,而非“计算精确数字”
核心输出:一致性报告、敏感性分析、数据清洗日志、指标统一指南
质量门控:指标已统一;一致性检查通过;敏感性分析完整
常见陷阱:用精确数字掩盖不确定性;忽略指标差异导致错误比较

7. domain-expert (Domain Expert / Context & Plausibility)

7. domain-expert(领域专家 / 上下文与合理性)

Core Responsibilities:
  • Provide "industry common sense boundaries": which conclusions cannot hold in reality (common mechanisms, regulation, business logic)
  • Guide evidence priorities: which sources/indicators are more critical in this domain
  • Help team identify "seemingly reasonable but mechanistically wrong" inferences
Mental Model:
  • "Mechanism testing": Not just "is there evidence", but "does it make sense mechanistically"
  • "Anomaly identification": When seeing signals violating industry patterns, trigger re-verification
Key Outputs: Mechanism Memo, Plausibility Assessment, Industry Context Guide, Anomaly Flag List
Quality Gates: Mechanisms are plausible; Industry patterns respected; Anomalies flagged and investigated
Common Pitfalls: Authority suppressing evidence (deciding by experience); Writing domain language that's unreadable

核心职责:
  • 提供“行业常识边界”:哪些结论在现实中不可能成立(常见机制、法规、业务逻辑)
  • 指导证据优先级:该领域哪些来源/指标更关键
  • 帮助团队识别“看似合理但机制错误”的推理
思维模型:
  • “机制测试”:不仅看“是否有证据”,还要看“机制上是否合理”
  • “异常识别”:当发现违反行业模式的信号时,触发重新验证
核心输出:机制备忘录、合理性评估、行业上下文指南、异常标记列表
质量门控:机制合理;符合行业模式;异常已标记并调查
常见陷阱:权威压制证据(凭经验决策);撰写难以理解的领域术语

8. editor (Editor / Storyliner / Decision Packaging)

8. editor(编辑 / 叙事师 / 决策包装)

Core Responsibilities:
  • Package research into usable deliverables: conclusion-first, clear hierarchy, scannable, forwardable
  • Control language quality: qualifiers, risk warnings, consistent uncertainty expression
  • "Reader experience": readers can grasp key judgments in 3 minutes, understand basis in 10 minutes
Mental Model:
  • "Reader bandwidth": Information density designed for decision-maker's time
  • "Conclusion-reason-evidence pyramid": Each layer stands independently
  • "Semantic precision": Treat "possible/likely/almost certain" as engineering specs, not rhetoric
Key Outputs: Draft Report, Executive Summary, Key Judgments Section, Risk Narrative
Quality Gates: Conclusion-first structure; 3-minute grasp test passed; Uncertainty expressed consistently
Common Pitfalls: Treating process as output; Sacrificing rigor for fluency (mixing facts/judgments)

核心职责:
  • 将研究包装成可用交付物:结论先行、层次清晰、易扫描、可转发
  • 控制语言质量:限定词、风险警告、一致性不确定性表达
  • “读者体验”:读者可在3分钟内掌握关键判断,10分钟内理解依据
思维模型:
  • “读者带宽”:为决策者的时间设计信息密度
  • “结论-理由-证据金字塔”:每层独立成立
  • “语义精准”:将“可能/很可能/几乎肯定”视为工程规范,而非修辞
核心输出:报告草稿、执行摘要、关键判断章节、风险叙事
质量门控:结论先行结构;通过3分钟掌握测试;不确定性表达一致
常见陷阱:将过程作为输出;为流畅性牺牲严谨性(混淆事实/判断)

9. qa-gatekeeper (QA / Risk & Ethics / Gatekeeper)

9. qa-gatekeeper(QA / 风险与伦理 / 把关人)

Core Responsibilities:
  • Tradecraft QA: objectivity, source transparency, alternative explanations, uncertainty, logical consistency
  • Compliance and ethics boundaries: privacy, permissions, gray data, misinformation risk
  • Failure plans: how to downgrade delivery when evidence insufficient, how to declare gaps
Mental Model:
  • "Prevent disasters before adding points": Research's most expensive cost is wrong conclusions causing decision losses
  • "Gatekeeper not debater": Doesn't make conclusions for you, but ensures you're qualified to make conclusions
  • "Revocability": Any conclusion must allow future updates with new evidence
Key Outputs: QA Report, Go/No-Go Decision, Compliance Checklist, Risk Assessment
Quality Gates: All quality gates passed; Compliance verified; Ethics boundaries respected
Common Pitfalls: QA intervenes too late; Treating compliance as formal review rather than part of research method

核心职责:
  • 行业标准QA:客观性、来源透明度、替代解释、不确定性、逻辑一致性
  • 合规与伦理边界:隐私、权限、灰色数据、错误信息风险
  • 失败预案:证据不足时如何降级交付,如何声明缺口
思维模型:
  • “防患于未然”:研究的最高成本是错误结论导致的决策损失
  • “把关人而非辩论者”:不为你做结论,但确保你有资格做结论
  • “可修订性”:任何结论必须允许未来用新证据更新
核心输出:QA报告、通过/不通过决策、合规检查表、风险评估
质量门控:所有质量门控已通过;合规性已验证;伦理边界已遵守
常见陷阱:QA介入过晚;将合规视为形式审查而非研究方法的一部分

10. devils-advocate (Devil's Advocate / Systematic Dissenter)

10. devils-advocate(魔鬼代言人 / 系统性反对者)

Core Responsibilities:
  • Construct counterfactual worlds: 2-4 "completely different but still self-consistent" world versions for current main conclusion
  • Attack evidence chain: identify single-point failures, sensitivity points, unverifiable reasoning steps
  • Trigger critical re-verification: specify "Kill Points" where if evidence is falsified, entire conclusion must restart
  • Failure simulation: design 3-5 "research failure decision disaster" scenarios
Mental Model:
  • "Adversarial intelligence assumption": Assume you're seeing selectively exposed information, systematically manipulated
  • "Disconfirming evidence priority": A conclusion doesn't need more supporting evidence, it needs disconfirming attempts that still can't kill it
  • "Failure backward": Ask "three years later, where is this project most likely to fail on which assumption"
  • "Single-point failure sensitivity": Particularly dislikes key judgments supported by only one piece of evidence
Key Outputs: Counter-Hypothesis Brief, Kill-Point List, Fragility Map, Adversarial Review Memo, Decision-Failure Simulation
Quality Gates: At least 2 counterfactual worlds constructed; Kill points identified; Fragility map complete; Adversarial review challenges main conclusion
Common Pitfalls: Anyone temporarily playing opposition → becomes polite objection without attack power; Only questioning views, not evidence chain → only hits surface, not structure; Only raising doubts, not proposing alternative worlds → cannot drive re-verification

核心职责:
  • 构建反事实场景:针对当前主结论,构建2-4种“完全不同但自洽”的场景
  • 攻击证据链:识别单点故障、敏感点、不可验证的推理步骤
  • 触发关键重新验证:指定“致命点”,若该证据被证伪,整个结论必须重启
  • 失败模拟:设计3-5种“研究失败导致决策灾难”的场景
思维模型:
  • “对抗性情报假设”:假设看到的是被选择性暴露、系统性操纵的信息
  • “反证优先”:结论不需要更多支持证据,需要的是无法证伪它的反证尝试
  • “从失败倒推”:问“三年后,这个项目最可能在哪个假设上失败”
  • “单点故障敏感性”:特别反感仅由单一证据支持的关键判断
核心输出:反假设简报、致命点列表、脆弱性地图、对抗性评审备忘录、决策失败模拟
质量门控:至少构建2种反事实场景;已识别致命点;脆弱性地图完整;对抗性评审挑战主结论
常见陷阱:临时扮演反对者→变成礼貌性异议,无攻击力度;仅质疑观点而非证据链→仅触及表面,未撼动结构;仅提出疑问而非替代场景→无法推动重新验证

Parallel Processing & Iteration Details

并行处理与迭代细节

Parallel Processing Opportunities:
  • Stage 4 (Verification) + Stage 5 (Evidence Registration): As evidence is verified, librarian can start registering
  • Stage 6 (Structured Reasoning) + Stage 7 (Data Consistency): Can run in parallel, then reconcile
  • Stage 8 (Devil's Advocate) + Stage 9 (Domain Expert): Can run in parallel, then reconcile
Common Iteration Loops:
  • Stage 6 → Stage 2: If evidence insufficient, return to collection
  • Stage 8 → Stage 6: If Kill Points identified, may need re-analysis
  • Stage 11 → Stage 10: If QA fails, return to editor
  • Stage 11 → Any Stage: If major issues found, return to appropriate stage
Handoff Protocols: Each handoff should include output artifacts, status summary, known issues, and next steps.
Quality Gate Escalation: If a gate fails: Minor issue → Fix within stage; Moderate issue → Return to previous stage; Major issue → Return to Stage 0 (Task Contract) or Stage 1 (KIQ)
并行处理机会:
  • 阶段4(验证) + 阶段5(证据登记):证据验证后,管理员可立即开始登记
  • 阶段6(结构化推理) + 阶段7(数据一致性):可并行运行,再 reconcile
  • 阶段8(魔鬼代言人) + 阶段9(领域专家):可并行运行,再 reconcile
常见迭代循环:
  • 阶段6 → 阶段2:若证据不足,返回收集阶段
  • 阶段8 → 阶段6:若识别致命点,可能需要重新分析
  • 阶段11 → 阶段10:若QA失败,返回编辑阶段
  • 阶段11 → 任意阶段:若发现重大问题,返回对应阶段
交接协议:每次交接应包含输出工件、状态摘要、已知问题和下一步计划
质量门控升级:若门控失败:小问题→阶段内修复;中等问题→返回上一阶段;大问题→返回阶段0(任务契约)或阶段1(KIQ)

Small Team Adaptation

小型团队适配

For small teams (1-3 people), one person can wear multiple hats:
  • Solo Researcher: research-lead + methodologist + editor
  • Two-Person Team:
    • Person A: research-lead + collection-strategist + evidence-librarian
    • Person B: methodologist + verification-expert + editor + qa-gatekeeper
  • Three-Person Team: Distribute roles more evenly
Key Principle: Even if one person does multiple roles, the functions should not be skipped. The mental models and quality gates still apply.
对于小型团队(1-3人),一人可兼任多个角色:
  • ** solo研究者**:research-lead + methodologist + editor
  • 两人团队
    • A成员:research-lead + collection-strategist + evidence-librarian
    • B成员:methodologist + verification-expert + editor + qa-gatekeeper
  • 三人团队:更均匀地分配角色
核心原则:即使一人兼任多角色,功能也不能跳过。思维模型和质量门控仍需遵循。