ultra-research

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

深度研究 Skill

Deep Research Skill

三段式：理解需求 → 迭代探索 → 整合报告。

通过 Playwright MCP 控制多个 AI 服务进行并行研究。各服务交互细节见 reference 文件。

Three-stage Process: Understand Requirements → Iterative Exploration → Integrate Report.

Control multiple AI services for parallel research via Playwright MCP. Details of interactions with each service can be found in reference files.

前置条件

Prerequisites

Playwright MCP 需以持久化浏览器模式运行（
```
--user-data-dir
```
），这样各 AI 服务的登录态才会保留
用户需要预先在 Playwright 浏览器中登录要使用的 AI 服务（Gemini、ChatGPT、Claude、Grok）——不需要全部登录，登录了哪些就用哪些
本 skill 的选择器基于中文 UI 环境；英文 UI 下按钮名和 placeholder 会不同，需用
```
browser_snapshot
```
实时探测

Playwright MCP must run in persistent browser mode (
```
--user-data-dir
```
) so that the login states of each AI service are retained
Users need to log in to the desired AI services (Gemini, ChatGPT, Claude, Grok) in the Playwright browser in advance — you don't need to log in to all, only use the ones you've logged into
The selectors of this skill are based on Chinese UI environment; button names and placeholders will differ in English UI, so real-time detection via
```
browser_snapshot
```
is required

可用的 AI 服务与模式

Available AI Services and Modes

服务	模式	耗时	特点
Gemini Pro	快速	10-60s	快速问答、事实查证（有搜索能力）
Gemini Deep Research	深度	5-10min	多源综合、广度覆盖
ChatGPT Thinking	快速	10-120s	深度推理、结构化分析
ChatGPT Deep Research	深度	5-30min	多源综合、广度覆盖
Claude Opus Extended	快速	10-120s	深度推理、方案设计
Grok 快速/专家	快速	10-60s	X/Twitter 内容搜索、时效性讨论和趋势
Grok DeepSearch	深度	数分钟	深度搜索、社交媒体观点汇集

这些是工具箱，不是流水线。 根据研究需要灵活选用和组合。

交互参考：

references/gemini.md — Gemini 操作、选择器、提取
references/chatgpt.md — ChatGPT 操作（最脆弱的服务，注意防御）
references/claude-web.md — Claude 操作、选择器、提取
references/grok.md — Grok 操作（X/Twitter 内容搜索的独特入口）
references/extraction.md — 通用提取技术（分块、引用链接、保存）
references/prompt-guide.md — 研究 prompt 构造建议

Service	Mode	Time Cost	Features
Gemini Pro	Fast	10-60s	Quick Q&A, fact-checking (with search capability)
Gemini Deep Research	Deep	5-10min	Multi-source synthesis, broad coverage
ChatGPT Thinking	Fast	10-120s	In-depth reasoning, structured analysis
ChatGPT Deep Research	Deep	5-30min	Multi-source synthesis, broad coverage
Claude Opus Extended	Fast	10-120s	In-depth reasoning, solution design
Grok Fast/Expert	Fast	10-60s	X/Twitter Content Search, real-time discussions and trends
Grok DeepSearch	Deep	Several minutes	In-depth search, social media opinion aggregation

These are toolboxes, not assembly lines. Flexibly select and combine them based on research needs.

Interaction references:

references/gemini.md — Gemini operations, selectors, extraction
references/chatgpt.md — ChatGPT operations (the most fragile service, pay attention to defense)
references/claude-web.md — Claude operations, selectors, extraction
references/grok.md — Grok operations (unique entry for X/Twitter content search)
references/extraction.md — General extraction techniques (chunking, reference links, saving)
references/prompt-guide.md — Suggestions for constructing research prompts

Part 1: 理解需求

Part 1: Understand Requirements

目标：搞清楚用户要什么，然后开始。

Goal: Clarify what the user needs, then get started.

先判断：用户的需求清楚吗？

First Judgment: Is the User's Requirement Clear?

收到用户 query 后，先判断你是否能理解研究的对象、范围、目的：

清楚 → 直接向用户确认你的理解（如需微调范围），然后开始
大致清楚但有细节模糊 → 先 WebSearch 搜 2-3 次补充认知，再和用户确认
不清楚 → 先 WebSearch 建立领域认知，再带着具体问题向用户澄清

澄清不是走流程——如果用户的意图一目了然，直接干活比问一堆问题有用。

After receiving the user's query, first judge whether you can understand the object, scope, and purpose of the research:

Clear → Directly confirm your understanding with the user (if scope adjustment is needed), then start
Generally clear but with ambiguous details → First perform 2-3 WebSearch to supplement your cognition, then confirm with the user
Unclear → First perform WebSearch to establish domain cognition, then clarify with specific questions to the user

Clarification is not a formality — if the user's intention is obvious, taking action directly is more useful than asking a bunch of questions.

构造查询

Construct Queries

为选定的 AI 模式各准备一条查询。不同 AI 给不同角度比给同一个 prompt 更有效。见 prompt-guide.md。

Prepare a query for each selected AI mode. Providing different perspectives to different AIs is more effective than using the same prompt for all. See prompt-guide.md.

创建研究目录

Create Research Directory

在合适位置创建研究目录（如

research-主题/

），创建 todo.md 记录状态。

Create a research directory at an appropriate location (e.g.,

research-topic/

), and create todo.md to record the status.

Part 2: 迭代探索

Part 2: Iterative Exploration

目标：多路并发搜索，逐步深入，直到信息充分。

Goal: Conduct multi-channel concurrent searches, gradually deepen, until sufficient information is obtained.

深度由进展决定

Depth is Determined by Progress

不要预设研究深度，让研究过程自己说话：

起步：先用快速模式铺开（Gemini Pro、ChatGPT Thinking、Claude Opus、Grok 专家），快速建立基线认知
发现复杂度：如果快速模式的结果显示话题确实多维、有矛盾、需要更多来源 → 追加 Deep Research / DeepSearch
收敛：如果快速模式已经给出了一致的、充分的答案 → 不需要开 DR，直接整合

简单问题不要大炮打蚊子。复杂问题不要浅尝辄止。让信息本身告诉你该挖多深。

Do not preset research depth; let the research process speak for itself:

Start : First use fast modes to lay the groundwork (Gemini Pro, ChatGPT Thinking, Claude Opus, Grok Expert) to quickly establish baseline cognition
Discover Complexity : If results from fast modes show the topic is indeed multi-dimensional, contradictory, or requires more sources → Add Deep Research / DeepSearch
Converge : If fast modes have provided consistent and sufficient answers → No need to enable DR, directly integrate

Do not use a sledgehammer to crack a nut for simple questions. Do not stop at the surface for complex questions. Let the information itself tell you how deep to dig.

派发

Dispatch

每个 AI 服务的具体操作步骤见各自 reference 文件。通用流程：

新开 Tab → 导航到服务 → 检查登录
选择模式 → 输入查询
记录到 todo.md（Tab 编号 + 查询内容）
发完一个立刻切下一个，不等结果

登录处理：这些服务使用的是用户的账号。如果发现某个服务未登录，立即暂停并告知用户，让用户选择：

去 Playwright 浏览器中登录该服务，然后继续
跳过该服务，用其他已登录的服务继续

不要自动跳过——用户可能想登录。

Specific operation steps for each AI service can be found in their respective reference files. General process:

Open a new Tab → Navigate to the service → Check login status
Select mode → Enter query
Record to todo.md (Tab number + query content)
Switch to the next one immediately after sending, do not wait for results

Login Handling : These services use the user's account. If a service is found to be unlogged, immediately pause and inform the user, allowing the user to choose:

Log in to the service in the Playwright browser, then continue
Skip this service and continue with other logged-in services

Do not skip automatically — the user may want to log in.

轮询与提取

Polling and Extraction

先到先提取，不等所有流完成。

快速模式：~30s 后开始检查
DR/DeepSearch 模式：~90s 后开始检查
用 screenshot 判断是否完成（比 snapshot 可靠）——各服务完成标志见 reference

提取技术见 extraction.md。每份结果保存为独立 md（

01-gemini-pro.md

、

02-chatgpt-thinking.md

...）。

Extract as soon as results are available, do not wait for all streams to complete.

Fast modes: Start checking after ~30s
DR/DeepSearch modes: Start checking after ~90s
Use screenshots to judge completion (more reliable than snapshot) — completion markers for each service can be found in reference files

Extraction techniques can be found in extraction.md. Save each result as an independent md file (e.g.,

01-gemini-pro.md

02-chatgpt-thinking.md

...).

定向深入

Targeted Deepening

每提取一份结果，立即评估：

发现了哪些子问题？
哪些维度数据不够或有矛盾？
有没有意外发现值得追查？

如果有 → 追加查询。选择最合适的工具：

快速模式 — 定向查证具体子问题（秒级）
WebSearch/WebFetch — 交叉验证事实（比开新 AI Tab 更快）
DR/DeepSearch — 仅用于发现的子问题确实需要广度覆盖时
Grok — 特别适合查看某话题在 X/Twitter 上的讨论和观点

Evaluate immediately after extracting each result:

What sub-issues have been discovered?
Which dimensions lack data or have contradictions?
Are there any unexpected findings worth investigating?

If yes → Add supplementary queries. Choose the most suitable tool:

Fast modes — Targeted verification of specific sub-issues (seconds-level)
WebSearch/WebFetch — Cross-validate facts (faster than opening a new AI Tab)
DR/DeepSearch — Only use when the discovered sub-issues truly require broad coverage
Grok — Especially suitable for viewing discussions and opinions on a topic on X/Twitter

Tab 管理

Tab Management

控制在 ≤8 个活跃 Tab，多了浏览器会不稳定
提取完的 Tab 可以保留（可能需要追问），但不要无限堆积
浏览器断连时：
```
browser_close
```
→
```
pkill -f "mcp-chrome"
```
→
```
sleep 2
```
→ 重新 navigate

Keep active Tabs ≤8; too many will make the browser unstable
Tabs that have been extracted can be retained (may need follow-up questions), but do not accumulate indefinitely
If the browser disconnects:
```
browser_close
```
→
```
pkill -f "mcp-chrome"
```
→
```
sleep 2
```
→ Re-navigate

何时停止

When to Stop

核心问题有了交叉验证的答案
新搜索不再产出实质性新信息
用户要求的范围已充分覆盖

Core questions have cross-validated answers
New searches no longer produce substantial new information
The scope required by the user has been fully covered

Part 3: 整合报告

Part 3: Integrate Report

目标：按主题维度重组内容，交叉验证，输出一份深度报告。

Goal: Restructure content by topic dimensions, cross-validate, and output an in-depth report.

原则

Principles

按主题组织，不按来源罗列（不要 "Gemini 说... ChatGPT 说..."）
标注来源（
```
[Gemini DR]
```
、
```
[Grok]
```
、
```
[ChatGPT Thinking]
```
等），让读者知道可信度
共识（多路一致）和分歧（各执一词）都要标注
保留引用 URL — 原始网络来源链接是高价值信息，必须传递到最终报告
分歧不强行裁决 — 保留并标注，让用户判断
信息权重：DR 通常更全面，快速模式更聚焦，WebSearch 最即时，Grok 的 X/Twitter 链接是独家信息

Organize by topic, not by source (do not use "Gemini said... ChatGPT said...")
Label sources (e.g.,
```
[Gemini DR]
```
,
```
[Grok]
```
,
```
[ChatGPT Thinking]
```
) so readers can judge credibility
Both consensus (consistent across multiple channels) and disagreements (conflicting views) must be labeled
Retain citation URLs — Original web source links are high-value information and must be included in the final report
Do not force resolution of disagreements — Retain and label them, letting the user judge
Information weight: DR is usually more comprehensive, fast modes are more focused, WebSearch is the most real-time, and Grok's X/Twitter links are exclusive information

引用体系

Citation System

报告有两层引用，都要做好：

1. 网络来源引用（报告 → 原始网页）

正文中关键事实、数据、观点旁标注原始网络来源。格式灵活——脚注、行内链接、段落末尾引用块都行，根据报告风格选择最可读的方式。核心原则：读者看到一个关键论断，能直接找到原始网页验证。

各 AI 的引用链接已在原始结果文件的

## 参考链接

章节。整合时从那里挑选最相关的链接嵌入报告正文。不需要全部搬过来——只保留支撑关键论点的。

2. AI 来源标注（报告 → 哪个 AI 说的）

用行内标签

[Gemini DR]

、

[Grok]

、

[ChatGPT Thinking]

等标注信息来自哪个 AI 模式。这帮助读者判断可信度（DR 多源交叉 vs 快速模式单次推理）。多路一致的可以标

[多路共识]

。

3. 原始结果索引（报告 → 原始文件）

报告末尾附

## 原始结果索引

，列出所有原始结果文件和对应的查询内容：

undefined

The report has two layers of citations, both of which must be properly handled:

1. Web Source Citations (Report → Original Webpage)

Label the original web source next to key facts, data, and viewpoints in the text. The format is flexible — footnotes, inline links, citation blocks at the end of paragraphs are all acceptable; choose the most readable way based on the report style. Core principle: When readers see a key assertion, they can directly find the original webpage for verification.

Citation links for each AI are in the

## Reference Links

section of the original result files. When integrating, select the most relevant links from there to embed in the report text. No need to move all over — only retain those that support key arguments.

2. AI Source Labeling (Report → Which AI said it)

Use inline tags like

[Gemini DR]

[Grok]

[ChatGPT Thinking]

to label which AI mode the information comes from. This helps readers judge credibility (multi-source cross-validation in DR vs single inference in fast modes). Consensus across multiple channels can be labeled

[Multi-channel Consensus]

3. Original Result Index (Report → Original Files)

Attach

## Original Result Index

at the end of the report, listing all original result files and their corresponding query content:

undefined

原始结果索引

Original Result Index

文件	服务	模式	查询摘要
01-gemini-pro.md	Gemini	Pro	...
02-chatgpt-dr.md	ChatGPT	Deep Research	...


这让读者能回溯到完整的原始结果（含全部参考链接）。

File	Service	Mode	Query Summary
01-gemini-pro.md	Gemini	Pro	...
02-chatgpt-dr.md	ChatGPT	Deep Research	...


This allows readers to trace back to the complete original results (including all reference links).

流程

Process

写初稿 — 基于已有结果先写能写的，同步嵌入引用链接
标记缺口 — 哪些数据不够？哪些只有单一来源？
缺口重大 → 回到 Part 2 补充
逐步更新 — 新信息到手就更新报告
最终审校 — 检查引用完整性（关键论断有没有来源支撑？）→ 交付用户

报告保存为

00-final-report.md

。

Write Draft — First write what you can based on existing results, and embed citation links simultaneously
Mark Gaps — Which data is insufficient? Which has only a single source?
Significant Gaps → Return to Part 2 to supplement
Gradually Update — Update the report as new information is obtained
Final Review — Check citation integrity (Are key assertions supported by sources?) → Deliver to user

Save the report as

00-final-report.md

todo.md 持久化

todo.md Persistence

研究可能跨会话。todo.md 是恢复点。

markdown

undefined

Research may span sessions. todo.md is the recovery point.

markdown

undefined

研究：[主题]

Research: [Topic]

状态：[进行中/已完成]

Status: [In Progress/Completed]

搜索流

Search Streams

#	服务	模式	Tab	查询摘要	状态
01	Gemini	Pro	2	...	已提取
02	ChatGPT	DR	3	...	等待中
03	Grok	专家	4	...	进行中

#	Service	Mode	Tab	Query Summary	Status
01	Gemini	Pro	2	...	Extracted
02	ChatGPT	DR	3	...	Pending
03	Grok	Expert	4	...	In Progress

发现的子问题

Discovered Sub-issues

整合进度

Integration Progress

初稿
缺口填补
最终审校


**跨会话恢复**：读 todo.md → `browser_tabs` list 检查现有 Tab → 对照恢复。

---

Draft
Gap Filling
Final Review


**Cross-session Recovery** : Read todo.md → Check existing Tabs via `browser_tabs` list → Restore accordingly.

---

错误处理

Error Handling

情况	建议
未登录	暂停，告知用户——让用户选择登录还是跳过该服务
浏览器断连	`browser_close` → `pkill -f "mcp-chrome"` → sleep 2 → 重新 navigate
Cloudflare 拦截	重试 1-2 次 navigate，通常第二次就通过
单 Tab 超时（>40min）	标记超时，继续其他流
提取返回空	见 extraction.md 的降级策略
snapshot 返回空	等 2-3 秒重试，ChatGPT/Grok 有时会进入瞬时空状态
DR 未启动	ChatGPT DR 可能先回澄清问题，检测后回复让它直接开始
DR/DeepSearch 配额用尽	跳过，改用快速模式或 WebSearch 补充

部分失败不影响产出：≥2 路成功仍出报告（注明来源范围）。

Scenario	Suggestion
Not Logged In	Pause and inform the user — Let the user choose to log in or skip the service
Browser Disconnection	`browser_close` → `pkill -f "mcp-chrome"` → sleep 2 → Re-navigate
Cloudflare Interception	Retry navigation 1-2 times; usually passes on the second try
Single Tab Timeout (>40min)	Mark as timeout, continue with other streams
Extraction Returns Empty	See the downgrade strategy in extraction.md
snapshot Returns Empty	Retry after 2-3 seconds; ChatGPT/Grok sometimes enter a transient empty state
DR Not Started	ChatGPT DR may first send clarification questions; detect and reply to let it start directly
DR/DeepSearch Quota Exhausted	Skip, use fast modes or WebSearch to supplement

Partial failure does not affect output : A report can still be generated if ≥2 streams succeed (note the source scope).