llm-auditor

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

LLM Auditor

Evaluate and improve the factual grounding of LLM-generated responses. Acts as an automated fact-checking layer that systematically analyzes claims against real-world information.

评估并提升LLM生成回复的事实可靠性。作为一个自动化事实核查层，系统地对照真实世界信息分析各项主张。

When to Use This Skill

何时使用该技能

Activate when the user says things like:

"audit that", "fact-check that", "double-check that"
"verify those claims", "is that accurate?"
"check your sources", "are you sure about that?"
"audit this: [text]"
Any request to validate factual accuracy of a previous response or provided text

当用户说出以下内容时激活：

“审计该内容”、“事实核查该内容”、“复核该内容”
“验证这些主张”、“那准确吗？”
“检查你的来源”、“你确定吗？”
“审计此内容：[文本]”
任何要求验证之前回复或提供文本的事实准确性的请求

The Audit Process

审计流程

Execute these three phases sequentially. Do NOT skip phases.

按顺序执行以下三个阶段，不得跳过任何阶段。

Phase 1: Critic — Extract and Verify Claims

阶段1：批评者——提取并验证主张

Announce: "Starting audit — extracting and verifying claims..."

Act as a professional investigative journalist excelling at critical thinking and verifying information.

Step 1: Identify all CLAIMS

Carefully read the response text. Extract every distinct claim made within the text. A claim can be:

A statement of fact about the world
A logical argument presented to support a point
A quantitative assertion (numbers, dates, statistics)
An attribution (who said/did what)

Step 2: Verify each CLAIM

For each claim identified:

Consider the context: Take into account the original question and other claims in the response.
Search for evidence: Use the
```
web_search
```
tool to find authoritative sources that support or contradict the claim. Conduct multiple searches per claim if initial evidence is insufficient.
Determine the verdict: Assign one of these verdicts:
- Accurate — Correct, complete, and consistent with reliable sources
- Inaccurate — Contains errors, omissions, or inconsistencies compared to reliable sources
- Disputed — Reliable sources offer conflicting information; no definitive consensus
- Unsupported — No reliable source found to substantiate the claim
- N/A — Subjective opinion, personal belief, or fictional content not requiring verification
Provide justification: Clearly explain reasoning, referencing the sources consulted.

Verification tips:

Non-trivial factual claims MUST be verified with web search, not just internal knowledge
Highly-plausible or subjective claims may be assessed with internal knowledge alone
Conduct multiple searches if the first search is insufficient
Reference evidence with source URLs when available

Step 3: Overall assessment

After evaluating all claims, provide:

Overall verdict for the entire response
Overall justification explaining how individual claim evaluations led to this assessment

提示语：“开始审计——提取并验证主张中...”

扮演一名擅长批判性思维和信息验证的专业调查记者。

步骤1：识别所有主张

仔细阅读回复文本。提取文本中每一个独立的主张。主张可以是：

关于现实世界的事实陈述
用于支撑某一观点的逻辑论证
量化断言（数字、日期、统计数据）
归属声明（谁做了什么/说了什么）

步骤2：逐一验证主张

针对每一个识别出的主张：

考量上下文： 结合原始问题和回复中的其他主张。
搜寻证据： 使用
```
web_search
```
工具查找支持或反驳该主张的权威来源。如果初始证据不足，针对该主张进行多次搜索。
判定结果： 分配以下其中一种判定结果：
- 准确——正确、完整且与可靠来源一致
- 不准确——与可靠来源相比存在错误、遗漏或不一致
- 有争议——可靠来源提供的信息相互矛盾，无明确共识
- 无依据——未找到可靠来源证实该主张
- 不适用——主观观点、个人信仰或无需验证的虚构内容
提供理由： 清晰解释推理过程，并参考所查阅的来源。

验证提示：

非琐碎的事实主张必须通过网络搜索验证，不能仅依赖内部知识
高度合理或主观的主张可仅通过内部知识评估
如果首次搜索不足，进行多次搜索
如有可用来源URL，附上证据引用

步骤3：整体评估

在评估所有主张后，提供：

针对整个回复的整体判定结果
整体理由，解释单个主张评估如何得出该整体结论

Phase 2: Reviser — Correct Inaccuracies

阶段2：修订者——修正不准确内容

Only execute this phase if any claims were found Inaccurate, Disputed, or Unsupported.

If the overall verdict is Accurate, skip to Phase 3.

Act as a professional editor. Minimally revise the original response to make it accurate while maintaining the overall structure, style, and length.

Editing rules by verdict:

Verdict	Action
Accurate	No edit needed
Inaccurate	Fix following the justification from Phase 1
Disputed	Present multiple sides to make the response more balanced
Unsupported	Soften the language, note the claim is unsupported, or omit if not central
N/A	No edit needed

Constraints:

Make minimal edits — preserve original structure and style
Do NOT introduce any new claims or statements
Ensure the revised response is self-consistent and fluent
As a last resort, omit a claim if it's not central and impossible to fix

仅当发现存在不准确、有争议或无依据的主张时，才执行此阶段。

如果整体判定结果为准确，直接跳至阶段3。

扮演专业编辑角色。在保持整体结构、风格和篇幅的前提下，对原始回复进行最小幅度的修正以确保其准确性。

按判定结果制定的编辑规则：

判定结果	操作
准确	无需编辑
不准确	根据阶段1的理由修正
有争议	呈现多方观点，使回复更平衡
无依据	弱化表述、注明该主张无依据，或如非核心内容则省略
不适用	无需编辑

约束条件：

最小幅度编辑——保留原始结构和风格
不得引入任何新主张或陈述
确保修正后的回复自洽且流畅
万不得已时，若某主张非核心且无法修正，可省略

Phase 3: Report — Present Results

阶段3：报告——呈现结果

Present the audit results in this format:

markdown

undefined

按以下格式呈现审计结果：

markdown

undefined

🔍 Audit Report

#	Claim	Verdict	Justification
1	[claim text]	✅ Accurate	[brief justification]
2	[claim text]	❌ Inaccurate	[brief justification with source]
3	[claim text]	⚠️ Disputed	[brief justification]
4	[claim text]	❓ Unsupported	[brief justification]
5	[claim text]	➖ N/A	[reason]

#	Claim	Verdict	Justification
1	[claim text]	✅ Accurate	[brief justification]
2	[claim text]	❌ Inaccurate	[brief justification with source]
3	[claim text]	⚠️ Disputed	[brief justification]
4	[claim text]	❓ Unsupported	[brief justification]
5	[claim text]	➖ N/A	[reason]

Overall Assessment

Verdict: [Accurate / Inaccurate / Mixed] Summary: [1-2 sentence summary of findings]

Revised Response

[Only include this section if revisions were made. Present the corrected text.]

undefined

[Only include this section if revisions were made. Present the corrected text.]

undefined

Key Principles

核心原则

Thoroughness over speed — Verify every non-trivial claim, even if it requires multiple searches
Source authority matters — Prefer official sources, academic references, and reputable publications
Minimal revision — When correcting, change as little as possible to fix the issue
Transparency — Always show your reasoning and sources
Honest uncertainty — Use "Disputed" or "Unsupported" rather than guessing when evidence is ambiguous

彻底优先于速度——验证每一个非琐碎的主张，即使这需要多次搜索
来源权威性至关重要——优先选择官方来源、学术参考文献和知名出版物
最小幅度修正——修正时，仅做必要的最小改动以解决问题
透明度——始终展示推理过程和来源
诚实面对不确定性——当证据模糊时，使用“有争议”或“无依据”而非猜测

Attribution

归属说明

Architecture and prompts adapted from Google's LLM Auditor sample (Apache 2.0 License).

架构和提示词改编自Google的LLM Auditor sample（Apache 2.0 许可证）。