llm-auditor
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseLLM Auditor
LLM Auditor
Evaluate and improve the factual grounding of LLM-generated responses. Acts as an automated fact-checking layer that systematically analyzes claims against real-world information.
评估并提升LLM生成回复的事实可靠性。作为一个自动化事实核查层,系统地对照真实世界信息分析各项主张。
When to Use This Skill
何时使用该技能
Activate when the user says things like:
- "audit that", "fact-check that", "double-check that"
- "verify those claims", "is that accurate?"
- "check your sources", "are you sure about that?"
- "audit this: [text]"
- Any request to validate factual accuracy of a previous response or provided text
当用户说出以下内容时激活:
- “审计该内容”、“事实核查该内容”、“复核该内容”
- “验证这些主张”、“那准确吗?”
- “检查你的来源”、“你确定吗?”
- “审计此内容:[文本]”
- 任何要求验证之前回复或提供文本的事实准确性的请求
The Audit Process
审计流程
Execute these three phases sequentially. Do NOT skip phases.
按顺序执行以下三个阶段,不得跳过任何阶段。
Phase 1: Critic — Extract and Verify Claims
阶段1:批评者——提取并验证主张
Announce: "Starting audit — extracting and verifying claims..."
Act as a professional investigative journalist excelling at critical thinking and verifying information.
Step 1: Identify all CLAIMS
Carefully read the response text. Extract every distinct claim made within the text. A claim can be:
- A statement of fact about the world
- A logical argument presented to support a point
- A quantitative assertion (numbers, dates, statistics)
- An attribution (who said/did what)
Step 2: Verify each CLAIM
For each claim identified:
- Consider the context: Take into account the original question and other claims in the response.
- Search for evidence: Use the tool to find authoritative sources that support or contradict the claim. Conduct multiple searches per claim if initial evidence is insufficient.
web_search - Determine the verdict: Assign one of these verdicts:
- Accurate — Correct, complete, and consistent with reliable sources
- Inaccurate — Contains errors, omissions, or inconsistencies compared to reliable sources
- Disputed — Reliable sources offer conflicting information; no definitive consensus
- Unsupported — No reliable source found to substantiate the claim
- N/A — Subjective opinion, personal belief, or fictional content not requiring verification
- Provide justification: Clearly explain reasoning, referencing the sources consulted.
Verification tips:
- Non-trivial factual claims MUST be verified with web search, not just internal knowledge
- Highly-plausible or subjective claims may be assessed with internal knowledge alone
- Conduct multiple searches if the first search is insufficient
- Reference evidence with source URLs when available
Step 3: Overall assessment
After evaluating all claims, provide:
- Overall verdict for the entire response
- Overall justification explaining how individual claim evaluations led to this assessment
提示语:“开始审计——提取并验证主张中...”
扮演一名擅长批判性思维和信息验证的专业调查记者。
步骤1:识别所有主张
仔细阅读回复文本。提取文本中每一个独立的主张。主张可以是:
- 关于现实世界的事实陈述
- 用于支撑某一观点的逻辑论证
- 量化断言(数字、日期、统计数据)
- 归属声明(谁做了什么/说了什么)
步骤2:逐一验证主张
针对每一个识别出的主张:
- 考量上下文: 结合原始问题和回复中的其他主张。
- 搜寻证据: 使用工具查找支持或反驳该主张的权威来源。如果初始证据不足,针对该主张进行多次搜索。
web_search - 判定结果: 分配以下其中一种判定结果:
- 准确——正确、完整且与可靠来源一致
- 不准确——与可靠来源相比存在错误、遗漏或不一致
- 有争议——可靠来源提供的信息相互矛盾,无明确共识
- 无依据——未找到可靠来源证实该主张
- 不适用——主观观点、个人信仰或无需验证的虚构内容
- 提供理由: 清晰解释推理过程,并参考所查阅的来源。
验证提示:
- 非琐碎的事实主张必须通过网络搜索验证,不能仅依赖内部知识
- 高度合理或主观的主张可仅通过内部知识评估
- 如果首次搜索不足,进行多次搜索
- 如有可用来源URL,附上证据引用
步骤3:整体评估
在评估所有主张后,提供:
- 针对整个回复的整体判定结果
- 整体理由,解释单个主张评估如何得出该整体结论
Phase 2: Reviser — Correct Inaccuracies
阶段2:修订者——修正不准确内容
Only execute this phase if any claims were found Inaccurate, Disputed, or Unsupported.
If the overall verdict is Accurate, skip to Phase 3.
Act as a professional editor. Minimally revise the original response to make it accurate while maintaining the overall structure, style, and length.
Editing rules by verdict:
| Verdict | Action |
|---|---|
| Accurate | No edit needed |
| Inaccurate | Fix following the justification from Phase 1 |
| Disputed | Present multiple sides to make the response more balanced |
| Unsupported | Soften the language, note the claim is unsupported, or omit if not central |
| N/A | No edit needed |
Constraints:
- Make minimal edits — preserve original structure and style
- Do NOT introduce any new claims or statements
- Ensure the revised response is self-consistent and fluent
- As a last resort, omit a claim if it's not central and impossible to fix
仅当发现存在不准确、有争议或无依据的主张时,才执行此阶段。
如果整体判定结果为准确,直接跳至阶段3。
扮演专业编辑角色。在保持整体结构、风格和篇幅的前提下,对原始回复进行最小幅度的修正以确保其准确性。
按判定结果制定的编辑规则:
| 判定结果 | 操作 |
|---|---|
| 准确 | 无需编辑 |
| 不准确 | 根据阶段1的理由修正 |
| 有争议 | 呈现多方观点,使回复更平衡 |
| 无依据 | 弱化表述、注明该主张无依据,或如非核心内容则省略 |
| 不适用 | 无需编辑 |
约束条件:
- 最小幅度编辑——保留原始结构和风格
- 不得引入任何新主张或陈述
- 确保修正后的回复自洽且流畅
- 万不得已时,若某主张非核心且无法修正,可省略
Phase 3: Report — Present Results
阶段3:报告——呈现结果
Present the audit results in this format:
markdown
undefined按以下格式呈现审计结果:
markdown
undefined🔍 Audit Report
🔍 Audit Report
| # | Claim | Verdict | Justification |
|---|---|---|---|
| 1 | [claim text] | ✅ Accurate | [brief justification] |
| 2 | [claim text] | ❌ Inaccurate | [brief justification with source] |
| 3 | [claim text] | ⚠️ Disputed | [brief justification] |
| 4 | [claim text] | ❓ Unsupported | [brief justification] |
| 5 | [claim text] | ➖ N/A | [reason] |
| # | Claim | Verdict | Justification |
|---|---|---|---|
| 1 | [claim text] | ✅ Accurate | [brief justification] |
| 2 | [claim text] | ❌ Inaccurate | [brief justification with source] |
| 3 | [claim text] | ⚠️ Disputed | [brief justification] |
| 4 | [claim text] | ❓ Unsupported | [brief justification] |
| 5 | [claim text] | ➖ N/A | [reason] |
Overall Assessment
Overall Assessment
Verdict: [Accurate / Inaccurate / Mixed]
Summary: [1-2 sentence summary of findings]
Verdict: [Accurate / Inaccurate / Mixed]
Summary: [1-2 sentence summary of findings]
Revised Response
Revised Response
[Only include this section if revisions were made. Present the corrected text.]
undefined[Only include this section if revisions were made. Present the corrected text.]
undefinedKey Principles
核心原则
- Thoroughness over speed — Verify every non-trivial claim, even if it requires multiple searches
- Source authority matters — Prefer official sources, academic references, and reputable publications
- Minimal revision — When correcting, change as little as possible to fix the issue
- Transparency — Always show your reasoning and sources
- Honest uncertainty — Use "Disputed" or "Unsupported" rather than guessing when evidence is ambiguous
- 彻底优先于速度——验证每一个非琐碎的主张,即使这需要多次搜索
- 来源权威性至关重要——优先选择官方来源、学术参考文献和知名出版物
- 最小幅度修正——修正时,仅做必要的最小改动以解决问题
- 透明度——始终展示推理过程和来源
- 诚实面对不确定性——当证据模糊时,使用“有争议”或“无依据”而非猜测
Attribution
归属说明
Architecture and prompts adapted from Google's LLM Auditor sample (Apache 2.0 License).
架构和提示词改编自Google的LLM Auditor sample(Apache 2.0 许可证)。