Zero-context verification that every bibliographic entry in the paper is real, correctly attributed, and used in a context the cited paper actually supports. Uses a fresh cross-model reviewer with web/DBLP/arXiv lookup to catch hallucinated authors, wrong years, fabricated venues, version mismatches, and wrong-context citations (the citation is present, but the cited paper does not establish the claim). Use when the user says "审查引用", "check citations", "citation audit", "verify references", "引用核对", or before submission to ensure bibliography integrity.
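Checking for wrong-context citations requires knowing where each key is used. The following is a minimal sketch, not the skill's actual implementation, of collecting `(key, file, line, surrounding_sentence)` tuples from `*.tex` files. The function name `extract_cite_contexts`, the regular expression, and the choice to use the whole source line as the "surrounding sentence" are all assumptions made for illustration.

```python
import re
from pathlib import Path

# ASSUMPTION: covers \cite, \citep, \citet, starred forms, and optional [..] arguments.
CITE_RE = re.compile(r"\\cite[a-zA-Z]*\*?(?:\[[^\]]*\])*\{([^}]*)\}")


def extract_cite_contexts(tex_root):
    """Return a list of (key, file, line, surrounding_sentence) tuples.

    Simplification: the stripped source line stands in for the surrounding
    sentence, and lines that start with a LaTeX comment marker are skipped.
    """
    contexts = []
    for tex_file in sorted(Path(tex_root).rglob("*.tex")):
        lines = tex_file.read_text(encoding="utf-8").splitlines()
        for lineno, line in enumerate(lines, start=1):
            if line.lstrip().startswith("%"):
                continue  # whole-line comment, not a real citation
            for match in CITE_RE.finditer(line):
                # \cite{key1,key2,...} expands to one tuple per key
                for key in match.group(1).split(","):
                    key = key.strip()
                    if key:
                        contexts.append((key, str(tex_file), lineno, line.strip()))
    return contexts
```

Each tuple can then be pasted into the reviewer prompt under the heading for where the entry is cited.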
npx skill4agent add wanshuiyin/auto-claude-code-research-in-sleep citation-audit

`\cite{...}`, `experiment-audit`, `result-to-claim`, `paper-claim-audit`, `paper-write`, `paper-claim-audit`, `paper-compile`, `\cite`, `gpt-5.4`, `fresh`, `codex-reply`, `CITATION_AUDIT.md`, `CITATION_AUDIT.json`, `references.bib`, `paper.bib`, `*.tex`, `\cite{...}`, `sec/`, `sections/`, `\cite{key1,key2,...}`, `(key, file, line, surrounding_sentence)`, `paper/.aris/citation-audit/contexts.txt`, `.aris/citation-audit/contexts.txt`, `audited_input_hashes`, `/tmp`, `mcp__codex__codex`, `codex-reply`

mcp__codex__codex:
  model: gpt-5.4
  config: {"model_reasoning_effort": "xhigh"}
  sandbox: read-only
  prompt: |
    You are auditing a bibliographic entry. Use web/DBLP/arXiv search.

    ## Bib entry
    @article{key2024example,
      author = {...}, title = {...}, journal = {...}, year = {...}, ...
    }

    ## Where this entry is cited in the paper
    [paste extracted contexts]

    For this entry, verify:
    1. EXISTENCE: does this paper exist at the claimed arXiv ID / DOI / venue?
       Output: YES / NO / UNCERTAIN, with the verifying URL.
    2. METADATA: are author names, year, venue, title correct?
       For each, output: correct / wrong: should be ... / typo: ...
    3. CONTEXT: for each use, does the cited paper actually support the surrounding claim?
       Output per-use: SUPPORTS / WEAK / WRONG, with one-sentence reasoning.

    VERDICT: KEEP / FIX / REPLACE / REMOVE
    - KEEP: entry is clean, all uses are appropriate
    - FIX: metadata needs correction; uses are appropriate
    - REPLACE: cite is wrong-context, find a different paper that actually supports the claim
    - REMOVE: entry is hallucinated or unsupportable

    Be honest. If you cannot verify online, say UNCERTAIN; do not guess.

`.aris/traces/citation-audit/<date>_runNN/<key>.md`, `CITATION_AUDIT.json`, `details.per_entry`, `entries`, `verdict`, `summary`, `details`

"details": {
  "total_entries": 29,
  "counts": { "KEEP": 11, "FIX": 14, "REPLACE": 3, "REMOVE": 1 },
  "per_entry": [
    {
      "key": "lu2024aiscientist",
      "verdict": "KEEP",
      "axis_failures": [],
      "uses": [
        {"file": "sections/1.intro.tex", "line": 11, "verdict": "SUPPORTS"},
        {"file": "sections/6.related.tex", "line": 8, "verdict": "SUPPORTS"}
      ]
    },
    {
      "key": "madaan2023selfrefine",
      "verdict": "FIX",
      "axis_failures": ["CONTEXT"],
      "uses": [
        {"file": "sections/2.overview.tex", "line": 42, "verdict": "WRONG",
         "note": "Self-Refine demonstrates iterative improvement, not correlated errors"},
        {"file": "sections/6.related.tex", "line": 13, "verdict": "SUPPORTS"}
      ]
    }
  ]
}

`audit_skill`, `verdict`, `reason_code`, `summary`, `audited_input_hashes`, `trace_path`, `thread_id`, `reviewer_model`, `reviewer_reasoning`, `generated_at`, `details`, `CITATION_AUDIT.md`

# Citation Audit Report
**Date**: 2026-04-19
**Bib file**: references.bib
**Total entries**: 29
## Summary
| Verdict | Count |
|---------|------|
| KEEP | 11 |
| FIX | 14 |
| REPLACE | 3 |
| REMOVE | 1 |
## Priority Fixes (CRITICAL — apply before submission)
### REMOVE: hidden2025aiscientistpitfalls
- Author listed as "Anonymous" — actual authors are Luo, Kasirzadeh, Shah
- Title is incomplete
- ACTION: Replace key with `luo2025aiscientistpitfalls`, update authors and title
### REPLACE-CONTEXT: madaan2023selfrefine in sec/2.overview.tex:42
- Cited to support: "single-model self-refinement can produce correlated errors"
- Self-Refine paper actually demonstrates iterative IMPROVEMENT, not correlated errors
- ACTION: Rewrite the sentence; cite Self-Refine for "self-feedback loop" framing instead
[... continues for each entry ...]
## All-Clean Entries (no action needed)
[list of KEEP keys]

Fix [key]?
Change: <description of change>
Files affected: references.bib + sec/X.tex:Y
[Apply / Skip / Defer]

`AUTO_APPLY = true`, `latexmk -C && latexmk -pdf -interaction=nonstopmode main.tex`, `Citation undefined`, `Reference undefined`, `CITATION_AUDIT.json`, `paper-writing`, `tools/verify_paper_audits.sh`, `assurance`

| Skill | What it audits | What it catches |
|---|---|---|
| `experiment-audit` | Evaluation code | Fake ground truth, self-normalized scores, phantom results |
| `result-to-claim` | Result-to-claim mapping | Claims unsupported by evidence |
| `paper-claim-audit` | Numerical claims in manuscript | Number inflation, best-seed cherry-pick, config mismatch |
| `citation-audit` | Bibliographic entries | Hallucinated refs, wrong-context citations, metadata errors |
and others

`mcp__codex__codex`, `shared-references/review-tracing.md`, `tools/save_trace.sh`, `.aris/traces/citation-audit/<date>_run<NN>/`, `--- trace:full`, `CITATION_AUDIT.md`, `CITATION_AUDIT.json`, `.aris/traces/citation-audit/<date>_runNN/`, `references.bib`, `sec/*.tex`, `paper/CITATION_AUDIT.json`, `.bib`, `\cite{...}`, `NOT_APPLICABLE`, `paper-writing`, `tools/verify_paper_audits.sh`, `shared-references/assurance-contract.md`

{
  "audit_skill": "citation-audit",
  "verdict": "PASS | WARN | FAIL | NOT_APPLICABLE | BLOCKED | ERROR",
  "reason_code": "all_entries_keep | metadata_drift | wrong_context | hallucinated | ...",
  "summary": "One-line human-readable verdict summary.",
  "audited_input_hashes": {
    "references.bib": "sha256:...",
    "main.tex": "sha256:...",
    "sections/3.related.tex": "sha256:..."
  },
  "trace_path": ".aris/traces/citation-audit/<date>_run<NN>/",
  "thread_id": "<codex mcp thread id>",
  "reviewer_model": "gpt-5.4",
  "reviewer_reasoning": "xhigh",
  "generated_at": "<UTC ISO-8601>",
  "details": {
    "total_entries": <int>,
    "per_entry": [
      {
        "key": "madaan2023selfrefine",
        "verdict": "KEEP | FIX | REPLACE | REMOVE",
        "axis_failures": [ "CONTEXT" | "METADATA" | "EXISTENCE" ],
        "note": "..."
      },
      ...
    ]
  }
}

`audited_input_hashes`, `.bib`, `main.tex`, `sections/*.tex`, `/tmp`, `paper/.aris/`, `tools/verify_paper_audits.sh`, `paper/`, `paper/`, `paper/...`

| Input state | Verdict | |
|---|---|---|
| No | | |
| | | |
| Every entry KEEP, all three axes green | | |
| Only FIX verdicts (metadata drift, no context errors) | | |
| Any REPLACE or REMOVE (wrong-context or hallucinated entry) | | |
| Web lookups timed out / reviewer invocation failed | | |
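The input states in the table can be read as a small decision procedure over the per-entry verdicts. The sketch below is one hedged reading of it. The verdict strings come from the `verdict` field of the JSON schema, but which string belongs to which row is an assumption here, as is the hypothetical function name `overall_verdict`. In particular, mapping a failed lookup to `ERROR` rather than `BLOCKED` is a guess.

```python
from collections import Counter

ENTRY_VERDICTS = {"KEEP", "FIX", "REPLACE", "REMOVE"}


def overall_verdict(entry_verdicts, lookup_failed=False):
    """Collapse per-entry verdicts into one overall audit verdict.

    ASSUMPTION: the row-to-verdict mapping below is inferred, not confirmed
    by the source; adjust it to match the actual assurance contract.
    """
    if lookup_failed:
        return "ERROR"  # assumed; the source also lists BLOCKED
    if not entry_verdicts:
        return "NOT_APPLICABLE"  # assumed: nothing to audit
    counts = Counter(entry_verdicts)
    unknown = set(counts) - ENTRY_VERDICTS
    if unknown:
        raise ValueError(f"unknown per-entry verdicts: {sorted(unknown)}")
    if counts["REPLACE"] or counts["REMOVE"]:
        return "FAIL"  # any wrong-context or hallucinated entry
    if counts["FIX"]:
        return "WARN"  # metadata drift only
    return "PASS"  # every entry is KEEP
```

Checking `REPLACE` and `REMOVE` before `FIX` keeps the most severe finding from being masked by milder ones.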
`mcp__codex__codex`, `codex-reply`, `shared-references/reviewer-independence.md`, `paper-writing`, `assurance`, `/paper-claim-audit`, `/experiment-audit`, `/result-to-claim`, `shared-references/citation-discipline.md`, `shared-references/reviewer-independence.md`
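The `audited_input_hashes` field in the JSON schema records a `sha256:`-prefixed digest for each audited input file. A minimal sketch of producing that mapping with Python's standard `hashlib` follows. The helper names `sha256_of` and `build_input_hashes` are hypothetical, and hashing the raw file bytes is an assumption, since the source does not say how the digest is computed.

```python
import hashlib
from pathlib import Path


def sha256_of(path):
    """Return 'sha256:<hex>' for the raw bytes of one file."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return "sha256:" + digest.hexdigest()


def build_input_hashes(paper_dir, relative_paths):
    """Map each path (relative to paper_dir) to its digest string.

    ASSUMPTION: keys are paper-relative paths such as 'references.bib'
    or 'sections/3.related.tex', mirroring the schema example.
    """
    root = Path(paper_dir)
    return {rel: sha256_of(root / rel) for rel in relative_paths}
```

A verifier could recompute these digests later and treat any mismatch as a sign that the audit is stale relative to the current files.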