citation-verifier
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseCitation Verifier
Citation Verifier(引用验证工具)
Generate and ensure every entry has a traceable verification record in .
citations/ref.bibcitations/verified.jsonlWhen network access is restricted, prefer a “record now, verify later” workflow: keep URLs/titles consistent and leave a clear verification note.
生成文件,并确保每个条目在中都有可追溯的验证记录。
citations/ref.bibcitations/verified.jsonl当网络受限,优先采用“先记录,后验证”的工作流:保持URL和标题一致,并留下清晰的验证说明。
Input
输入
papers/paper_notes.jsonl
papers/paper_notes.jsonl
Outputs
输出
citations/ref.bibcitations/verified.jsonl
citations/ref.bibcitations/verified.jsonl
Workflow (heuristic)
工作流(启发式规则)
- Collect ,
bibkey,title,url,yearfromauthors.papers/paper_notes.jsonl - Write/refresh :
citations/ref.bib- Prefer arXiv-style fields when /
arxiv_idexist (primary_category,eprint,archivePrefix).primaryClass
- Prefer arXiv-style fields when
- Write one verification record per BibTeX entry to with at least:
citations/verified.jsonl- ,
bibkey,title,urldate
- If you cannot verify via network, record a clear field (e.g., “auto-generated; needs manual verification”) and/or request human confirmation depending on your policy.
notes
- 从中提取
papers/paper_notes.jsonl、bibkey、title、url、year信息。authors - 写入/更新:
citations/ref.bib- 当存在/
arxiv_id时,优先使用arXiv风格字段(primary_category、eprint、archivePrefix)。primaryClass
- 当存在
- 为每个BibTeX条目在中写入一条验证记录,至少包含:
citations/verified.jsonl- 、
bibkey、title、urldate
- 若无法通过网络验证,需在字段中留下清晰说明(例如:“自动生成;需手动验证”),或根据策略请求人工确认。
notes
Quality checklist
质量检查清单
- Every BibTeX entry has a corresponding record.
verified.jsonl - No missing /
url/datein verification records.title
- 每个BibTeX条目都有对应的记录。
verified.jsonl - 验证记录中无缺失的/
url/date。title
Offline Mode
离线模式
When network access is restricted, run in offline mode to produce auditable records now, then verify later.
- Generate offline records:
verification_status: offline_generated - Verify later (when network is available):
--verify-only
当网络受限,可运行离线模式,先生成可审计的记录,后续再进行验证。
- 生成离线记录:
verification_status: offline_generated - 后续验证(网络可用时):
--verify-only
verification_status
verification_statusverification_status
状态说明
verification_status- : record was generated without network verification (needs later verification)
offline_generated - : URL/title verified successfully by the script
verified_online - : verification was attempted but failed (network error or title mismatch)
verify_failed - : missing/ambiguous fields (e.g., empty
needs_manual_verification/url)title
- :记录生成时未进行网络验证(需后续验证)
offline_generated - :脚本已成功验证URL/标题
verified_online - :尝试验证但失败(网络错误或标题不匹配)
verify_failed - :字段缺失/模糊(例如:
needs_manual_verification/url为空)title
Script
脚本
Quick Start
快速开始
python .codex/skills/citation-verifier/scripts/run.py --help- Offline (record now, verify later):
python .codex/skills/citation-verifier/scripts/run.py --workspace <workspace_dir> --offline
python .codex/skills/citation-verifier/scripts/run.py --help- 离线模式(先记录后验证):
python .codex/skills/citation-verifier/scripts/run.py --workspace <workspace_dir> --offline
All Options
所有选项
- : do not attempt network verification; write
--offlineverification_status=offline_generated - : verify existing
--verify-onlyrecords (does not rewrite BibTeX)citations/verified.jsonl - : stored in
--verification-note <text>citations/verified.jsonlnotes
- :不尝试网络验证;写入
--offlineverification_status=offline_generated - :验证已有的
--verify-only记录(不会重写BibTeX)citations/verified.jsonl - :将说明存储在
--verification-note <text>的citations/verified.jsonl字段中notes
Examples
示例
- Generate BibTeX + offline verification records:
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --offline --verification-note "auto-generated; needs manual verification"
- Later, verify-only (when network is available):
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --verify-only
- 生成BibTeX + 离线验证记录:
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --offline --verification-note "auto-generated; needs manual verification"
- 后续网络可用时,仅执行验证:
python .codex/skills/citation-verifier/scripts/run.py --workspace <ws> --verify-only
Notes
注意事项
- Minimal requirement for every verification record: ,
url,date.title - The script sanitizes stray/unbalanced in titles to keep
{}parsing robust.bibtex - The script escapes LaTeX special chars in text fields () and rewrites superscript patterns like
& % $ # _orX^NasX$^N$to keep LaTeX builds stable.X\textsuperscript{N} - URLs are kept raw in BibTeX fields (BibTeX styles wrap them with
url);\url{...}uses@misc.howpublished=\url{...} - In offline mode, records are not truly verified; treat as a to-do for human/network verification.
offline_generated
- 每条验证记录的最低要求:包含、
url、date。title - 脚本会清理标题中多余/不匹配的,确保BibTeX解析的稳定性。
{} - 脚本会转义文本字段中的LaTeX特殊字符(),并将
& % $ # _或X^N这类上标格式重写为X$^N$,保证LaTeX编译稳定。X\textsuperscript{N} - URL在BibTeX的字段中保持原始格式(BibTeX样式会用
url包裹);\url{...}类型会使用@misc。howpublished=\url{...} - 离线模式下生成的记录并未真正经过验证;需将视为待完成的人工/网络验证任务。
offline_generated
Troubleshooting
故障排除
Common Issues
常见问题
Issue: Missing bibkey
/ missing url
in notes
bibkeyurl问题:笔记中缺失bibkey
/ 缺失url
bibkeyurlSymptom:
- is missing entries, or
citations/ref.bibhas emptyverified.jsonl.url/title
Causes:
- lacks
papers/paper_notes.jsonl/bibkeyfields.url
Solutions:
- Ensure each core paper note has a stable and a canonical
bibkey.url - Rerun citation generation after fixing notes.
症状:
- 缺少条目,或
citations/ref.bib中的verified.jsonl为空。url/title
原因:
- 缺少
papers/paper_notes.jsonl/bibkey字段。url
解决方案:
- 确保每篇核心论文笔记都有稳定的和标准的
bibkey。url - 修复笔记后重新运行引用生成脚本。
Issue: verification_status=offline_generated
verification_status=offline_generated问题:verification_status=offline_generated
verification_status=offline_generatedSymptom:
- Records exist but are not truly verified.
Causes:
- was used, or network verification was unavailable.
--offline
Solutions:
- When network is available, run to upgrade records.
--verify-only - Or manually verify and update with notes.
citations/verified.jsonl
症状:
- 记录已存在但未经过真正验证。
原因:
- 使用了参数,或网络验证不可用。
--offline
解决方案:
- 网络可用时,运行参数升级记录状态。
--verify-only - 或手动验证并更新中的说明。
citations/verified.jsonl
Recovery Checklist
恢复检查清单
- Every BibTeX entry has a matching record.
citations/verified.jsonl - Verification records include ,
url,date.title
- 每个BibTeX条目都有对应的记录。
citations/verified.jsonl - 验证记录包含、
url、date。title