arxiv-preflight
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChinesearXiv Preflight Check
arXiv提交前预检
Pipeline position: Phase 3a (submission readiness). Runs after Phase 2
fixes and PDF recompilation. Gate for arxiv-figures and arxiv-package.
See for full execution order.
/manuscript-pipeline流水线阶段: 第3a阶段(提交就绪)。在第2阶段修复和PDF重新编译后运行,是arxiv-figures和arxiv-package的前置检查环节。完整执行顺序请查看。
/manuscript-pipelinePurpose
目标
Systematically validate a TeX/LaTeX project (or PDF submission) against all
arXiv submission requirements, producing a structured pass/fail report with
specific fixes for every violation found.
Companion skills:
- — audits prose quality, structure, and claims
manuscript-review - — audits computational reproducibility
manuscript-provenance - — optimizes figures for arXiv
arxiv-figures - — packages the submission tarball
arxiv-package
This skill focuses exclusively on arXiv technical compliance — not prose
quality or scientific content.
系统地验证TeX/LaTeX项目(或PDF提交件)是否符合所有arXiv提交要求,生成结构化的通过/失败报告,并针对发现的每一项违规提供具体修复方案。
配套工具:
- — 审核文稿的文字质量、结构和论点
manuscript-review - — 审核计算可复现性
manuscript-provenance - — 为arXiv优化图表
arxiv-figures - — 打包提交用的tarball文件
arxiv-package
本工具仅专注于arXiv技术合规性审核——不涉及文稿质量或科学内容。
Workflow
工作流程
1. Ingest
1. 导入
Identify the submission directory. Locate:
- Main file(s)
.tex - All included files (figures, ,
.bbl, style files, subdirectories).bst - Any file
00README.XXX - Any directory
anc/ - Compiled PDF (if available)
Read references/guidelines.md识别提交目录,定位:
- 主文件
.tex - 所有包含的文件(图表、、
.bbl、样式文件、子目录).bst - 任何文件
00README.XXX - 任何目录
anc/ - 已编译的PDF(如果存在)
Read references/guidelines.md2. Validation Passes
2. 验证环节
Execute all passes. For each check:
- PASS — requirement met
- FAIL — violation found (document exact file, line, specific fix)
- WARN — advisory (not a blocker but may cause issues)
- N/A — not applicable
Pass 1 — File Organization
- No absolute file paths in any file (
.tex,\input,\includegraphics,\include)\bibliography - No spaces or special characters (,
&,\) in filenames: - No auxiliary files that should be excluded: ,
.aux,.log,.toc,.lot,.lof,.dvi,.ps(except figure PDFs).pdf - Required files present: if using BibTeX/BibLaTeX,
.bblif using makeindex,.ind/.glsif using glossary/nomenclature.nls - No hidden files (starting with ) except
.-related configs.tex - No journal templates or referee letters included
- Subdirectory structure: no calls into subdirectories (use
\include{}instead)\input{} - If multiple files with
.tex: verify\documentclassdeclares00README.XXXtoplevelfile
Pass 2 — TeX/LaTeX Compliance
- Processor compatibility: figure formats match processor
- DVI mode: only /
.psfigures.eps - PDFLaTeX: only /
.pdf/.pngfigures.jpg - No mixed formats without conditional compilation
- DVI mode: only
- No package usage (use
psfig)graphicx - No macro in date fields
\today - No embedded JavaScript
- testing uses
\pdfoutputpackage, notifpdf\ifx\pdfoutput\undefined - No shell-escape dependent packages without workarounds (→
minted)frozencache=true - No package for external document references
xr - No double-spaced "referee" mode formatting
- Caption calls use
\cite\protect\cite - not used for subdirectory files
\include - BibLaTeX format version compatible with target TeX Live (3.3 for TL2025)
.bbl - All custom/non-standard style files included in submission
- No packages outside TeX Live distribution (check against common non-TL packages)
Pass 3 — Figure Validation
- All referenced figures exist at specified paths
- Figure formats match processor requirements
- No figures reference absolute paths
- No embedded animations, JavaScript, or interactive elements
- PNG files: check for oversized images (>34 Megapixel warning)
- Reasonable file sizes (flag individual figures >5MB, total figures >25MB)
- Alt text present in calls (advisory — accessibility best practice)
\includegraphics - uses
\includegraphicspackage, not deprecated alternativesgraphicx
Pass 4 — Bibliography Validation
- filename matches corresponding
.bblfilename.tex - If included:
.bibalso included (arXiv can process.bblbut.bibis safer).bbl - arXiv identifiers in references use correct format ()
YYMM.NNNNN - No extraneous formatting within e-print identifiers
- BibLaTeX format version check (examine
.bblversion in\RequirePackage).bbl
Pass 5 — Metadata Compliance (check front matter)
.tex- Title: no all-uppercase, no raw Unicode, cryptic macros expanded
- Authors: proper name format, no honorifics, no truncation
et al. - Abstract: no leading "Abstract" text, within 1920 character limit, no leading whitespace on lines
- Abstract: opaque TeX macros expanded, no formatting commands (,
\em)\it - If comments metadata present: page count and figure count included (advisory)
Pass 6 — PDF Validation (if compiled PDF available)
- All fonts embedded (check with or equivalent)
pdffonts - No Type 3 (bitmap) fonts
- Machine readable (not scanned/bitmapped)
- No embedded JavaScript
- No security restrictions preventing text extraction
- Reasonable file size
Pass 7 — 00README.XXX Validation (if present)
- Valid directive syntax (each line: or standalone directive)
filename directive - Referenced files exist
- DVI-related directives use correct filename (no extension)
.tex - No conflicting directives
Pass 8 — Ancillary Files (if directory present)
anc/- No files in
.texdirectoryanc/ - No PDFs with embedded JavaScript
- No internal references to directory from
anc/source.tex - Reasonable total size
Pass 9 — Common Error Scan
- at top of PS/EPS files (not at end)
%%BoundingBox - No conflicts between
\Bbbkandnewtxmathamssymb - No ambiguous double subscript/superscript (→
a_x_y)a_{x}_y - No concatenated source files
- No Scientific Workplace files
.rap - No modified versions of standard style files (,
epsf.sty)epsfig.sty - with complex section names:
hyperrefor proper PDF string handlingbookmarks=false
执行所有检查项。对于每一项检查:
- 通过 — 符合要求
- 失败 — 发现违规(记录具体文件、行号和修复方案)
- 警告 — 建议项(不阻止提交但可能引发问题)
- 不适用 — 不相关
检查1:文件组织
- 所有文件中不得使用绝对文件路径(
.tex、\input、\includegraphics、\include命令)\bibliography - 文件名中不得包含空格或特殊字符(、
&、\): - 不得包含应排除的辅助文件:、
.aux、.log、.toc、.lot、.lof、.dvi、.ps(图表PDF除外).pdf - 必须包含所需文件:使用BibTeX/BibLaTeX时需包含,使用makeindex时需包含
.bbl,使用术语表/命名法时需包含.ind/.gls.nls - 不得包含隐藏文件(以开头),除非是TeX相关配置文件
. - 不得包含期刊模板或审稿意见
- 子目录结构:不得使用调用子目录中的文件(改用
\include{})\input{} - 若存在多个带的
\documentclass文件:需验证.tex中是否声明了00README.XXXtoplevelfile
检查2:TeX/LaTeX合规性
- 处理器兼容性:图表格式与处理器匹配
- DVI模式:仅支持/
.ps格式图表.eps - PDFLaTeX:仅支持/
.pdf/.png格式图表.jpg - 未使用条件编译时不得混合格式
- DVI模式:仅支持
- 不得使用包(改用
psfig)graphicx - 日期字段中不得使用宏
\today - 不得嵌入JavaScript
- 检测需使用
\pdfoutput包,而非ifpdf\ifx\pdfoutput\undefined - 依赖shell-escape的包需提供替代方案(如需设置
minted)frozencache=true - 不得使用包引用外部文档
xr - 不得使用双倍行距的“审稿”模式格式
- 标题中的调用需使用
\cite\protect\cite - 不得对目录文件使用
\include - BibLaTeX 格式版本需与目标TeX Live兼容(TL2025对应3.3版本)
.bbl - 所有自定义/非标准样式文件需包含在提交件中
- 不得使用TeX Live发行版之外的包(对照常见非TL包检查)
检查3:图表验证
- 所有引用的图表均存在于指定路径
- 图表格式符合处理器要求
- 图表不得引用绝对路径
- 不得嵌入动画、JavaScript或交互元素
- PNG文件:检查是否存在超大图片(超过34兆像素时发出警告)
- 文件大小合理(标记单个图表超过5MB、总图表超过25MB的情况)
- 调用中包含替代文本(建议项——无障碍最佳实践)
\includegraphics - 需使用
\includegraphics包,而非已废弃的替代方案graphicx
检查4:参考文献验证
- 文件名对应
.bbl文件名.tex - 若包含文件:需同时包含
.bib文件(arXiv可处理.bbl但.bib更安全).bbl - 参考文献中的arXiv标识符格式正确()
YYMM.NNNNN - 电子预印本标识符中不得包含多余格式
- 检查BibLaTeX 格式版本(查看
.bbl中的.bbl版本)\RequirePackage
检查5:元数据合规性(检查前置内容)
.tex- 标题:不得全大写,不得使用原始Unicode,需展开晦涩宏
- 作者:格式规范,不得包含尊称,不得使用截断
et al. - 摘要:不得包含开头的“Abstract”文本,字符数不超过1920,行首不得有空白
- 摘要:需展开不透明TeX宏,不得包含格式命令(、
\em)\it - 若存在注释元数据:需包含页数和图表数(建议项)
检查6:PDF验证(若存在已编译PDF)
- 所有字体均已嵌入(使用或等效工具检查)
pdffonts - 不得使用Type 3(位图)字体
- 机器可读(非扫描/位图文件)
- 不得嵌入JavaScript
- 不得设置阻止文本提取的安全限制
- 文件大小合理
检查7:00README.XXX验证(若存在)
- 指令语法有效(每行格式:或独立指令)
文件名 指令 - 引用的文件均存在
- DVI相关指令使用正确文件名(无扩展名)
.tex - 无冲突指令
检查8:辅助文件(若存在目录)
anc/- 目录中不得包含
anc/文件.tex - 不得包含嵌入JavaScript的PDF
- 源文件中不得引用
.tex目录anc/ - 总大小合理
检查9:常见错误扫描
- PS/EPS文件的需位于文件顶部(而非末尾)
%%BoundingBox - 与
newtxmath之间不得存在amssymb冲突\Bbbk - 不得存在模糊的双重下标/上标(如需改为
a_x_y)a_{x}_y - 不得包含拼接的源文件
- 不得包含Scientific Workplace 文件
.rap - 不得包含标准样式文件的修改版本(、
epsf.sty)epsfig.sty - 包含复杂章节名称的需设置
hyperref或正确处理PDF字符串bookmarks=false
3. Generate Report
3. 生成报告
Produce a structured report:
markdown
undefined生成结构化报告:
markdown
undefinedarXiv Preflight Report
arXiv提交前预检报告
Project: [directory name]
Date: [date]
Processor: [detected processor]
Verdict: [READY / NEEDS FIXES / BLOCKED]
项目: [目录名称]
日期: [日期]
处理器: [检测到的处理器]
结论: [就绪 / 需要修复 / 阻塞]
Summary
摘要
| Category | Pass | Fail | Warn | N/A |
|---|---|---|---|---|
| File Organization | ||||
| TeX Compliance | ||||
| Figures | ||||
| Bibliography | ||||
| Metadata | ||||
| 00README | ||||
| Ancillary Files | ||||
| Common Errors |
| 分类 | 通过 | 失败 | 警告 | 不适用 |
|---|---|---|---|---|
| 文件组织 | ||||
| TeX合规性 | ||||
| 图表 | ||||
| 参考文献 | ||||
| 元数据 | ||||
| 00README | ||||
| 辅助文件 | ||||
| 常见错误 |
Blocking Issues (FAIL)
阻塞问题(失败)
[Each with file, line number, specific violation, exact fix]
[每项包含文件、行号、具体违规内容、精准修复方案]
Warnings (WARN)
警告项(警告)
[Advisory items that may cause processing issues]
[可能引发处理问题的建议项]
All Checks
所有检查项
[Full pass/fail/warn/N/A status for every checkpoint]
undefined[所有检查点的完整通过/失败/警告/不适用状态]
undefined4. Output
4. 输出
Save report as in the project directory.
arxiv-preflight-report.mdPresent verdict and blocking issue count. If READY, confirm submission can proceed.
If NEEDS FIXES, list the specific fixes in priority order.
将报告保存为到项目目录中。
arxiv-preflight-report.md展示审核结论和阻塞问题数量。如果状态为就绪,确认可以提交。如果状态为需要修复,按优先级列出具体修复项。
Core Principles
核心原则
- Binary compliance. arXiv requirements are not suggestions — FAIL means the submission will be rejected or processing will break. WARN means it may cause issues.
- Exact fixes. Every FAIL includes the specific command, line, or file change needed.
- No false positives. Only flag violations against documented arXiv requirements. Do not impose style preferences or best practices as failures.
- Processor-aware. All checks account for the detected TeX processor. PDFLaTeX rules do not apply to DVI-mode submissions and vice versa.
- 二元合规性:arXiv的要求并非建议——失败意味着提交将被拒绝或处理失败,警告意味着可能引发问题。
- 精准修复:每一项失败都包含所需的具体命令、行号或文件修改方案。
- 无误报:仅标记违反arXiv文档化要求的问题,不将风格偏好或最佳实践列为失败项。
- 处理器感知:所有检查都考虑检测到的TeX处理器,PDFLaTeX规则不适用于DVI模式提交,反之亦然。