document-scanning

Document Scanning

Supported File Types

| Extension | Type | Sub-Agent |
|-----------|------|-----------|
| .docx | Word document | word-accessibility |
| .xlsx | Excel workbook | excel-accessibility |
| .pptx | PowerPoint presentation | powerpoint-accessibility |
| .pdf | PDF document | pdf-accessibility |
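
The extension-to-sub-agent mapping above can be sketched as a small lookup. This is a minimal Python illustration, not part of the original tooling; the names `SUB_AGENTS` and `sub_agent_for` are hypothetical, and case-insensitive matching of the extension is an assumption.

```python
from pathlib import PurePath
from typing import Optional

# Extension -> sub-agent name, copied from the table above.
SUB_AGENTS = {
    ".docx": "word-accessibility",
    ".xlsx": "excel-accessibility",
    ".pptx": "powerpoint-accessibility",
    ".pdf": "pdf-accessibility",
}


def sub_agent_for(path: str) -> Optional[str]:
    """Return the sub-agent for a file, or None if the type is unsupported."""
    # Assumption: REPORT.DOCX should route the same way as report.docx.
    return SUB_AGENTS.get(PurePath(path).suffix.lower())
```

Returning `None` for unsupported types lets the caller decide whether to skip the file or report it.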

File Discovery Commands

PowerShell (Windows)

Non-recursive scan

Get-ChildItem -Path "<folder>\*" -File -Include *.docx,*.xlsx,*.pptx,*.pdf

Without -Recurse, -Include only takes effect when the path ends in a wildcard, hence the trailing \*.

Recursive scan

Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf -Recurse | Where-Object { $_.Name -notlike '~$*' -and $_.Name -notlike '*.tmp' -and $_.Name -notlike '*.bak' } | Where-Object { $_.FullName -notmatch '[\\/](\.git|node_modules|__pycache__|\.vscode)[\\/]' }

Bash (macOS)

Non-recursive scan

find "<folder>" -maxdepth 1 -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) ! -name '~$*'

Recursive scan

find "<folder>" -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) \
  ! -name '~$*' ! -name "*.tmp" ! -name "*.bak" \
  ! -path "*/.git/*" ! -path "*/node_modules/*" ! -path "*/__pycache__/*" ! -path "*/.vscode/*"

Delta Detection

Git-based

Files changed in the most recent commit

git diff --name-only HEAD~1 HEAD -- '*.docx' '*.xlsx' '*.pptx' '*.pdf'

Files changed since a specific tag

git diff --name-only <tag> HEAD -- '*.docx' '*.xlsx' '*.pptx' '*.pdf'

Files changed in the last N days

git log --since="N days ago" --name-only --diff-filter=ACMR --pretty="" -- '*.docx' '*.xlsx' '*.pptx' '*.pdf' | sort -u
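
When the Git commands above are driven from a script, the quoting of the `*.docx`-style pathspecs is easy to get wrong. The following is a minimal Python sketch, assuming `git` is on the PATH; the helper names `git_diff_args`, `parse_name_only`, and `changed_documents` are hypothetical and not part of the original tooling.

```python
import subprocess
from typing import List

# Pathspecs for the supported document types.
DOC_PATHSPECS = ["*.docx", "*.xlsx", "*.pptx", "*.pdf"]


def git_diff_args(base: str, head: str = "HEAD") -> List[str]:
    """Build the argv for `git diff --name-only <base> <head> -- <pathspecs>`."""
    return ["git", "diff", "--name-only", base, head, "--", *DOC_PATHSPECS]


def parse_name_only(output: str) -> List[str]:
    """Drop blank lines and duplicates, then sort (similar to `sort -u`)."""
    return sorted({line.strip() for line in output.splitlines() if line.strip()})


def changed_documents(repo: str, base: str, head: str = "HEAD") -> List[str]:
    """Run git inside `repo` and return the changed document paths."""
    result = subprocess.run(
        git_diff_args(base, head),
        cwd=repo, capture_output=True, text=True, check=True,
    )
    return parse_name_only(result.stdout)
```

Passing the arguments as a list means no shell is involved, so the `*` patterns reach Git unexpanded, which is what the single quotes achieve in the shell versions.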

Timestamp-based (PowerShell)

Files modified since a specific date

Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf -Recurse | Where-Object { $_.LastWriteTime -gt [datetime]"2025-01-01" }
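
The timestamp check is only given for PowerShell. As a rough cross-platform counterpart, here is a minimal Python sketch; the function name `modified_since` is hypothetical, and it compares file modification times against a cutoff that is interpreted as local time when the `datetime` is naive.

```python
from datetime import datetime
from pathlib import Path
from typing import List

DOC_EXTENSIONS = {".docx", ".xlsx", ".pptx", ".pdf"}


def modified_since(folder: str, cutoff: datetime) -> List[Path]:
    """Return supported documents under `folder` modified after `cutoff`."""
    cutoff_ts = cutoff.timestamp()  # naive datetimes are treated as local time
    hits = []
    for path in Path(folder).rglob("*"):
        if (
            path.is_file()
            and path.suffix.lower() in DOC_EXTENSIONS
            and path.stat().st_mtime > cutoff_ts
        ):
            hits.append(path)
    return sorted(hits)
```

For example, `modified_since("<folder>", datetime(2025, 1, 1))` mirrors the PowerShell command above. This sketch does not apply the skip patterns, so combine it with those rules before scanning.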

Files to Skip

Always exclude these patterns during scanning:
  • ~$* - Office lock/temp files (created when a document is open)
  • *.tmp - Temporary files
  • *.bak - Backup files
  • Files inside .git/, node_modules/, .vscode/, __pycache__/ directories
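
The skip rules above can be combined with the extension filter in one directory walk. This is a minimal Python sketch under the assumption that extension matching is case-insensitive; `scan` and `should_skip` are hypothetical names, not part of the original tooling.

```python
import fnmatch
import os
from typing import Iterator

DOC_EXTENSIONS = {".docx", ".xlsx", ".pptx", ".pdf"}
SKIP_NAME_PATTERNS = ["~$*", "*.tmp", "*.bak"]
SKIP_DIRS = {".git", "node_modules", ".vscode", "__pycache__"}


def should_skip(name: str) -> bool:
    """True if the file name matches one of the skip patterns."""
    return any(fnmatch.fnmatch(name, pat) for pat in SKIP_NAME_PATTERNS)


def scan(folder: str, recursive: bool = True) -> Iterator[str]:
    """Yield supported documents under `folder`, honoring the skip rules."""
    for root, dirs, files in os.walk(folder):
        # Prune in place so os.walk never descends into skipped directories;
        # an empty list stops the walk after the top level.
        dirs[:] = [d for d in dirs if d not in SKIP_DIRS] if recursive else []
        for name in sorted(files):
            ext = os.path.splitext(name)[1].lower()
            if ext in DOC_EXTENSIONS and not should_skip(name):
                yield os.path.join(root, name)
```

The `~$*` check does the real work here, since an Office lock file such as `~$report.docx` keeps the document's extension. The `*.tmp` and `*.bak` patterns are already excluded by the extension filter and are kept only for parity with the list.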

Scan Configuration Files

| File | Purpose |
|------|---------|
| .a11y-office-config.json | Rule enable/disable for Word, Excel, PowerPoint |
| .a11y-pdf-config.json | Rule enable/disable for PDF scanning |

Scan Profiles

| Profile | Rules | Severities | Use Case |
|---------|-------|------------|----------|
| Strict | All | Error, Warning, Tip | Public-facing, legally required documents |
| Moderate | All | Error, Warning | Most organizations |
| Minimal | All | Error only | Triaging large document libraries |
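
Since every profile runs all rules, the profiles differ only in which severities are reported. A minimal Python sketch of that filter follows; the finding shape (a dict with a `severity` key) and the names `severity_filter` and `filter_findings` are assumptions made for illustration.

```python
from typing import Dict, List

# Profile -> reported severities, copied from the table above.
PROFILE_SEVERITIES = {
    "strict": ["error", "warning", "tip"],
    "moderate": ["error", "warning"],
    "minimal": ["error"],
}


def severity_filter(profile: str) -> List[str]:
    """Return the severities reported under a profile (case-insensitive)."""
    try:
        return PROFILE_SEVERITIES[profile.lower()]
    except KeyError:
        raise ValueError(f"unknown scan profile: {profile!r}") from None


def filter_findings(findings: List[Dict[str, str]], profile: str) -> List[Dict[str, str]]:
    """Keep only findings whose severity the profile reports."""
    # Hypothetical finding shape: {"rule": "...", "severity": "Error"}.
    allowed = set(severity_filter(profile))
    return [f for f in findings if f["severity"].lower() in allowed]
```

The result of `severity_filter` can also be used to fill the Severity Filter field when delegating to a sub-agent.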

Context Passing Format

When delegating to a sub-agent, always provide this context block:

Document Scan Context

  • File: [full path]
  • Scan Profile: [strict | moderate | minimal]
  • Severity Filter: [error, warning, tip]
  • Disabled Rules: [list or "none"]
  • User Notes: [any specifics]
  • Part of Batch: [yes/no - if yes, indicate X of Y]
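
To keep the context block consistent across delegations, it can be generated from structured values. The following is a minimal Python sketch; `build_scan_context` is a hypothetical helper, and the `- ` bullet markup and the `none` fallback for empty notes are assumptions, since the source shows only the field names.

```python
from typing import Optional, Sequence, Tuple


def build_scan_context(
    file: str,
    profile: str,
    severities: Sequence[str],
    disabled_rules: Sequence[str] = (),
    notes: str = "",
    batch: Optional[Tuple[int, int]] = None,
) -> str:
    """Render the Document Scan Context block as plain text."""
    # `batch` is (X, Y) for "X of Y"; None means a single-file scan.
    batch_text = f"yes - {batch[0]} of {batch[1]}" if batch else "no"
    lines = [
        "Document Scan Context",
        f"- File: {file}",
        f"- Scan Profile: {profile}",
        f"- Severity Filter: {', '.join(severities)}",
        f"- Disabled Rules: {', '.join(disabled_rules) or 'none'}",
        f"- User Notes: {notes or 'none'}",
        f"- Part of Batch: {batch_text}",
    ]
    return "\n".join(lines)
```

For instance, `build_scan_context("/docs/report.docx", "moderate", ["error", "warning"], batch=(2, 5))` produces a block whose last line reads `- Part of Batch: yes - 2 of 5`.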