document-scanning
Document Scanning
Supported File Types
| Extension | Type | Sub-Agent |
|---|---|---|
| .docx | Word document | word-accessibility |
| .xlsx | Excel workbook | excel-accessibility |
| .pptx | PowerPoint presentation | powerpoint-accessibility |
| .pdf | PDF document | pdf-accessibility |
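
The routing in the table above can be sketched as a small shell function. This is only an illustration: the name `sub_agent_for` is hypothetical, and treating extensions case-insensitively is an assumption the source does not state.

```bash
# Hypothetical helper: print the sub-agent for a file, based on its extension.
sub_agent_for() {
  # Assumption: ".DOCX" and ".docx" should be routed the same way.
  name=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$name" in
    *.docx) echo "word-accessibility" ;;
    *.xlsx) echo "excel-accessibility" ;;
    *.pptx) echo "powerpoint-accessibility" ;;
    *.pdf)  echo "pdf-accessibility" ;;
    *)      return 1 ;;  # unsupported type: print nothing, signal failure
  esac
}
```

For example, `sub_agent_for "reports/Q1.docx"` prints `word-accessibility`, while an unsupported file such as `notes.txt` prints nothing and returns a non-zero status.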
File Discovery Commands
PowerShell (Windows)
```powershell
# Non-recursive scan
# -Include only takes effect without -Recurse when the path ends in a wildcard
Get-ChildItem -Path "<folder>\*" -File -Include *.docx,*.xlsx,*.pptx,*.pdf

# Recursive scan, skipping Office lock files, temp/backup files, and tooling directories
Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf -Recurse |
  Where-Object { $_.Name -notlike '~$*' -and $_.Name -notlike '*.tmp' -and $_.Name -notlike '*.bak' } |
  Where-Object { $_.FullName -notmatch '[\\/](\.git|node_modules|__pycache__|\.vscode)[\\/]' }
```
Bash (macOS)
```bash
# Non-recursive scan
# '~$*' is single-quoted so the shell does not expand $*
find "<folder>" -maxdepth 1 -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) ! -name '~$*'

# Recursive scan, skipping Office lock files, temp/backup files, and tooling directories
find "<folder>" -type f \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) \
  ! -name '~$*' ! -name "*.tmp" ! -name "*.bak" \
  ! -path "*/.git/*" ! -path "*/node_modules/*" ! -path "*/__pycache__/*" ! -path "*/.vscode/*"
```
Delta Detection
Git-based
```bash
# Files changed since last commit
git diff --name-only HEAD~1 HEAD -- '*.docx' '*.xlsx' '*.pptx' '*.pdf'

# Files changed since a specific tag
git diff --name-only <tag> HEAD -- '*.docx' '*.xlsx' '*.pptx' '*.pdf'

# Files changed in the last N days (added, copied, modified, or renamed)
git log --since="N days ago" --name-only --diff-filter=ACMR --pretty="" -- '*.docx' '*.xlsx' '*.pptx' '*.pdf' | sort -u
```
Timestamp-based (PowerShell)
```powershell
# Files modified since a specific date
Get-ChildItem -Path "<folder>" -File -Include *.docx,*.xlsx,*.pptx,*.pdf -Recurse |
  Where-Object { $_.LastWriteTime -gt [datetime]"2025-01-01" }
```
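
The source gives the timestamp filter only for PowerShell. On macOS or Linux, a comparable filter might be sketched with `find -newermt`, which both GNU and BSD `find` provide but which is not part of POSIX. The function name `modified_since` is hypothetical.

```bash
# Hypothetical helper: list supported documents modified after a given date.
# $1 = folder to scan, $2 = date such as 2025-01-01
modified_since() {
  find "$1" -type f \
    \( -name "*.docx" -o -name "*.xlsx" -o -name "*.pptx" -o -name "*.pdf" \) \
    -newermt "$2"
}
```

Calling `modified_since "<folder>" "2025-01-01"` mirrors the PowerShell example above. The skip patterns from the recursive scan are not applied in this sketch.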
Files to Skip
Always exclude these patterns during scanning:
- `~$*` - Office lock/temp files (created when a document is open)
- `*.tmp` - Temporary files
- `*.bak` - Backup files
- Files inside `.git/`, `node_modules/`, `__pycache__/`, and `.vscode/` directories
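
The skip rules above can also be applied to paths that come from another source, such as `git diff` output. The following is a minimal sketch under that assumption; the function name `should_skip` is hypothetical.

```bash
# Hypothetical helper: return 0 (true) when a path matches a skip pattern.
should_skip() {
  base=${1##*/}  # file name without its leading directories
  case "$base" in
    '~$'*|*.tmp|*.bak) return 0 ;;  # Office lock, temp, and backup files
  esac
  # Prefix "/" so a relative path like ".git/x.pdf" also matches "*/.git/*".
  case "/$1" in
    */.git/*|*/node_modules/*|*/__pycache__/*|*/.vscode/*) return 0 ;;
  esac
  return 1
}
```

A caller could filter a list with `should_skip "$path" || echo "$path"`. Note that a directory such as `.github/` is not excluded, because only exact directory names are matched.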
Scan Configuration Files
| File | Purpose |
|---|---|
|  | Rule enable/disable for Word, Excel, PowerPoint |
|  | Rule enable/disable for PDF scanning |
Scan Profiles
| Profile | Rules | Severities | Use Case |
|---|---|---|---|
| Strict | All | Error, Warning, Tip | Public-facing, legally required documents |
| Moderate | All | Error, Warning | Most organizations |
| Minimal | All | Error only | Triaging large document libraries |
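
Since every profile enables all rules and differs only in severity, a profile can be reduced to a severity filter. The sketch below assumes the lowercase identifiers used in the context block (`strict`, `moderate`, `minimal`, and `error, warning, tip`); the function name `severities_for_profile` is hypothetical.

```bash
# Hypothetical helper: print the severity filter for a scan profile.
severities_for_profile() {
  case "$1" in
    strict)   echo "error, warning, tip" ;;
    moderate) echo "error, warning" ;;
    minimal)  echo "error" ;;
    *)        echo "unknown profile: $1" >&2; return 1 ;;
  esac
}
```

For instance, `severities_for_profile moderate` prints `error, warning`, which can be passed straight into the Severity Filter field when delegating.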
Context Passing Format
When delegating to a sub-agent, always provide this context block:
```text
Document Scan Context
- File: [full path]
- Scan Profile: [strict | moderate | minimal]
- Severity Filter: [error, warning, tip]
- Disabled Rules: [list or "none"]
- User Notes: [any specifics]
- Part of Batch: [yes/no - if yes, indicate X of Y]
```
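
Generating the context block from a function is one way to keep its fields consistent across a batch. The following is a rough sketch, not a prescribed implementation: the name `scan_context` and its argument order are hypothetical, and defaulting empty user notes to `none` and an empty batch field to `no` are assumptions.

```bash
# Hypothetical helper: print a context block for one file.
# Args: file, profile, severities, disabled rules, user notes, batch position
scan_context() {
  file=$1 profile=$2 severities=$3
  # Assumption: empty optional fields fall back to "none" / "no".
  disabled=${4:-none} notes=${5:-none} batch=${6:-no}
  printf '%s\n' \
    "Document Scan Context" \
    "- File: $file" \
    "- Scan Profile: $profile" \
    "- Severity Filter: $severities" \
    "- Disabled Rules: $disabled" \
    "- User Notes: $notes" \
    "- Part of Batch: $batch"
}
```

A call such as `scan_context "/docs/report.docx" moderate "error, warning" "" "" "yes - 2 of 5"` produces the seven-line block in the format shown above.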