view-pdf
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChinesePDF Viewer — Interactive Document Workflows
PDF Viewer — 交互式文档工作流
You have access to a local PDF server that renders documents in a live
viewer and lets you annotate, fill forms, and place signatures with
real-time visual feedback.
你可以访问本地PDF服务器,它能在实时查看器中渲染文档,让你添加注释、填写表单并添加签名,同时提供实时视觉反馈。
When to use this skill
何时使用此技能
Use the PDF viewer when the user wants interactivity:
- "Show me this contract" / "Open this paper"
- "Highlight the key terms and let me review"
- "Help me fill out this form"
- "Sign this on page 3" / "Add my initials to each page"
- "Stamp this CONFIDENTIAL" / "Mark this as approved"
- "Walk me through this document and annotate the important parts"
Do NOT use the viewer for pure ingestion:
- "Summarize this PDF" → use the native Read tool directly
- "What does page 5 say?" → use Read
- "Extract the table from section 3" → use Read
The viewer's value is showing the user the document and collaborating
on markup — not streaming text back to you.
当用户需要交互功能时使用PDF查看器:
- “给我展示这份合同” / “打开这份文件”
- “高亮关键条款并让我审阅”
- “帮我填写这份表单”
- “在第3页签名” / “在每一页添加我的首字母”
- “给这份文件盖上‘CONFIDENTIAL’(保密)章” / “标记此文件已批准”
- “带我浏览这份文档并标注重要部分”
纯内容提取场景请勿使用查看器:
- “总结这份PDF” → 直接使用原生Read工具
- “第5页写了什么?” → 使用Read工具
- “提取第3部分的表格” → 使用Read工具
查看器的核心价值在于向用户展示文档并协作完成标记,而非将文本内容返回给你。
Tools
工具
list_pdfs
list_pdfslist_pdfs
list_pdfsList available local PDFs and allowed local directories. No arguments.
列出可用的本地PDF文件和允许访问的本地目录。无参数。
display_pdf
display_pdfdisplay_pdf
display_pdfOpen a PDF in the interactive viewer. Call once per document.
- — local file path or HTTPS URL
url - — initial page (optional, default 1)
page - — if
elicit_form_inputs, prompts the user to fill form fields before displaying (use for interactive form-filling)true
Returns a — pass this to every call. Calling
again creates a separate viewer; interact calls with
the new UUID won't reach the one the user is looking at.
viewUUIDinteractdisplay_pdfAlso returns (name, type, page, bounding box) if the PDF
has fillable fields — use these coordinates for signature placement.
formFields在交互式查看器中打开PDF。每个文档仅调用一次。
- — 本地文件路径或HTTPS URL
url - — 初始页码(可选,默认值为1)
page - — 如果设为
elicit_form_inputs,会在展示PDF前提示用户填写表单域(用于交互式表单填写场景)true
返回 — 后续所有调用都需要传入此参数。再次调用会创建一个独立的查看器;使用新UUID的调用无法作用于用户当前正在查看的那个查看器。
viewUUIDinteractdisplay_pdfinteract如果PDF包含可填写表单域,还会返回(包含名称、类型、页码、边界框)——可使用这些坐标来放置签名。
formFieldsinteract
interactinteract
interactAll follow-up actions after . Pass plus one or
more commands. Batch multiple commands in one call via the
array — they run sequentially. End batches with
to verify changes visually.
display_pdfviewUUIDcommandsget_screenshotAnnotation actions:
- — add markup (see types below)
add_annotations - — modify existing (id + type required)
update_annotations - — delete by id array
remove_annotations - — auto-find text by query and highlight it (preferred over manual rects for text markup)
highlight_text
Navigation actions:
- (page),
navigate(query),search(query, silent),find(matchIndex),search_navigate(scale 0.5–3.0)zoom
Extraction actions:
- — extract text from page ranges (max 20 pages). Use for reading content to decide what to annotate, NOT for summarization.
get_text - — capture a page as an image (verify your annotations)
get_screenshot
Form action:
- — fill named fields:
fill_formfields: [{name, value}, ...]
display_pdfviewUUIDcommandsget_screenshot注释操作:
- — 添加标记(见下方类型说明)
add_annotations - — 修改现有标记(需要id和类型)
update_annotations - — 通过id数组删除标记
remove_annotations - — 通过查询内容自动查找文本并高亮(对于文本标记,优先使用此方式而非手动绘制矩形)
highlight_text
导航操作:
- (跳转页码)、
navigate(搜索内容)、search(搜索内容,静默模式)、find(跳转至指定匹配项)、search_navigate(缩放比例0.5–3.0)zoom
提取操作:
- — 提取指定页码范围的文本(最多20页)。用于读取内容以决定要标记的部分,不用于摘要生成。
get_text - — 将某一页捕获为图片(用于验证你的标记)
get_screenshot
表单操作:
- — 填写指定名称的表单域:
fill_formfields: [{name, value}, ...]
Annotation Types
注释类型
All annotations need (unique string), , (1-indexed).
Coordinates are PDF points (1/72 inch), origin top-left, Y increases
downward. US Letter is 612×792pt.
idtypepage| Type | Key properties | Use for |
|---|---|---|
| | Mark important text |
| | Emphasize terms |
| | Mark deletions |
| | Sticky-note comments |
| | Visible text on page |
| | Box regions |
| | Circle regions |
| | Draw lines/arrows |
| | APPROVED, DRAFT, CONFIDENTIAL, etc. |
| | Signatures, initials, logos |
Image annotations accept a local file path or HTTPS URL (no data:
URIs). Dimensions auto-detected if omitted. Users can also drag & drop
images directly onto the viewer.
所有注释都需要(唯一字符串)、、(从1开始计数)。坐标采用PDF点(1/72英寸),原点为左上角,Y轴向下递增。美国信纸尺寸为612×792pt。
idtypepage| 类型 | 关键属性 | 用途 |
|---|---|---|
| | 标记重要文本 |
| | 强调术语 |
| | 标记需要删除的内容 |
| | 便签式评论 |
| | 在页面上添加可见文本 |
| | 框选区域 |
| | 圈选区域 |
| | 绘制线条/箭头 |
| | 添加印章,如APPROVED(已批准)、DRAFT(草稿)、CONFIDENTIAL(保密)等 |
| | 添加签名、首字母、Logo等 |
图片注释支持本地文件路径或HTTPS URL(不支持data: URI)。如果省略尺寸参数,会自动检测尺寸。用户也可以直接将图片拖放到查看器中。
Interactive Workflows
交互式工作流
Collaborative annotation (AI-driven)
协作式注释(AI驱动)
- to open the document
display_pdf - →
interacton relevant page range to understand contentget_text - Propose a batch of annotations to the user (describe what you'll mark)
- On approval, →
interact+add_annotationsget_screenshot - Show the user, ask for edits, iterate
- When done, remind them they can download the annotated PDF from the viewer toolbar
- 调用打开文档
display_pdf - 调用→
interact获取相关页码范围的内容以理解文档get_text - 向用户提议一批注释内容(描述你要标记的内容)
- 获得批准后,调用→
interact+add_annotationsget_screenshot - 展示给用户,询问是否需要修改,循环迭代
- 完成后,提醒用户可以从查看器工具栏下载带有注释的PDF
Form filling (visual, not programmatic)
表单填写(可视化,非程序化)
Unlike headless form tools, this gives the user live visual
feedback and handles forms with cryptic/unnamed fields where the
label is printed on the page rather than in field metadata.
- — inspect returned
display_pdf(name, type, page, bounding box)formFields - If field names are cryptic (,
Text1),Field_7the pages and match bounding boxes to visual labelsget_screenshot - Ask the user for values using the visual labels, or infer from context
- →
interact, thenfill_formto show the resultget_screenshot - User confirms or edits directly in the viewer
For simple well-labeled forms, with
prompts the user upfront instead.
display_pdfelicit_form_inputs: true与无界面表单工具不同,此方式为用户提供实时视觉反馈,可处理字段名称不明确/无名称的表单——此类表单的标签是打印在页面上而非存储在字段元数据中。
- 调用— 检查返回的
display_pdf(名称、类型、页码、边界框)formFields - 如果字段名称不明确(如、
Text1),调用Field_7获取页面截图,将边界框与可视化标签匹配get_screenshot - 使用可视化标签向用户询问字段值,或根据上下文推断
- 调用→
interact,然后调用fill_form展示结果get_screenshot - 用户确认或直接在查看器中编辑
对于简单且标签清晰的表单,调用时设置,可提前提示用户填写。
display_pdfelicit_form_inputs: trueSigning (visual, not certified)
签名添加(可视化,非认证式)
- Ask for the signature/initials image path
- , check
display_pdffor signature-type fields or ask which page/positionformFields - →
interactwithadd_annotationsat the target coordinatestype: "image" - to confirm placement
get_screenshot
Disclaimer: This places a visual signature image. It is not a
certified or cryptographic digital signature.
- 向用户索要签名/首字母的图片路径
- 调用,检查
display_pdf是否有签名类型字段,或询问用户签名的页码/位置formFields - 调用→
interact,设置add_annotations并传入目标坐标type: "image" - 调用确认位置
get_screenshot
免责声明: 此操作仅添加可视化签名图片,并非经过认证的加密数字签名。
Supported Sources
支持的来源
- Local files (paths under client MCP roots)
- arXiv (URLs auto-convert to PDF)
/abs/ - Any direct HTTPS PDF URL (bioRxiv, Zenodo, OSF, etc. — use the direct PDF link, not the landing page)
- 本地文件(客户端MCP根目录下的路径)
- arXiv(格式的URL会自动转换为PDF)
/abs/ - 任何直接的HTTPS PDF URL(如bioRxiv、Zenodo、OSF等——请使用直接PDF链接,而非着陆页)
Out of Scope
不支持的场景
- Summarization / text extraction — use native Read instead
- Certified digital signatures — image stamping only
- PDF creation — this works on existing PDFs only
- 摘要生成/文本提取 — 请使用原生Read工具
- 认证式数字签名 — 仅支持图片盖章
- PDF创建 — 此技能仅适用于现有PDF",