pdf-analysis

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

PDF Analysis Skill

PDF分析Skill

Purpose

用途

This skill enables comprehensive analysis of PDF documents, particularly academic papers and technical documents, to extract valuable content for WeChat article creation. It provides systematic workflows for document parsing, content extraction, and transformation into engaging public-facing content.
本Skill支持对PDF文档(尤其是学术论文和技术文档)进行全面分析,为微信文章创作提取有价值的内容。它提供了文档解析、内容提取以及转化为受众喜爱的公开内容的系统化工作流程。

When to Use

使用场景

Activate this skill when users need to:
  • Analyze academic papers in PDF format
  • Extract content from technical documents
  • Understand complex research papers
  • Create WeChat content based on PDF materials
  • Summarize lengthy documents for audience consumption
当用户需要以下服务时,激活本Skill:
  • 分析PDF格式的学术论文
  • 从技术文档中提取内容
  • 理解复杂的研究论文
  • 基于PDF素材创作微信内容
  • 为受众总结冗长的文档

Core Workflow

核心工作流程

1. Document Preprocessing

1. 文档预处理

Start by examining the PDF structure:
  • Check document metadata (title, authors, publication date)
  • Assess document length and complexity
  • Identify document type (research paper, technical report, review article)
  • Note any special formatting or visual elements
  • Determine language and writing style
首先检查PDF结构:
  • 查看文档元数据(标题、作者、发表日期)
  • 评估文档长度和复杂度
  • 确定文档类型(研究论文、技术报告、综述文章)
  • 记录特殊格式或视觉元素
  • 判断语言和写作风格

2. Content Extraction

2. 内容提取

Systematically extract key content sections:
  • Abstract and Introduction: Core research questions and contributions
  • Methodology: Technical approaches and experimental setup
  • Results and Discussion: Key findings and their implications
  • Conclusion: Main takeaways and future work
  • References: Related work and additional resources
系统性提取关键内容板块:
  • 摘要与引言:核心研究问题与贡献
  • 研究方法:技术路径与实验设置
  • 结果与讨论:关键发现及其意义
  • 结论:主要收获与未来工作方向
  • 参考文献:相关研究与额外资源

3. Technical Analysis

3. 技术分析

For research papers, focus on:
  • Problem Statement: What issue is being addressed
  • Novel Contributions: What makes this research unique
  • Methodological Innovation: New techniques or approaches
  • Experimental Results: Key data and findings
  • Practical Applications: Real-world implications and use cases
针对研究论文,重点关注:
  • 问题陈述:研究解决的核心问题
  • 创新贡献:该研究的独特之处
  • 方法创新:新技术或路径
  • 实验结果:关键数据与发现
  • 实际应用:现实意义与使用场景

4. Content Transformation

4. 内容转化

Convert technical content into WeChat-friendly format:
  • Simplify Technical Language: Replace jargon with accessible explanations
  • Create Analogies: Use familiar concepts to explain complex ideas
  • Highlight Relevance: Connect research to everyday experiences
  • Structure Narratively: Create compelling story around the research
  • Add Visual Elements: Suggest diagrams or illustrations if helpful
将技术内容转化为适合微信平台的格式:
  • 简化技术语言:用易懂的解释替代专业术语
  • 创建类比:用熟悉的概念解释复杂想法
  • 突出关联性:将研究与日常经验联系起来
  • 叙事化结构:围绕研究创作引人入胜的故事
  • 添加视觉元素:如有需要,建议使用图表或插图

Document Analysis Templates

文档分析模板

Research Paper Analysis

研究论文分析

Paper Title: [EXTRACTED_TITLE]
Authors: [EXTRACTED_AUTHORS]
Publication: [JOURNAL/CONFERENCE]

Key Questions:
1. What problem does this research solve?
2. What is the main contribution?
3. How does this compare to existing approaches?
4. What are the practical implications?
5. Who would benefit from this research?

Content Extraction:
- Abstract Summary: [KEY_POINTS]
- Methodology Overview: [APPROACH_DESCRIPTION]
- Main Results: [FINDINGS]
- Limitations: [WEAKNESSES/CONSTRAINTS]
- Future Work: [NEXT_STEPS]
Paper Title: [EXTRACTED_TITLE]
Authors: [EXTRACTED_AUTHORS]
Publication: [JOURNAL/CONFERENCE]

Key Questions:
1. What problem does this research solve?
2. What is the main contribution?
3. How does this compare to existing approaches?
4. What are the practical implications?
5. Who would benefit from this research?

Content Extraction:
- Abstract Summary: [KEY_POINTS]
- Methodology Overview: [APPROACH_DESCRIPTION]
- Main Results: [FINDINGS]
- Limitations: [WEAKNESSES/CONSTRAINTS]
- Future Work: [NEXT_STEPS]

Technical Document Analysis

技术文档分析

Document Type: [MANUAL/GUIDE/REPORT]
Target Audience: [INTENDED_USERS]
Core Concepts: [MAIN_TOPICS]

Key Information:
- Purpose: [DOCUMENT_GOAL]
- Scope: [COVERED_AREAS]
- Recommendations: [KEY_ADVICE]
- Implementation: [PRACTICAL_STEPS]
- Resources: [ADDITIONAL_MATERIALS]
Document Type: [MANUAL/GUIDE/REPORT]
Target Audience: [INTENDED_USERS]
Core Concepts: [MAIN_TOPICS]

Key Information:
- Purpose: [DOCUMENT_GOAL]
- Scope: [COVERED_AREAS]
- Recommendations: [KEY_ADVICE]
- Implementation: [PRACTICAL_STEPS]
- Resources: [ADDITIONAL_MATERIALS]

Content Structure for WeChat

微信内容结构

Research-Based Article Structure

研究类文章结构

  1. Engaging Hook: Start with surprising finding or relatable problem
  2. Background Context: Explain why this research matters
  3. Problem Statement: Clearly articulate the issue being addressed
  4. Solution Overview: Describe the innovative approach
  5. Key Results: Highlight most important findings
  6. Practical Impact: Explain real-world applications
  7. Future Outlook: Discuss potential developments
  8. Call to Action: Encourage reader engagement or further learning
  1. 引人入胜的开头:以惊人发现或相关问题切入
  2. 背景介绍:说明该研究的重要性
  3. 问题陈述:清晰阐述研究解决的问题
  4. 方案概述:描述创新路径
  5. 关键结果:突出最重要的发现
  6. 实际影响:解释现实应用场景
  7. 未来展望:讨论潜在发展方向
  8. 行动号召:鼓励读者参与或进一步学习

Writing Style Guidelines

写作风格指南

  • Conversational Tone: Write as if explaining to interested friend
  • Storytelling Elements: Use narrative to maintain engagement
  • Visual Language: Help readers visualize concepts
  • Progressive Disclosure: Introduce complexity gradually
  • Practical Examples: Connect abstract concepts to concrete situations
  • 口语化语气:像给感兴趣的朋友讲解一样写作
  • 叙事元素:用故事性内容保持读者兴趣
  • 可视化语言:帮助读者直观理解概念
  • 渐进式复杂度:从简单开始,逐步增加细节
  • 实际案例:将抽象概念与具体场景联系起来

Quality Assessment Criteria

质量评估标准

Before finalizing content, verify:
  • Accurate representation of original document
  • Appropriate simplification without distortion
  • Clear connection to reader interests
  • Proper attribution to source material
  • Engaging and accessible language
  • Logical flow and structure
在完成内容前,验证以下内容:
  • 准确还原原始文档内容
  • 合理简化且不失真
  • 与读者兴趣清晰关联
  • 正确标注来源
  • 语言生动易懂
  • 逻辑流畅、结构清晰

Common Document Types and Approaches

常见文档类型及处理方法

Academic Papers

学术论文

  • Focus on novel contributions and breakthroughs
  • Explain experimental setup and results clearly
  • Connect theory to practical applications
  • Highlight significance for field advancement
  • 聚焦创新贡献与突破
  • 清晰解释实验设置与结果
  • 将理论与实际应用联系起来
  • 突出对领域发展的意义

Technical Reports

技术报告

  • Emphasize practical recommendations and guidelines
  • Extract actionable insights for readers
  • Simplify technical specifications
  • Provide implementation guidance
  • 强调实际建议与指导方针
  • 为读者提取可操作的见解
  • 简化技术规范
  • 提供实施指导

Review Articles

综述文章

  • Identify key trends and developments
  • Summarize consensus views and debates
  • Highlight emerging areas of research
  • Provide balanced perspective on field
  • 识别关键趋势与发展
  • 总结共识观点与争议
  • 突出新兴研究领域
  • 提供领域的平衡视角

Content Generation Strategies

内容生成策略

Simplification Techniques

简化技巧

  1. Analogical Reasoning: Compare complex concepts to familiar ones
  2. Concrete Examples: Use specific instances to illustrate general principles
  3. Progressive Complexity: Start simple, gradually add detail
  4. Visual Metaphors: Create mental images to aid understanding
  5. Question-Based Approach: Anticipate and answer reader questions
  1. 类比推理:将复杂概念与熟悉事物对比
  2. 具体案例:用具体实例说明通用原则
  3. 渐进式复杂度:从简单入手,逐步增加细节
  4. 视觉隐喻:创造心理图像辅助理解
  5. 问题导向法:预判并解答读者疑问

Engagement Strategies

互动策略

  1. Surprising Facts: Start with unexpected or counterintuitive information
  2. Personal Connection: Relate content to reader experiences
  3. Interactive Elements: Pose questions or scenarios for consideration
  4. Future Implications: Discuss how this might affect readers' lives
  5. Practical Takeaways: Provide actionable advice or insights
  1. 惊人事实:以意外或反直觉的信息开头
  2. 个人关联:将内容与读者经历联系起来
  3. 互动元素:提出供读者思考的问题或场景
  4. 未来影响:讨论内容对读者生活的潜在影响
  5. 实用收获:提供可操作的建议或见解

Integration with Workflow

工作流程整合

After PDF analysis:
  1. Extract key content and insights
  2. Determine appropriate content category
  3. Use
    create-article
    command for structured generation
  4. Apply consistent formatting and style
  5. Ensure proper attribution and references
完成PDF分析后:
  1. 提取关键内容与见解
  2. 确定合适的内容分类
  3. 使用
    create-article
    命令进行结构化生成
  4. 应用统一的格式与风格
  5. 确保正确的引用与标注

Additional Resources

额外资源

Reference Files

参考文件

  • references/analysis-techniques.md
    - Detailed methods for document analysis
  • references/content-templates.md
    - WeChat article templates for different document types
  • references/analysis-techniques.md
    - 文档分析的详细方法
  • references/content-templates.md
    - 不同文档类型的微信文章模板

Example Files

示例文件

  • examples/paper-analysis-example.md
    - Complete analysis of a research paper
  • examples/wechat-article-from-pdf.md
    - Full WeChat article based on PDF analysis
  • examples/paper-analysis-example.md
    - 一份完整的研究论文分析示例
  • examples/wechat-article-from-pdf.md
    - 基于PDF分析生成的完整微信文章

Technical Considerations

技术考量

PDF Parsing

PDF解析

  • Handle different PDF formats and structures
  • Extract text while preserving formatting context
  • Identify and process special elements (tables, figures, equations)
  • Manage multi-column layouts and academic formatting
  • 处理不同PDF格式与结构
  • 在保留格式上下文的同时提取文本
  • 识别并处理特殊元素(表格、图表、公式)
  • 管理多栏布局与学术格式

Content Quality

内容质量

  • Verify accuracy of extracted information
  • Check for completeness of key sections
  • Ensure proper handling of technical terminology
  • Maintain document context and flow
  • 验证提取信息的准确性
  • 检查关键板块的完整性
  • 确保专业术语的正确处理
  • 保留文档的上下文与逻辑 flow

Tips for Effective Analysis

高效分析技巧

  1. Read Abstract First: Get overview before detailed analysis
  2. Identify Target Audience: Tailor content appropriately
  3. Extract Key Numbers: Use statistics and data for credibility
  4. Find Human Angle: Connect technical content to human interests
  5. Verify Claims: Cross-check important statements with source
  6. Consider Visual Elements: Suggest diagrams or illustrations if helpful
  1. 先读摘要:在详细分析前先了解整体概况
  2. 明确目标受众:针对性调整内容
  3. 提取关键数据:用统计数据增强可信度
  4. 挖掘人文视角:将技术内容与人文兴趣联系起来
  5. 验证主张:对照原文交叉检查重要陈述
  6. 考虑视觉元素:如有需要,建议使用图表或插图

Common Pitfalls to Avoid

常见误区规避

  • Oversimplification: Don't distort complex concepts beyond recognition
  • Missing Context: Provide sufficient background for understanding
  • Jargon Overload: Replace or explain technical terms
  • Dry Presentation: Use engaging language and examples
  • Incomplete Attribution: Always credit original sources properly
  • 过度简化:不要将复杂概念扭曲到面目全非
  • 缺失上下文:提供足够的背景信息以帮助理解
  • 术语过载:替换或解释专业术语
  • 枯燥呈现:使用生动的语言与案例
  • 引用不全:始终正确标注原始来源