Academic Paper Analyzer - In-depth Analysis of Academic Papers
Core Capabilities
- High-precision PDF parsing via the MinerU Cloud API
- Automatically extract images, tables, and LaTeX formulas
- Multiple Writing Styles: Storytelling / Academic / Concise
- Optional Formula Explanations: Insert formula images and provide detailed interpretations
- Optional Code Analysis: Explain the paper alongside its open-source GitHub code
- Output Markdown + HTML (base64 image embedding)
Preparations
MinerU API Token
- Visit https://mineru.net to register an account
- Obtain the API Token
- Set the environment variable (recommended):
```bash
export MINERU_TOKEN="your_token_here"
```
Dependency Installation
```bash
pip install requests markdown
```
Operation Steps
Step 1: PDF Parsing (Using MinerU API)
```bash
python scripts/mineru_api.py <pdf_path> <output_dir>
```
Or pass the token directly as the third argument:
```bash
python scripts/mineru_api.py paper.pdf ./output YOUR_TOKEN
```
Output Results:
- Markdown files (including formulas and tables)
- High-quality extracted images
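The two ways of supplying the token in this step (environment variable or third command-line argument) could be resolved roughly as follows. This is a minimal sketch, not the actual logic of `scripts/mineru_api.py`: the helper name `resolve_token` and the precedence (an explicit argument wins over `MINERU_TOKEN`) are assumptions.

```python
from typing import Mapping, Optional, Sequence


def resolve_token(argv: Sequence[str], env: Mapping[str, str]) -> Optional[str]:
    """Return the MinerU token, or None if neither source provides one.

    Hypothetical helper: argv is expected to look like
    [script, pdf_path, output_dir, optional_token].
    """
    # Assumption: an explicit third argument overrides the environment.
    if len(argv) > 3 and argv[3].strip():
        return argv[3].strip()
    token = env.get("MINERU_TOKEN", "").strip()
    return token or None


# Example call with an explicit environment mapping instead of os.environ.
print(resolve_token(["mineru_api.py", "paper.pdf", "./output"], {"MINERU_TOKEN": "abc"}))
```

In a real script the arguments would be `sys.argv` and `os.environ`; passing them in explicitly keeps the helper easy to test.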
Step 2: Extract Paper Information
```bash
python scripts/extract_paper_info.py <output_dir>/*.md paper_info.json
```
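The extraction script itself is not reproduced here. As an illustration of the kind of metadata that can be pulled from MinerU's Markdown output, the sketch below collects the first heading, image paths, and GitHub links with regular expressions. The function name `extract_info` and the keys `title`, `images`, and `github_links` are hypothetical and may not match the real `paper_info.json` schema.

```python
import json
import re
from typing import Dict

IMAGE_RE = re.compile(r"!\[[^\]]*\]\(([^)\s]+)")
GITHUB_RE = re.compile(r"https?://github\.com/[\w.-]+/[\w.-]+")
TITLE_RE = re.compile(r"^#\s+(.+)$", re.MULTILINE)


def extract_info(markdown_text: str) -> Dict[str, object]:
    """Collect a title, image paths, and GitHub links from Markdown text."""
    title_match = TITLE_RE.search(markdown_text)
    # Strip a trailing sentence period, then de-duplicate in first-seen order.
    links = list(dict.fromkeys(
        url.rstrip(".") for url in GITHUB_RE.findall(markdown_text)
    ))
    return {
        "title": title_match.group(1).strip() if title_match else None,
        "images": IMAGE_RE.findall(markdown_text),
        "github_links": links,
    }


sample = "# A Sample Paper\n\n![fig](images/fig1.png)\n\nCode: https://github.com/example/repo\n"
print(json.dumps(extract_info(sample), indent=2))
```

A non-empty `github_links` list is what would make the code-analysis option in Step 3 worth offering.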
Step 3: Style Selection (Ask User)
Before generating the article, you must ask the user to choose among the following options:
1. Writing Style (Required)
| Style | Features | Application Scenarios |
|---|---|---|
| storytelling (Narrative) | Start from intuition, use metaphors and examples like storytelling | Official accounts, technical blogs, popular science |
| academic (Academic) | Professional terminology, rigorous expression, retain original concepts | Academic reports, paper reviews, research group sharing |
| concise (Concise) | Get straight to the core, use tables and lists, high information density | Quick understanding, paper overview, technical research |
2. Formula Options (Optional)
| Option | Description |
|---|---|
| with-formulas | Insert formula images and explain the meaning of symbols in detail |
| no-formulas (Default) | Text-only description, no formula images included |
3. Code Options (Optional, only if the paper has GitHub links)
| Option | Description |
|---|---|
| with-code | Clone the repository, paste key source code, and explain by comparing code with the paper |
| no-code (Default) | No code analysis included |
Example prompt:
Please select an article style:
- academic - Academic, professional and rigorous (recommended default)
- storytelling - Narrative, down-to-earth
- concise - Concise, quick reading
Do you need formula explanations? (Recommended if the paper contains mathematical formulas)
Do you need to combine with GitHub code analysis? (Open-source repository detected: xxx)
If users are unsure, use the academic style by default.
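The defaults described in this step (academic style, no formulas, no code, and code analysis only when a GitHub link exists) can be captured in a small data structure. The sketch below is illustrative only: `ArticleOptions` and `resolve_options` are hypothetical names, and falling back to `academic` on an unrecognized answer is an assumption that extends the "unsure" rule.

```python
from dataclasses import dataclass
from typing import Optional

STYLES = ("storytelling", "academic", "concise")


@dataclass(frozen=True)
class ArticleOptions:
    style: str = "academic"
    with_formulas: bool = False
    with_code: bool = False


def resolve_options(style: Optional[str] = None,
                    with_formulas: bool = False,
                    with_code: bool = False,
                    has_github_link: bool = False) -> ArticleOptions:
    """Apply the documented defaults to the user's answers."""
    chosen = (style or "").strip().lower()
    # Unsure or unrecognized answers fall back to the academic style.
    if chosen not in STYLES:
        chosen = "academic"
    # Code analysis is only available when the paper links a repository.
    return ArticleOptions(chosen, with_formulas, with_code and has_github_link)


print(resolve_options("Storytelling", with_formulas=True))
```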
Step 4: Intelligent Article Generation
Read the corresponding style definition file according to the user's selected style:
- Storytelling Style Guide
- Academic Style Guide
- Concise Style Guide
- Formula Explanation Guide
- Code Analysis Guide
Lightweight Mode (Save Context)
Important: To avoid context bloat, please follow these principles:
- Do not repeatedly read image files: MinerU has already extracted high-quality images, so reference their paths directly
- Trust paper_info.json: it contains the image list and metadata, so no visual confirmation is needed
- Only view key images: read at most 1-2 core architecture diagrams and reference the rest directly
- Let users verify: after generating HTML, ask users to check for themselves that the images are correct
General Writing Principles
Avoid:
- Overused AI phrases (e.g., "in-depth discussion", "of vital importance", "in the field of...")
- Mechanized section titles
- Raw LaTeX formula syntax (use the extracted formula images instead)
- Flat, straightforward technical descriptions
Adopt:
- Natural paragraph narration
- Make full use of images extracted by MinerU
- Every key figure in the paper should be explained
- Formula screenshots are more readable than LaTeX syntax
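Two of the "Avoid" items above (overused phrases and raw LaTeX) are mechanical enough to check automatically before handing the draft over. The sketch below is a rough heuristic, not part of the skill's scripts: `lint_draft` is a hypothetical name, the phrase list is just the three examples quoted above, and the `$...$` pattern will also flag ordinary text that happens to contain two dollar signs on one line.

```python
import re
from typing import List

# Phrases quoted in the "Avoid" list; extend as needed.
OVERUSED_PHRASES = ("in-depth discussion", "of vital importance", "in the field of")

# Rough heuristics for raw LaTeX: $$...$$ or $...$ spans and a few common commands.
LATEX_RE = re.compile(
    r"\$\$.+?\$\$|\$[^$\n]+\$|\\(?:frac|sum|begin|mathbf|alpha)\b",
    re.DOTALL,
)


def lint_draft(text: str) -> List[str]:
    """Return a list of style problems found in a draft."""
    issues = []
    lowered = text.lower()
    for phrase in OVERUSED_PHRASES:
        if phrase in lowered:
            issues.append(f"overused phrase: {phrase!r}")
    if LATEX_RE.search(text):
        issues.append("raw LaTeX found; use the extracted formula image instead")
    return issues


print(lint_draft("This result is of vital importance."))
```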
Storytelling Style Methodology (Exclusive for Storytelling Style)
The following methodology only applies when users select the storytelling style:
1. Start from intuition; do not lead with the technology
- Wrong: "This paper proposes a conditional memory module based on hash tables"
- Correct: "Have you ever thought that large models actually don't have memory functions?"
2. Talk about historical background first, then introduce innovations
- Before introducing new technologies, explain related old technologies first
- Let readers understand "why this innovation is needed"
3. Use simple examples throughout the article
- Choose a simple example and reuse it repeatedly
- For example: "The capital of China is Beijing"
4. Use vivid metaphors
- "Using a cannon to kill a mosquito", "Looking up in a dictionary vs memorizing a dictionary"
- Make abstract concepts concrete
5. Progress logically, going deeper layer by layer
- Simple problems → Complex problems → Solutions
6. Refine core insights
- Summarize in one sentence, e.g., "Memory is memory, computation is computation"
Article Structure
1. Paper Information
```markdown
**Paper Title**: xxx
**Paper Link**: [arXiv](https://arxiv.org/abs/xxxx)
**Author Team**: xxx
```
2. Intuition Introduction (2-3 paragraphs)
- Start with a question or scenario
- Arouse readers' curiosity
- Lead to "why this research is needed"
3. Background Knowledge (3-4 paragraphs)
- Explain relevant basic technologies or historical methods
- Illustrate with simple examples
- Let readers understand the limitations of existing solutions
4. Core Innovations (4-5 paragraphs)
- Explain the paper's innovations in detail
- Each innovation should be supported by images
- Use metaphors and examples to make abstract concepts concrete
- Display formulas with images instead of LaTeX syntax
5. Experimental Verification (2-3 paragraphs)
- Key experimental result charts
- Comparative analysis and data interpretation
- Highlight the most outstanding results
6. In-depth Analysis (2-3 paragraphs)
- Mechanism analysis, ablation experiments, etc.
- Explain "why this method works"
- Provide a deeper understanding
7. Thoughts and Outlook (1-2 paragraphs)
- Refine core insights
- Predict future development directions
- Personal opinions and evaluations
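The paragraph ranges in the structure above can be written down as data and used as a quick length check on a finished draft. This is only an illustrative sketch: `OUTLINE` and `check_lengths` are hypothetical names, and the Paper Information block is left out because it has no paragraph range.

```python
from typing import Dict, List, Tuple

# (section title, minimum paragraphs, maximum paragraphs), as listed above.
OUTLINE: List[Tuple[str, int, int]] = [
    ("Intuition Introduction", 2, 3),
    ("Background Knowledge", 3, 4),
    ("Core Innovations", 4, 5),
    ("Experimental Verification", 2, 3),
    ("In-depth Analysis", 2, 3),
    ("Thoughts and Outlook", 1, 2),
]


def check_lengths(paragraph_counts: Dict[str, int]) -> List[str]:
    """Return the sections whose paragraph count is missing or out of range."""
    problems = []
    for title, low, high in OUTLINE:
        count = paragraph_counts.get(title, 0)
        if not low <= count <= high:
            problems.append(f"{title}: {count} paragraphs, expected {low}-{high}")
    return problems


print(check_lengths({"Intuition Introduction": 2, "Core Innovations": 7}))
```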
Step 5: Output Format (Ask User)
Markdown is the default output. After writing the article, ask users if they need other formats:
"Article generated. Do you need to generate an HTML version? (HTML embeds images for easy direct sharing)"
Format Comparison:
| Format | Advantages | Application Scenarios |
|---|---|---|
| MD (Default) | Lightweight, easy to edit, can be directly imported into official accounts | Daily use |
| HTML | Image embedding, single-file sharing | Preview effects, sharing with others |
If users need HTML:
```bash
python scripts/generate_html.py <article.md> <output.html>
```
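The "image embedding" that makes the HTML a single shareable file amounts to replacing local image paths with base64 data URIs. The sketch below shows only that step, using the standard library; it is not the code of `scripts/generate_html.py`, and the name `embed_images` is hypothetical. Converting Markdown to HTML beforehand (for instance with the `markdown` package installed earlier) is assumed to have happened already.

```python
import base64
import mimetypes
import re
from pathlib import Path

IMG_SRC_RE = re.compile(r'(<img\b[^>]*?\bsrc=")([^"]+)(")')


def embed_images(html: str, base_dir: Path) -> str:
    """Replace local <img src="..."> paths with base64 data URIs."""

    def to_data_uri(match: re.Match) -> str:
        src = match.group(2)
        path = base_dir / src
        # Leave remote URLs, existing data URIs, and missing files untouched.
        if src.startswith(("http://", "https://", "data:")) or not path.is_file():
            return match.group(0)
        mime = mimetypes.guess_type(path.name)[0] or "application/octet-stream"
        payload = base64.b64encode(path.read_bytes()).decode("ascii")
        return f"{match.group(1)}data:{mime};base64,{payload}{match.group(3)}"

    return IMG_SRC_RE.sub(to_data_uri, html)


# A missing file is left as-is, so this call returns its input unchanged.
print(embed_images('<img src="images/fig1.png">', Path(".")))
```

Base64 encoding grows each image by roughly a third, which is one reason to keep Markdown as the default and generate HTML only on request.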
Resource Index
Style Definitions:
- Storytelling Style
- Academic Style
- Concise Style
- Formula Explanations
- Code Analysis
Scripts:
- MinerU Cloud API Call (Recommended)
- Local Conversion (Alternative, requires PyMuPDF)
- scripts/extract_paper_info.py - Extract Paper Metadata
- Generate HTML (base64 images)
Notes
- Prefer the MinerU API: it gives the highest accuracy and supports formulas and tables
- Save context: do not repeatedly read images; trust the metadata
- Do not output the analysis process; users only need the final article
- Avoid bullet points; use natural paragraph narration
- Select 3-5 key charts as images
API Limitations
- Maximum size per file: 200MB
- Maximum pages per file: 600 pages
- Supported formats include PDF, DOC, PPT, and images
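Checking a file against these limits before uploading avoids a wasted API call. The sketch below is a hedged illustration: `check_limits` is a hypothetical helper, "200MB" is read here as 200 × 1024 × 1024 bytes (the service may count differently), and the page count is an optional input because obtaining it requires a PDF library.

```python
from typing import List, Optional

MAX_BYTES = 200 * 1024 * 1024  # Assumption: "200MB" read as 200 MiB.
MAX_PAGES = 600


def check_limits(size_bytes: int, page_count: Optional[int] = None) -> List[str]:
    """Return human-readable violations of the documented limits."""
    problems = []
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, limit is {MAX_BYTES}")
    # The page count is optional because reading it requires a PDF library.
    if page_count is not None and page_count > MAX_PAGES:
        problems.append(f"file has {page_count} pages, limit is {MAX_PAGES}")
    return problems


print(check_limits(size_bytes=5_000_000, page_count=620))
```

For a real file, `size_bytes` can be taken from `pathlib.Path(pdf_path).stat().st_size`.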