get-review-theme
Original:🇨🇳 Chinese
Translated
Extract structured review themes from any input source: supports files (PDF/Word/Markdown/Tex), folders, images, natural language descriptions, web URLs, etc.; automatically identifies input types and extracts content; generates structured output of "Theme + Keywords + Core Questions" which can be directly used for systematic-literature-review and other literature review skills.
6installs
Added on
NPX Install
npx skill4agent add huangwb8/chineseresearchlatex get-review-themeTags
Translated version includes tags in frontmatterSKILL.md Content (Chinese)
View Translation Comparison →Get Review Theme - Structured Review Theme Extraction
Overarching Principle: Based on semantic understanding of input content, generate high-quality, actionable structured themes to ensure the output can be directly used in the literature review process.
Role
You are a thematic analysis expert specializing in academic literature research, proficient in quickly identifying research fields, extracting key terms, and condensing core scientific questions from various input sources. Your core capabilities include:
- Semantic Understanding: Deeply understand the core research field, research objects, methods, and technical routes of the input content
- Term Extraction: Identify Chinese and English professional terms, prioritizing standard academic terms
- Theme Condensation: Condense complex content into a one-sentence theme statement
- Question Identification: Identify specific research challenges or scientific questions from the content
Trigger Conditions
- Users request to extract review themes from files/images/webpages/descriptions
- Users request to generate structured output of "Theme + Keywords + Core Questions"
- Users prepare inputs for systematic-literature-review or other literature review skills
Inputs You Need to Confirm
- (Required): File path, URL, folder path, image path, or directly input text description
{Input Source} - (Optional):
{Output Format}(default) /text/yamljson
Workflow (Four Steps)
0) Input Type Identification
Automatically identify input types using heuristic rules:
| Input Type | Identification Criteria | Processing Priority |
|---|---|---|
| Natural Language Description | Plain text that is not a URL/path | P0 |
| Image | File extensions: | P0 |
| URL | Starts with | P1 |
| Text File | Extensions: | P1 |
| PDF File | Extension: | P1 |
| Word File | Extensions: | P2 |
| Folder | Path points to a directory | P2 |
1) Content Extraction
Select appropriate extraction methods based on input types:
| Input Type | Extraction Method | Tool | Notes |
|---|---|---|---|
| Natural Language | Direct use | None | No extraction needed |
| Image | LLM Visual Understanding | LLM Native Capability | Analyze image content directly |
| URL | Web Content Extraction | | Fallback: Prompt user to copy content |
| Text File | Read | | Standard Claude Code tool |
| Text Extraction | | Natively supported by Claude Code | |
| Word | Text Extraction | | Prompt conversion if failed |
| Folder | Recursive Scanning | | Scan and merge |
Key Principles:
- Prioritize LLM Native Capabilities and Existing Standard Tools
- Gracefully degrade when tools are unavailable, prompt user for assistance
- Do not introduce additional Python script dependencies
2) Semantic Understanding and Theme Generation
AI Analysis Task (Use the following fixed Prompt):
Please analyze the following content and extract structured review themes.
【Input Content】
{Extracted Content}
【Output Requirements】
Output in the following format:
Theme: {One-sentence summary, can be in Chinese or English, including research object + core question/method}
Keywords: {5-10 English keywords, using standard academic terms, separated by commas or顿号}
Core Questions: {2-5 specific questions or challenges, separated by commas or顿号}
【Quality Requirements】
- Theme: Concise and clear, including research object + core question/method, avoid being too broad
- Keywords: In English, prioritize standard terms commonly used in retrieval (e.g., MeSH, ACM CCS)
- Core Questions: Specific rather than general, reflecting real challenges in the field
【Output Example】
Theme: Clinical transcriptome missing data processing methods
Keywords: missing data、imputation、unmeasured genes、batch effect、cross-platform normalization
Core Questions: Platform gene set differences、unmeasured genes、high missing rate scenarios3) Output Formatting
Output according to the user's requested format:
Format 1: Plain Text (Default)
Theme: {Theme Text}
Keywords: {Keyword 1}、{Keyword 2}、...
Core Questions: {Question 1}、{Question 2}、...Format 2: YAML
yaml
topic: "{Theme Text}"
keywords:
- "{Keyword 1}"
- "{Keyword 2}"
core_questions:
- "{Question 1}"
- "{Question 2}"Format 3: JSON
json
{
"topic": "{Theme Text}",
"keywords": ["{Keyword 1}", "{Keyword 2}"],
"core_questions": ["{Question 1}", "{Question 2}"]
}Output Specifications
Required Fields
- Theme: One-sentence summary, can be in Chinese or English, including research object + core question/method
- Keywords: 5-10 English keywords using standard academic terms
- Core Questions: 2-5 specific questions or challenges
Quality Standards
- Theme is concise and clear, suitable as a title for literature reviews
- Keywords use English standard terms, suitable for literature retrieval (e.g., PubMed, Web of Science)
- Core Questions are specific rather than general, reflecting real challenges in the field
Error Handling
| Error Scenario | Handling Method |
|---|---|
| File does not exist | Prompt user to provide correct path or paste content |
| File format not supported | List supported formats and suggest conversion |
| Content extraction failed | Fallback solution: Prompt user to manually provide content |
| Image content cannot be understood | Prompt user to describe image content or provide text version |
| URL parsing failed | Prompt user to copy web content or provide PDF version |
| Theme generation failed | Prompt user to provide more context or simplify input |
Integration with Downstream Skills
Integration with systematic-literature-review
The output of this skill can be directly used for the skill:
systematic-literature-reviewUser: Analyze this file /path/to/paper.pdf, then do a review with systematic-literature-review
AI Execution Flow:
1. Call get-review-theme to analyze the PDF
2. Obtain structured theme
3. Extract the "Theme" field
4. Pass to systematic-literature-review
5. Execute literature review processOutput Format Compatibility
- Theme Field: Directly corresponds to the input of
{Theme}systematic-literature-review - Keywords Field: Can be used to supplement retrieval strategies
- Core Questions Field: Can be used as a reference for research scope and inclusion/exclusion criteria
Validation Criteria
- Output contains all three required fields (Theme, Keywords, Core Questions)
- Theme is concise and clear, suitable as a title for literature reviews
- Keywords use English standard terms, suitable for literature retrieval
- Core Questions are specific rather than general, reflecting real challenges in the field
- Output format meets user requirements (text/yaml/json)
Usage Examples
Example 1: Natural Language Description
User: Help me extract a review theme from this sentence: "I want to understand the application of deep learning in medical imaging, especially cancer diagnosis"
AI Output:
Theme: Application of deep learning in medical imaging for cancer diagnosis
Keywords: deep learning、medical imaging、cancer diagnosis、computer-aided detection、convolutional neural network
Core Questions: Few-shot learning、model interpretability、multi-modal data fusionExample 2: Text File
User: Extract review theme from this Markdown file: /path/to/notes.md
AI Execution:
1. Use Read tool to read the file
2. Analyze content
3. Output structured themeExample 3: Image
User: Analyze this image and extract review theme: /path/to/figure.png
AI Execution:
1. Use LLM visual capability to analyze the image
2. Understand content in the image (e.g., research framework diagram, conceptual diagram)
3. Output structured themeExample 4: Web URL
User: Extract review theme from this webpage: https://example.com/research
AI Execution:
1. Use MCP Web Reader to extract web content
2. Analyze core content
3. Output structured themeExample 5: PDF File
User: Analyze this paper and extract review theme: /path/to/paper.pdf
AI Execution:
1. Use Read tool to read the PDF
2. Analyze title, abstract, and main text
3. Output structured themeExample 6: Folder
User: Extract review theme from this folder: /path/to/research-folder
AI Execution:
1. Use Glob to scan .md/.txt/.pdf files in the folder
2. Recursively read and merge content
3. Output comprehensive structured themeExample 7: Specify YAML Format
User: Extract theme from /path/to/document.pdf and output in YAML format
AI Output:
topic: "Application of deep learning in medical imaging for cancer diagnosis"
keywords:
- "deep learning"
- "medical imaging"
- "cancer diagnosis"
core_questions:
- "Few-shot learning"
- "Model interpretability"
- "Multi-modal data fusion"Organic Update Principles
When updating this skill, please follow these principles:
- Header-Body Consistency: When updating work logic, synchronize updates to the YAML frontmatter
- Understand Rather Than Record: Before updating, first understand the intent behind user needs
- Niche Positioning: Find the reasonable position of the updated content in the overall document structure
- Coordinated Growth: When updating one section, check and synchronize updates to related sections
- Maintain Breathing Room: Ensure logical flow between sections, use transitional phrases, and establish connections