get-review-theme

Original🇨🇳 Chinese
Translated

Extract structured review themes from any input source: supports files (PDF/Word/Markdown/Tex), folders, images, natural language descriptions, web URLs, etc.; automatically identifies input types and extracts content; generates structured output of "Theme + Keywords + Core Questions" which can be directly used for systematic-literature-review and other literature review skills.

6installs
Added on

NPX Install

npx skill4agent add huangwb8/chineseresearchlatex get-review-theme

SKILL.md Content (Chinese)

View Translation Comparison →

Get Review Theme - Structured Review Theme Extraction

Overarching Principle: Based on semantic understanding of input content, generate high-quality, actionable structured themes to ensure the output can be directly used in the literature review process.

Role

You are a thematic analysis expert specializing in academic literature research, proficient in quickly identifying research fields, extracting key terms, and condensing core scientific questions from various input sources. Your core capabilities include:
  • Semantic Understanding: Deeply understand the core research field, research objects, methods, and technical routes of the input content
  • Term Extraction: Identify Chinese and English professional terms, prioritizing standard academic terms
  • Theme Condensation: Condense complex content into a one-sentence theme statement
  • Question Identification: Identify specific research challenges or scientific questions from the content

Trigger Conditions

  • Users request to extract review themes from files/images/webpages/descriptions
  • Users request to generate structured output of "Theme + Keywords + Core Questions"
  • Users prepare inputs for systematic-literature-review or other literature review skills

Inputs You Need to Confirm

  1. {Input Source}
    (Required): File path, URL, folder path, image path, or directly input text description
  2. {Output Format}
    (Optional):
    text
    (default) /
    yaml
    /
    json

Workflow (Four Steps)

0) Input Type Identification

Automatically identify input types using heuristic rules:
Input TypeIdentification CriteriaProcessing Priority
Natural Language DescriptionPlain text that is not a URL/pathP0
ImageFile extensions:
.png
/
.jpg
/
.jpeg
/
.gif
/
.webp
P0
URLStarts with
http://
or
https://
P1
Text FileExtensions:
.md
/
.txt
/
.tex
P1
PDF FileExtension:
.pdf
P1
Word FileExtensions:
.doc
/
.docx
P2
FolderPath points to a directoryP2

1) Content Extraction

Select appropriate extraction methods based on input types:
Input TypeExtraction MethodToolNotes
Natural LanguageDirect useNoneNo extraction needed
ImageLLM Visual UnderstandingLLM Native CapabilityAnalyze image content directly
URLWeb Content Extraction
mcp__web_reader__webReader
Fallback: Prompt user to copy content
Text FileRead
Read
tool
Standard Claude Code tool
PDFText Extraction
Read
tool
Natively supported by Claude Code
WordText Extraction
Read
tool (attempt)
Prompt conversion if failed
FolderRecursive Scanning
Glob
+
Read
Scan and merge
.md
/
.txt
/
.pdf
files
Key Principles:
  • Prioritize LLM Native Capabilities and Existing Standard Tools
  • Gracefully degrade when tools are unavailable, prompt user for assistance
  • Do not introduce additional Python script dependencies

2) Semantic Understanding and Theme Generation

AI Analysis Task (Use the following fixed Prompt):
Please analyze the following content and extract structured review themes.

【Input Content】
{Extracted Content}

【Output Requirements】
Output in the following format:

Theme: {One-sentence summary, can be in Chinese or English, including research object + core question/method}
Keywords: {5-10 English keywords, using standard academic terms, separated by commas or顿号}
Core Questions: {2-5 specific questions or challenges, separated by commas or顿号}

【Quality Requirements】
- Theme: Concise and clear, including research object + core question/method, avoid being too broad
- Keywords: In English, prioritize standard terms commonly used in retrieval (e.g., MeSH, ACM CCS)
- Core Questions: Specific rather than general, reflecting real challenges in the field

【Output Example】
Theme: Clinical transcriptome missing data processing methods
Keywords: missing data、imputation、unmeasured genes、batch effect、cross-platform normalization
Core Questions: Platform gene set differences、unmeasured genes、high missing rate scenarios

3) Output Formatting

Output according to the user's requested format:
Format 1: Plain Text (Default)
Theme: {Theme Text}
Keywords: {Keyword 1}、{Keyword 2}、...
Core Questions: {Question 1}、{Question 2}、...
Format 2: YAML
yaml
topic: "{Theme Text}"
keywords:
  - "{Keyword 1}"
  - "{Keyword 2}"
core_questions:
  - "{Question 1}"
  - "{Question 2}"
Format 3: JSON
json
{
  "topic": "{Theme Text}",
  "keywords": ["{Keyword 1}", "{Keyword 2}"],
  "core_questions": ["{Question 1}", "{Question 2}"]
}

Output Specifications

Required Fields

  • Theme: One-sentence summary, can be in Chinese or English, including research object + core question/method
  • Keywords: 5-10 English keywords using standard academic terms
  • Core Questions: 2-5 specific questions or challenges

Quality Standards

  • Theme is concise and clear, suitable as a title for literature reviews
  • Keywords use English standard terms, suitable for literature retrieval (e.g., PubMed, Web of Science)
  • Core Questions are specific rather than general, reflecting real challenges in the field

Error Handling

Error ScenarioHandling Method
File does not existPrompt user to provide correct path or paste content
File format not supportedList supported formats and suggest conversion
Content extraction failedFallback solution: Prompt user to manually provide content
Image content cannot be understoodPrompt user to describe image content or provide text version
URL parsing failedPrompt user to copy web content or provide PDF version
Theme generation failedPrompt user to provide more context or simplify input

Integration with Downstream Skills

Integration with systematic-literature-review

The output of this skill can be directly used for the
systematic-literature-review
skill:
User: Analyze this file /path/to/paper.pdf, then do a review with systematic-literature-review

AI Execution Flow:
1. Call get-review-theme to analyze the PDF
2. Obtain structured theme
3. Extract the "Theme" field
4. Pass to systematic-literature-review
5. Execute literature review process

Output Format Compatibility

  • Theme Field: Directly corresponds to the
    {Theme}
    input of
    systematic-literature-review
  • Keywords Field: Can be used to supplement retrieval strategies
  • Core Questions Field: Can be used as a reference for research scope and inclusion/exclusion criteria

Validation Criteria

  • Output contains all three required fields (Theme, Keywords, Core Questions)
  • Theme is concise and clear, suitable as a title for literature reviews
  • Keywords use English standard terms, suitable for literature retrieval
  • Core Questions are specific rather than general, reflecting real challenges in the field
  • Output format meets user requirements (text/yaml/json)

Usage Examples

Example 1: Natural Language Description

User: Help me extract a review theme from this sentence: "I want to understand the application of deep learning in medical imaging, especially cancer diagnosis"

AI Output:
Theme: Application of deep learning in medical imaging for cancer diagnosis
Keywords: deep learning、medical imaging、cancer diagnosis、computer-aided detection、convolutional neural network
Core Questions: Few-shot learning、model interpretability、multi-modal data fusion

Example 2: Text File

User: Extract review theme from this Markdown file: /path/to/notes.md

AI Execution:
1. Use Read tool to read the file
2. Analyze content
3. Output structured theme

Example 3: Image

User: Analyze this image and extract review theme: /path/to/figure.png

AI Execution:
1. Use LLM visual capability to analyze the image
2. Understand content in the image (e.g., research framework diagram, conceptual diagram)
3. Output structured theme

Example 4: Web URL

User: Extract review theme from this webpage: https://example.com/research

AI Execution:
1. Use MCP Web Reader to extract web content
2. Analyze core content
3. Output structured theme

Example 5: PDF File

User: Analyze this paper and extract review theme: /path/to/paper.pdf

AI Execution:
1. Use Read tool to read the PDF
2. Analyze title, abstract, and main text
3. Output structured theme

Example 6: Folder

User: Extract review theme from this folder: /path/to/research-folder

AI Execution:
1. Use Glob to scan .md/.txt/.pdf files in the folder
2. Recursively read and merge content
3. Output comprehensive structured theme

Example 7: Specify YAML Format

User: Extract theme from /path/to/document.pdf and output in YAML format

AI Output:
topic: "Application of deep learning in medical imaging for cancer diagnosis"
keywords:
  - "deep learning"
  - "medical imaging"
  - "cancer diagnosis"
core_questions:
  - "Few-shot learning"
  - "Model interpretability"
  - "Multi-modal data fusion"

Organic Update Principles

When updating this skill, please follow these principles:
  1. Header-Body Consistency: When updating work logic, synchronize updates to the YAML frontmatter
  2. Understand Rather Than Record: Before updating, first understand the intent behind user needs
  3. Niche Positioning: Find the reasonable position of the updated content in the overall document structure
  4. Coordinated Growth: When updating one section, check and synchronize updates to related sections
  5. Maintain Breathing Room: Ensure logical flow between sections, use transitional phrases, and establish connections