literature-search

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Literature Search

学术文献搜索

Search multiple academic databases to find relevant papers.
搜索多个学术数据库以找到相关论文。

Input

输入

  • $ARGUMENTS
    — The search query (natural language)
  • $ARGUMENTS
    — 搜索查询(自然语言)

Scripts

脚本

Semantic Scholar (primary — best for ML/AI, has BibTeX)

Semantic Scholar(首选——最适合机器学习/人工智能领域,支持BibTeX)

bash
python ~/.claude/skills/deep-research/scripts/search_semantic_scholar.py \
  --query "QUERY" --max-results 20 --year-range 2022-2026 \
  --api-key "$(grep S2_API_Key /Users/lingzhi/Code/keys.md 2>/dev/null | cut -d: -f2 | tr -d ' ')" \
  -o results_s2.jsonl
Key flags:
--peer-reviewed-only
,
--top-conferences
,
--min-citations N
,
--venue NeurIPS ICML
bash
python ~/.claude/skills/deep-research/scripts/search_semantic_scholar.py \
  --query "QUERY" --max-results 20 --year-range 2022-2026 \
  --api-key "$(grep S2_API_Key /Users/lingzhi/Code/keys.md 2>/dev/null | cut -d: -f2 | tr -d ' ')" \
  -o results_s2.jsonl
关键参数:
--peer-reviewed-only
(仅同行评审论文)、
--top-conferences
(仅顶会论文)、
--min-citations N
(最低引用量N)、
--venue NeurIPS ICML
(指定发表场所)

arXiv (latest preprints)

arXiv(最新预印本)

bash
python ~/.claude/skills/deep-research/scripts/search_arxiv.py \
  --query "QUERY" --max-results 10 -o results_arxiv.jsonl
bash
python ~/.claude/skills/deep-research/scripts/search_arxiv.py \
  --query "QUERY" --max-results 10 -o results_arxiv.jsonl

OpenAlex (broadest coverage, free, no API key)

OpenAlex(覆盖范围最广,免费,无需API密钥)

bash
python ~/.claude/skills/literature-search/scripts/search_openalex.py \
  --query "QUERY" --max-results 20 --year-range 2022-2026 \
  --min-citations 5 -o results_openalex.jsonl
bash
python ~/.claude/skills/literature-search/scripts/search_openalex.py \
  --query "QUERY" --max-results 20 --year-range 2022-2026 \
  --min-citations 5 -o results_openalex.jsonl

Merge & Deduplicate

合并与去重

bash
python ~/.claude/skills/deep-research/scripts/paper_db.py merge \
  --inputs results_s2.jsonl results_arxiv.jsonl results_openalex.jsonl \
  --output merged.jsonl
bash
python ~/.claude/skills/deep-research/scripts/paper_db.py merge \
  --inputs results_s2.jsonl results_arxiv.jsonl results_openalex.jsonl \
  --output merged.jsonl

CrossRef (DOI-based lookup, broadest type coverage)

CrossRef(基于DOI的检索,覆盖类型最广)

bash
python ~/.claude/skills/literature-search/scripts/search_crossref.py \
  --query "QUERY" --rows 10 --output results_crossref.jsonl
Key flags:
--bibtex
(output .bib format),
--rows N
bash
python ~/.claude/skills/literature-search/scripts/search_crossref.py \
  --query "QUERY" --rows 10 --output results_crossref.jsonl
关键参数:
--bibtex
(输出.bib格式)、
--rows N
(返回结果数量N)

Download arXiv Source (get .tex files)

下载arXiv源文件(获取.tex文件)

bash
python ~/.claude/skills/literature-search/scripts/download_arxiv_source.py \
  --title "Paper Title" --output-dir arxiv_papers/
Key flags:
--arxiv-id 1706.03762
,
--metadata
,
--max-results N
bash
python ~/.claude/skills/literature-search/scripts/download_arxiv_source.py \
  --title "Paper Title" --output-dir arxiv_papers/
关键参数:
--arxiv-id 1706.03762
(指定arXiv编号)、
--metadata
(仅获取元数据)、
--max-results N
(最大结果数量)

Generate BibTeX from results

从结果生成BibTeX

bash
python ~/.claude/skills/deep-research/scripts/bibtex_manager.py \
  --jsonl merged.jsonl --output references.bib
bash
python ~/.claude/skills/deep-research/scripts/bibtex_manager.py \
  --jsonl merged.jsonl --output references.bib

Workflow

工作流程

  1. Expand the user's query into 2-4 complementary search queries
  2. Run Semantic Scholar search (primary) with expanded queries
  3. Run arXiv for very recent preprints (< 3 months)
  4. Optionally run OpenAlex for broader coverage
  5. Merge and deduplicate results
  6. Rank by: citations (0.3) + recency (0.3) + venue quality (0.2) + relevance (0.2)
  7. Present structured results table
  1. 将用户的查询扩展为2-4个互补的搜索查询
  2. 使用扩展后的查询运行Semantic Scholar检索(首选)
  3. 运行arXiv检索获取最新预印本(<3个月)
  4. 可选运行OpenAlex以获得更广泛的覆盖
  5. 合并并去重结果
  6. 按以下权重排序:引用量(0.3)+ 时效性(0.3)+ 发表场所质量(0.2)+ 相关性(0.2)
  7. 以结构化结果表格形式呈现

Venue Quality Tiers

发表场所质量等级

Tier 1: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, CVPR, ICCV, ECCV, KDD, AAAI, IJCAI, SIGIR, WWW Tier 2: AISTATS, UAI, COLT, COLING, EACL, WACV, JMLR, TACL Tier 3: Workshops, arXiv preprints — mark with
(preprint)
一级会议/期刊: NeurIPS、ICML、ICLR、ACL、EMNLP、NAACL、CVPR、ICCV、ECCV、KDD、AAAI、IJCAI、SIGIR、WWW 二级会议/期刊: AISTATS、UAI、COLT、COLING、EACL、WACV、JMLR、TACL 三级: 研讨会、arXiv预印本 —— 标记为
(preprint)

Output Format

输出格式

Present results as a table + detailed entries with BibTeX keys. Always note preprint status.
以表格形式呈现结果,并附带包含BibTeX键的详细条目。需始终标注预印本状态。

Related Skills

相关技能

  • Downstream: citation-management, literature-review, related-work-writing
  • See also: deep-research, novelty-assessment
  • 下游技能:文献管理文献综述相关工作撰写
  • 另见:深度研究创新性评估