tooluniverse-infectious-disease

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Infectious Disease Outbreak Intelligence

传染病暴发智能分析系统

Rapid response system for emerging pathogens using taxonomy analysis, target identification, structure prediction, and computational drug repurposing.
KEY PRINCIPLES:
  1. Speed is critical - Optimize for rapid actionable intelligence
  2. Target essential proteins - Focus on conserved, essential viral/bacterial proteins
  3. Leverage existing drugs - Prioritize FDA-approved compounds for repurposing
  4. Structure-guided - Use NvidiaNIM for rapid structure prediction and docking
  5. Evidence-graded - Grade repurposing candidates by evidence strength
  6. Actionable output - Prioritized drug candidates with rationale
  7. English-first queries - Always use English terms in tool calls (pathogen names, protein names, drug names), even if the user writes in another language. Only try original-language terms as a fallback. Respond in the user's language

针对新发病原体的快速响应系统,采用分类分析、靶点识别、结构预测和计算药物重定位技术。
核心原则
  1. 速度至关重要 - 优化以获取可快速落地的情报
  2. 靶向关键蛋白 - 聚焦保守、必需的病毒/细菌蛋白
  3. 利用现有药物 - 优先考虑FDA批准的可重定位化合物
  4. 结构导向 - 使用NvidiaNIM进行快速结构预测与分子对接
  5. 证据分级 - 根据证据强度对重定位候选药物分级
  6. 可落地输出 - 附带依据的优先推荐药物候选
  7. 优先使用英文查询 - 工具调用中始终使用英文术语(病原体名称、蛋白名称、药物名称),即使用户使用其他语言提问。仅在必要时尝试使用原语言术语作为备选。以用户使用的语言回复

When to Use

适用场景

Apply when user asks:
  • "New pathogen detected - what drugs might work?"
  • "Emerging virus [X] - therapeutic options?"
  • "Drug repurposing candidates for [pathogen]"
  • "What do we know about [novel coronavirus/bacteria]?"
  • "Essential targets in [pathogen] for drug development"
  • "Can we repurpose [drug] against [pathogen]?"

当用户提出以下问题时适用:
  • "检测到新型病原体,哪些药物可能有效?"
  • "新发病毒[X]的治疗方案有哪些?"
  • "[病原体]的药物重定位候选物"
  • "我们对[新型冠状病毒/细菌]了解多少?"
  • "[病原体]中用于药物开发的关键靶点"
  • "我们能否将[药物]重定位用于对抗[病原体]?"

Critical Workflow Requirements

关键工作流要求

1. Report-First Approach (MANDATORY)

1. 报告优先方法(强制要求)

  1. Create the report file FIRST:
    • File name:
      [PATHOGEN]_outbreak_intelligence.md
    • Initialize with section headers
    • Add placeholder:
      [Analyzing...]
  2. Progressively update as you gather data
  3. Output separate files:
    • [PATHOGEN]_drug_candidates.csv
      - Ranked repurposing candidates
    • [PATHOGEN]_target_proteins.csv
      - Druggable targets
  1. 首先创建报告文件
    • 文件名:
      [PATHOGEN]_outbreak_intelligence.md
    • 初始化时添加章节标题
    • 添加占位符:
      [分析中...]
  2. 收集数据时逐步更新
  3. 输出独立文件
    • [PATHOGEN]_drug_candidates.csv
      - 排序后的重定位候选药物
    • [PATHOGEN]_target_proteins.csv
      - 可成药靶点

2. Citation Requirements (MANDATORY)

2. 引用要求(强制要求)

markdown
undefined
markdown
undefined

Target: RNA-dependent RNA polymerase (RdRp)

靶点:RNA依赖的RNA聚合酶(RdRp)

  • UniProt: P0DTD1 (NSP12)
  • Essentiality: Required for replication
  • Conservation: >95% across variants
  • Drug precedent: Remdesivir targets RdRp
Source: UniProt via
UniProt_search
, literature review

---
  • UniProt:P0DTD1(NSP12)
  • 必需性:复制必需
  • 保守性:在各变异株中保守性>95%
  • 药物先例:Remdesivir靶向RdRp
来源:UniProt via
UniProt_search
, 文献综述

---

Phase 0: Tool Verification

阶段0:工具验证

Known Parameter Corrections

已知参数修正

ToolWRONG ParameterCORRECT Parameter
NCBI_Taxonomy_search
name
query
UniProt_search
name
query
ChEMBL_search_targets
target
query
NvidiaNIM_diffdock
protein_file
protein
(content)

工具错误参数正确参数
NCBI_Taxonomy_search
name
query
UniProt_search
name
query
ChEMBL_search_targets
target
query
NvidiaNIM_diffdock
protein_file
protein
(内容)

Workflow Overview

工作流概览

Phase 1: Pathogen Identification
├── Taxonomic classification
├── Closest relatives (for knowledge transfer)
├── Genome/proteome availability
└── OUTPUT: Pathogen profile
Phase 2: Target Identification
├── Essential genes/proteins
├── Conserved across strains
├── Druggability assessment
└── OUTPUT: Prioritized target list
Phase 3: Structure Prediction (NvidiaNIM)
├── AlphaFold2/ESMFold for targets
├── Binding site identification
├── Quality assessment (pLDDT)
└── OUTPUT: Target structures
Phase 4: Drug Repurposing Screen
├── Approved drugs for related pathogens
├── Broad-spectrum antivirals/antibiotics
├── Docking screen (NvidiaNIM_diffdock)
└── OUTPUT: Candidate drugs
Phase 4.5: Pathway Analysis (NEW)
├── KEGG: Pathogen metabolism pathways
├── Essential metabolic targets
├── Host-pathogen interaction pathways
└── OUTPUT: Pathway-based drug targets
Phase 5: Literature Intelligence (ENHANCED)
├── PubMed: Published outbreak reports
├── BioRxiv/MedRxiv: Recent preprints (CRITICAL for outbreaks)
├── ArXiv: Computational/ML preprints
├── OpenAlex: Citation tracking
└── OUTPUT: Evidence synthesis
Phase 6: Report Synthesis
├── Top drug candidates
├── Clinical trial opportunities
├── Recommended immediate actions
└── OUTPUT: Final report

阶段1:病原体识别
├── 分类学分类
├── 亲缘病原体(用于知识迁移)
├── 基因组/蛋白质组可用性
└── 输出:病原体概况
阶段2:靶点识别
├── 必需基因/蛋白
├── 菌株间保守性
├── 成药性评估
└── 输出:优先靶点列表
阶段3:结构预测(NvidiaNIM)
├── 针对靶点使用AlphaFold2/ESMFold
├── 结合位点识别
├── 质量评估(pLDDT)
└── 输出:靶点结构
阶段4:药物重定位筛选
├── 针对亲缘病原体的已批准药物
├── 广谱抗病毒/抗生素
├── 分子对接筛选(NvidiaNIM_diffdock)
└── 输出:候选药物
阶段4.5:通路分析(新增)
├── KEGG:病原体代谢通路
├── 必需代谢靶点
├── 宿主-病原体互作通路
└── 输出:基于通路的药物靶点
阶段5:文献情报(增强版)
├── PubMed:已发表的暴发报告
├── BioRxiv/MedRxiv:最新预印本(暴发期间至关重要)
├── ArXiv:计算/机器学习预印本
├── OpenAlex:引文追踪
└── 输出:证据综合
阶段6:报告合成
├── 顶级药物候选物
├── 临床试验机会
├── 推荐立即行动项
└── 输出:最终报告

Phase 1: Pathogen Identification

阶段1:病原体识别

1.1 Taxonomic Classification

1.1 分类学分类

python
def identify_pathogen(tu, pathogen_query):
    """Classify pathogen taxonomically."""
    
    # NCBI Taxonomy search
    taxonomy = tu.tools.NCBI_Taxonomy_search(query=pathogen_query)
    
    return {
        'taxid': taxonomy.get('taxid'),
        'scientific_name': taxonomy.get('scientific_name'),
        'rank': taxonomy.get('rank'),
        'lineage': taxonomy.get('lineage'),
        'type': classify_type(taxonomy)  # virus, bacteria, fungus, parasite
    }
python
def identify_pathogen(tu, pathogen_query):
    """对病原体进行分类学分类。"""
    
    # NCBI分类搜索
    taxonomy = tu.tools.NCBI_Taxonomy_search(query=pathogen_query)
    
    return {
        'taxid': taxonomy.get('taxid'),
        'scientific_name': taxonomy.get('scientific_name'),
        'rank': taxonomy.get('rank'),
        'lineage': taxonomy.get('lineage'),
        'type': classify_type(taxonomy)  # virus, bacteria, fungus, parasite
    }

1.2 Related Pathogens (Knowledge Transfer)

1.2 亲缘病原体(知识迁移)

python
def find_related_pathogens(tu, taxid):
    """Find related pathogens for drug knowledge transfer."""
    
    # Get family/genus level relatives
    relatives = tu.tools.NCBI_Taxonomy_get_children(
        taxid=taxid,
        rank="genus"
    )
    
    # Find relatives with approved drugs
    related_with_drugs = []
    for rel in relatives:
        drugs = tu.tools.ChEMBL_search_targets(
            query=rel['scientific_name'],
            organism_contains=True
        )
        if drugs:
            related_with_drugs.append({
                'pathogen': rel,
                'drugs': drugs
            })
    
    return related_with_drugs
python
def find_related_pathogens(tu, taxid):
    """寻找亲缘病原体以进行药物知识迁移。"""
    
    # 获取科/属级别的亲缘病原体
    relatives = tu.tools.NCBI_Taxonomy_get_children(
        taxid=taxid,
        rank="genus"
    )
    
    # 寻找有已批准药物的亲缘病原体
    related_with_drugs = []
    for rel in relatives:
        drugs = tu.tools.ChEMBL_search_targets(
            query=rel['scientific_name'],
            organism_contains=True
        )
        if drugs:
            related_with_drugs.append({
                'pathogen': rel,
                'drugs': drugs
            })
    
    return related_with_drugs

1.3 Output for Report

1.3 报告输出示例

markdown
undefined
markdown
undefined

1. Pathogen Profile

1. 病原体概况

1.1 Taxonomic Classification

1.1 分类学分类

PropertyValue
OrganismSARS-CoV-2
Taxonomy ID2697049
TypeRNA virus (positive-sense, single-stranded)
FamilyCoronaviridae
GenusBetacoronavirus
LineageRiboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales
属性
生物SARS-CoV-2
分类ID2697049
类型RNA病毒(正链单链)
冠状病毒科
β冠状病毒属
谱系Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales

1.2 Related Pathogens with Drug Precedent

1.2 有药物先例的亲缘病原体

RelativeSimilarityApproved DrugsRelevance
SARS-CoV79% genomeRemdesivir (EUA)High
MERS-CoV50% genomeNone approvedMedium
HCoV-229E45% genomeNone specificLow
Knowledge Transfer Opportunity: SARS-CoV drug development data highly relevant.
Source: NCBI Taxonomy, ChEMBL

---
亲缘病原体相似度已批准药物相关性
SARS-CoV基因组相似度79%Remdesivir(紧急使用授权)
MERS-CoV基因组相似度50%无已批准药物
HCoV-229E基因组相似度45%无特异性药物
知识迁移机会:SARS-CoV的药物开发数据高度相关。
来源:NCBI分类学, ChEMBL

---

Phase 2: Target Identification

阶段2:靶点识别

2.1 Essential Protein Identification

2.1 关键蛋白识别

python
def identify_targets(tu, pathogen_name):
    """Identify essential druggable targets."""
    
    # Search UniProt for pathogen proteins
    proteins = tu.tools.UniProt_search(
        query=f"organism:{pathogen_name}",
        reviewed=True
    )
    
    # Prioritize by essentiality and druggability
    targets = []
    for protein in proteins:
        # Check for known drug interactions
        chembl_target = tu.tools.ChEMBL_search_targets(
            query=protein['gene_name']
        )
        
        targets.append({
            'uniprot': protein['accession'],
            'name': protein['protein_name'],
            'function': protein['function'],
            'has_drug_precedent': len(chembl_target) > 0,
            'druggability': assess_druggability(protein)
        })
    
    return rank_targets(targets)
python
def identify_targets(tu, pathogen_name):
    """识别必需的可成药靶点。"""
    
    # 搜索UniProt获取病原体蛋白
    proteins = tu.tools.UniProt_search(
        query=f"organism:{pathogen_name}",
        reviewed=True
    )
    
    # 根据必需性和成药性优先排序
    targets = []
    for protein in proteins:
        # 检查已知药物相互作用
        chembl_target = tu.tools.ChEMBL_search_targets(
            query=protein['gene_name']
        )
        
        targets.append({
            'uniprot': protein['accession'],
            'name': protein['protein_name'],
            'function': protein['function'],
            'has_drug_precedent': len(chembl_target) > 0,
            'druggability': assess_druggability(protein)
        })
    
    return rank_targets(targets)

2.2 Target Prioritization Criteria

2.2 靶点优先排序标准

CriterionWeightDescription
Essentiality30%Required for replication/survival
Conservation25%Conserved across strains/variants
Druggability25%Structural features amenable to binding
Drug precedent20%Existing drugs for homologous targets
标准权重描述
必需性30%复制/存活必需
保守性25%在菌株/变异株间保守
成药性25%具有适合结合的结构特征
药物先例20%针对同源靶点的现有药物

2.3 Output for Report

2.3 报告输出示例

markdown
undefined
markdown
undefined

2. Druggable Targets

2. 可成药靶点

2.1 Prioritized Target List

2.1 优先靶点列表

RankTargetUniProtFunctionScoreDrug Precedent
1RdRp (NSP12)P0DTD1RNA replication92Remdesivir
2Main protease (Mpro)P0DTD1Polyprotein cleavage88Nirmatrelvir
3Papain-like proteaseP0DTD1Polyprotein cleavage75GRL0617 (preclinical)
4Spike proteinP0DTC2Host cell entry70Antibodies
5Helicase (NSP13)P0DTD1RNA unwinding65None approved
排名靶点UniProt功能得分药物先例
1RdRp(NSP12)P0DTD1RNA复制92Remdesivir
2主蛋白酶(Mpro)P0DTD1多蛋白切割88Nirmatrelvir
3木瓜样蛋白酶P0DTD1多蛋白切割75GRL0617(临床前)
4刺突蛋白P0DTC2宿主细胞进入70抗体
5解旋酶(NSP13)P0DTD1RNA解旋65无已批准药物

2.2 Target Details

2.2 靶点详情

Target 1: RNA-dependent RNA polymerase (RdRp/NSP12)

靶点1:RNA依赖的RNA聚合酶(RdRp/NSP12)

PropertyValue
UniProtP0DTD1 (polyprotein position 4393-5324)
Length932 amino acids
FunctionCatalyzes RNA synthesis from RNA template
EssentialityAbsolute (no replication without RdRp)
Conservation>99% across all SARS-CoV-2 variants
Binding siteNucleotide binding pocket
Drug precedentRemdesivir (FDA approved), Favipiravir
Source: UniProt, ChEMBL

---
属性
UniProtP0DTD1(多蛋白位置4393-5324)
长度932个氨基酸
功能催化以RNA为模板的RNA合成
必需性绝对必需(无RdRp则无法复制)
保守性在所有SARS-CoV-2变异株中保守性>99%
结合位点核苷酸结合口袋
药物先例Remdesivir(FDA批准)、Favipiravir
来源:UniProt, ChEMBL

---

Phase 3: Structure Prediction

阶段3:结构预测

3.1 AlphaFold2 Structure Prediction (NVIDIA NIM)

3.1 AlphaFold2结构预测(NVIDIA NIM)

python
def predict_target_structure(tu, sequence, target_name):
    """Predict structure for target protein."""
    
    # Use AlphaFold2 for high accuracy
    structure = tu.tools.NvidiaNIM_alphafold2(
        sequence=sequence,
        algorithm="mmseqs2",
        relax_prediction=False
    )
    
    # Parse pLDDT confidence
    plddt_scores = parse_plddt(structure)
    
    return {
        'structure': structure['structure'],
        'mean_plddt': np.mean(plddt_scores),
        'high_confidence_regions': get_high_confidence(plddt_scores),
        'predicted_binding_site': identify_binding_site(structure)
    }
python
def predict_target_structure(tu, sequence, target_name):
    """预测靶点蛋白的结构。"""
    
    # 使用AlphaFold2以获取高精度结果
    structure = tu.tools.NvidiaNIM_alphafold2(
        sequence=sequence,
        algorithm="mmseqs2",
        relax_prediction=False
    )
    
    # 解析pLDDT置信度
    plddt_scores = parse_plddt(structure)
    
    return {
        'structure': structure['structure'],
        'mean_plddt': np.mean(plddt_scores),
        'high_confidence_regions': get_high_confidence(plddt_scores),
        'predicted_binding_site': identify_binding_site(structure)
    }

3.2 Structure Quality Assessment

3.2 结构质量评估

pLDDT RangeConfidenceUse for Docking
>90Very HighExcellent
70-90HighGood
50-70MediumUse caution
<50LowNot recommended
pLDDT范围置信度对接适用性
>90极高极佳
70-90良好
50-70谨慎使用
<50不推荐

3.3 Output for Report

3.3 报告输出示例

markdown
undefined
markdown
undefined

3. Target Structures

3. 靶点结构

3.1 Structure Prediction Results

3.1 结构预测结果

TargetMethodLengthMean pLDDTDocking Ready
RdRp (NSP12)AlphaFold2932 aa91.2✓ Yes
MproAlphaFold2306 aa93.5✓ Yes
PLproAlphaFold2315 aa88.7✓ Yes
靶点方法长度平均pLDDT可用于对接
RdRp(NSP12)AlphaFold2932 aa91.2✓ 是
MproAlphaFold2306 aa93.5✓ 是
PLproAlphaFold2315 aa88.7✓ 是

3.2 RdRp Structure Quality

3.2 RdRp结构质量

RegionResiduespLDDTFunctional Role
Palm domain582-62094.2Catalytic site
Fingers domain397-58191.8NTP entry
Thumb domain621-81589.4RNA binding
Active siteD760, D76196.1Catalysis
Docking Recommendation: Structure suitable for docking; active site highly confident.
Source: NVIDIA NIM via
NvidiaNIM_alphafold2

---
区域残基pLDDT功能角色
手掌结构域582-62094.2催化位点
手指结构域397-58191.8NTP进入
拇指结构域621-81589.4RNA结合
活性位点D760, D76196.1催化
对接建议:结构适用于对接;活性位点置信度极高。
来源:NVIDIA NIM via
NvidiaNIM_alphafold2

---

Phase 4: Drug Repurposing Screen

阶段4:药物重定位筛选

4.1 Identify Repurposing Candidates

4.1 识别重定位候选药物

python
def get_repurposing_candidates(tu, target_name, pathogen_family):
    """Find approved drugs to repurpose."""
    
    candidates = []
    
    # 1. Drugs approved for related pathogens
    related_drugs = tu.tools.ChEMBL_search_drugs(
        query=pathogen_family,
        max_phase=4
    )
    candidates.extend(related_drugs)
    
    # 2. Broad-spectrum antivirals
    antivirals = tu.tools.ChEMBL_search_drugs(
        query="broad spectrum antiviral",
        max_phase=4
    )
    candidates.extend(antivirals)
    
    # 3. Drugs with known activity against target class
    target_class_drugs = tu.tools.DGIdb_get_drug_gene_interactions(
        genes=[target_name]
    )
    candidates.extend(target_class_drugs)
    
    return deduplicate(candidates)
python
def get_repurposing_candidates(tu, target_name, pathogen_family):
    """寻找可重定位的已批准药物。"""
    
    candidates = []
    
    # 1. 针对亲缘病原体的已批准药物
    related_drugs = tu.tools.ChEMBL_search_drugs(
        query=pathogen_family,
        max_phase=4
    )
    candidates.extend(related_drugs)
    
    # 2. 广谱抗病毒药物
    antivirals = tu.tools.ChEMBL_search_drugs(
        query="broad spectrum antiviral",
        max_phase=4
    )
    candidates.extend(antivirals)
    
    # 3. 针对靶点类别的药物
    target_class_drugs = tu.tools.DGIdb_get_drug_gene_interactions(
        genes=[target_name]
    )
    candidates.extend(target_class_drugs)
    
    return deduplicate(candidates)

4.2 Docking Screen (NVIDIA NIM)

4.2 分子对接筛选(NVIDIA NIM)

python
def dock_candidates(tu, target_structure, candidate_smiles_list):
    """Dock candidate drugs against target."""
    
    results = []
    for smiles in candidate_smiles_list:
        docking = tu.tools.NvidiaNIM_diffdock(
            protein=target_structure,
            ligand=smiles,
            num_poses=5
        )
        
        results.append({
            'smiles': smiles,
            'top_score': docking['poses'][0]['confidence'],
            'poses': docking['poses']
        })
    
    return sorted(results, key=lambda x: x['top_score'], reverse=True)
python
def dock_candidates(tu, target_structure, candidate_smiles_list):
    """将候选药物与靶点进行分子对接。"""
    
    results = []
    for smiles in candidate_smiles_list:
        docking = tu.tools.NvidiaNIM_diffdock(
            protein=target_structure,
            ligand=smiles,
            num_poses=5
        )
        
        results.append({
            'smiles': smiles,
            'top_score': docking['poses'][0]['confidence'],
            'poses': docking['poses']
        })
    
    return sorted(results, key=lambda x: x['top_score'], reverse=True)

4.3 Output for Report

4.3 报告输出示例

markdown
undefined
markdown
undefined

4. Drug Repurposing Screen

4. 药物重定位筛选

4.1 Candidate Identification

4.1 候选药物识别

SourceCandidatesFDA Approved
Related pathogen drugs128
Broad-spectrum antivirals1511
Target class drugs85
Total unique2819
来源候选药物数量FDA批准
亲缘病原体药物128
广谱抗病毒药物1511
靶点类别药物85
总计去重后2819

4.2 Docking Results (RdRp Target)

4.2 对接结果(RdRp靶点)

RankDrugIndicationDocking ScoreEvidence
1RemdesivirCOVID-190.92★★★ FDA approved
2FavipiravirInfluenza0.87★★☆ Phase 3 COVID
3SofosbuvirHCV0.84★★☆ In vitro active
4RibavirinRSV, HCV0.78★☆☆ Mixed results
5MolnupiravirCOVID-190.76★★★ FDA approved
排名药物适应症对接得分证据
1RemdesivirCOVID-190.92★★★ FDA批准
2Favipiravir流感0.87★★☆ COVID-19 3期临床
3SofosbuvirHCV0.84★★☆ 体外活性
4RibavirinRSV, HCV0.78★☆☆ 结果不一
5MolnupiravirCOVID-190.76★★★ FDA批准

4.3 Top Candidate: Remdesivir

4.3 顶级候选药物:Remdesivir

PropertyValue
Docking score0.92 (excellent)
MechanismRdRp inhibitor (nucleotide analog)
FDA statusApproved for COVID-19
Clinical evidenceACTT-1: Reduced recovery time
Binding modeActive site, chain termination
Source: NVIDIA NIM via
NvidiaNIM_diffdock
, ChEMBL

---
属性
对接得分0.92(极佳)
作用机制RdRp抑制剂(核苷酸类似物)
FDA状态批准用于COVID-19
临床证据ACTT-1:缩短恢复时间
结合模式结合活性位点,终止链合成
来源:NVIDIA NIM via
NvidiaNIM_diffdock
, ChEMBL

---

Phase 4.5: Pathway Analysis (NEW)

阶段4.5:通路分析(新增)

4.5.1 Pathogen Metabolism Pathways

4.5.1 病原体代谢通路

python
def analyze_pathogen_pathways(tu, pathogen_name, pathogen_type):
    """Identify druggable metabolic pathways in pathogen."""
    
    # KEGG pathogen pathways
    pathways = tu.tools.kegg_search_pathway(
        query=f"{pathogen_name} metabolism"
    )
    
    # Essential metabolic genes
    essential_genes = tu.tools.kegg_get_pathway_genes(
        pathway_id=pathways[0]['pathway_id']
    )
    
    # Host-pathogen interaction pathways
    host_pathogen = tu.tools.kegg_search_pathway(
        query=f"{pathogen_name} host interaction"
    )
    
    return {
        'metabolic_pathways': pathways,
        'essential_genes': essential_genes,
        'host_interaction': host_pathogen
    }
python
def analyze_pathogen_pathways(tu, pathogen_name, pathogen_type):
    """识别病原体中可成药的代谢通路。"""
    
    # KEGG病原体通路
    pathways = tu.tools.kegg_search_pathway(
        query=f"{pathogen_name} metabolism"
    )
    
    # 必需代谢基因
    essential_genes = tu.tools.kegg_get_pathway_genes(
        pathway_id=pathways[0]['pathway_id']
    )
    
    # 宿主-病原体互作通路
    host_pathogen = tu.tools.kegg_search_pathway(
        query=f"{pathogen_name} host interaction"
    )
    
    return {
        'metabolic_pathways': pathways,
        'essential_genes': essential_genes,
        'host_interaction': host_pathogen
    }

4.5.2 Output for Report

4.5.2 报告输出示例

markdown
undefined
markdown
undefined

4.5 Pathway Analysis

4.5 通路分析

Pathogen Metabolic Pathways (KEGG)

病原体代谢通路(KEGG)

PathwayEssentialityDrug Targets
Viral replication (ko03030)EssentialRdRp, Helicase
Viral protein processingEssentialMpro, PLpro
Host membrane interactionEssentialSpike, ACE2
通路必需性药物靶点
病毒复制(ko03030)必需RdRp, 解旋酶
病毒蛋白加工必需Mpro, PLpro
宿主膜互作必需刺突蛋白, ACE2

Druggable Pathway Targets

可成药通路靶点

TargetPathwayKnown DrugsEvidence
RdRpViral replicationRemdesivir★★★
3CLproProtein processingNirmatrelvir★★★
PLproProtein processingGRL-0617★★☆
靶点通路已知药物证据
RdRp病毒复制Remdesivir★★★
3CLpro蛋白加工Nirmatrelvir★★★
PLpro蛋白加工GRL-0617★★☆

Host-Pathogen Interaction Points

宿主-病原体互作位点

InteractionHost ProteinPathwayDruggability
EntryACE2Cell surface★★☆
FusionTMPRSS2Protease★★★
ReplicationHost ribosomesTranslation★☆☆
Source: KEGG, Reactome

---
互作宿主蛋白通路成药性
进入ACE2细胞表面★★☆
融合TMPRSS2蛋白酶★★★
复制宿主核糖体翻译★☆☆
来源:KEGG, Reactome

---

Phase 5: Literature Intelligence (ENHANCED)

阶段5:文献情报(增强版)

5.1 Comprehensive Literature Search

5.1 综合文献搜索

python
def comprehensive_outbreak_literature(tu, pathogen_name):
    """Search all literature sources for outbreak intelligence."""
    
    # PubMed: Peer-reviewed
    pubmed = tu.tools.PubMed_search_articles(
        query=f"{pathogen_name} AND (outbreak OR treatment OR drug)",
        limit=50,
        sort="date"
    )
    
    # BioRxiv: CRITICAL for outbreaks - newest findings
    biorxiv = tu.tools.BioRxiv_search_preprints(
        query=f"{pathogen_name} treatment mechanism",
        limit=20
    )
    
    # MedRxiv: Clinical preprints
    medrxiv = tu.tools.MedRxiv_search_preprints(
        query=f"{pathogen_name} clinical trial",
        limit=20
    )
    
    # ArXiv: Computational/ML papers
    arxiv = tu.tools.ArXiv_search_papers(
        query=f"{pathogen_name} drug discovery",
        category="q-bio",
        limit=10
    )
    
    # Clinical trials
    trials = tu.tools.search_clinical_trials(
        condition=pathogen_name,
        status="Recruiting"
    )
    
    # Citation analysis
    key_papers = pubmed[:10]
    for paper in key_papers:
        citation = tu.tools.openalex_search_works(
            query=paper['title'],
            limit=1
        )
        paper['citations'] = citation[0].get('cited_by_count', 0) if citation else 0
    
    return {
        'pubmed': pubmed,
        'biorxiv': biorxiv,
        'medrxiv': medrxiv,
        'arxiv': arxiv,
        'trials': trials,
        'key_papers': key_papers
    }
python
def comprehensive_outbreak_literature(tu, pathogen_name):
    """搜索所有文献源以获取暴发情报。"""
    
    # PubMed:同行评审文献
    pubmed = tu.tools.PubMed_search_articles(
        query=f"{pathogen_name} AND (outbreak OR treatment OR drug)",
        limit=50,
        sort="date"
    )
    
    # BioRxiv:暴发期间至关重要 - 最新发现
    biorxiv = tu.tools.BioRxiv_search_preprints(
        query=f"{pathogen_name} treatment mechanism",
        limit=20
    )
    
    # MedRxiv:临床预印本
    medrxiv = tu.tools.MedRxiv_search_preprints(
        query=f"{pathogen_name} clinical trial",
        limit=20
    )
    
    # ArXiv:计算/机器学习论文
    arxiv = tu.tools.ArXiv_search_papers(
        query=f"{pathogen_name} drug discovery",
        category="q-bio",
        limit=10
    )
    
    # 临床试验
    trials = tu.tools.search_clinical_trials(
        condition=pathogen_name,
        status="Recruiting"
    )
    
    # 引文分析
    key_papers = pubmed[:10]
    for paper in key_papers:
        citation = tu.tools.openalex_search_works(
            query=paper['title'],
            limit=1
        )
        paper['citations'] = citation[0].get('cited_by_count', 0) if citation else 0
    
    return {
        'pubmed': pubmed,
        'biorxiv': biorxiv,
        'medrxiv': medrxiv,
        'arxiv': arxiv,
        'trials': trials,
        'key_papers': key_papers
    }

5.2 Output for Report

5.2 报告输出示例

markdown
undefined
markdown
undefined

5. Literature Intelligence

5. 文献情报

5.1 Published Literature (Peer-Reviewed)

5.1 已发表文献(同行评审)

TopicPapersKey Finding
Treatment234Paxlovid remains effective
Resistance45Nirmatrelvir resistance mutations identified
Variants189XBB variants maintain drug sensitivity
Vaccines312Updated boosters protective
主题论文数量关键发现
治疗234Paxlovid仍保持有效性
耐药性45已识别Nirmatrelvir耐药突变
变异株189XBB变异株对药物保持敏感性
疫苗312更新后的加强针具有保护性

5.2 Preprints (CRITICAL for Emerging Outbreaks)

5.2 预印本(暴发期间至关重要)

⚠️ Note: Preprints are NOT peer-reviewed. Critical for rapid intelligence but use with caution.
SourceTitlePostedKey Finding
BioRxivNovel RdRp inhibitor shows activity...2024-02-01New candidate
MedRxivReal-world effectiveness of...2024-01-28Paxlovid 85% effective
BioRxivResistance mutations in...2024-01-25Monitor L50F mutation
⚠️ 注意:预印本未经过同行评审。对快速获取情报至关重要,但需谨慎使用。
来源标题发布日期关键发现
BioRxiv新型RdRp抑制剂显示出活性...2024-02-01新候选药物
MedRxiv...的真实世界有效性2024-01-28Paxlovid有效性达85%
BioRxiv...中的耐药突变2024-01-25需监测L50F突变

5.3 Computational/ML Preprints (ArXiv)

5.3 计算/机器学习预印本(ArXiv)

TitleCategoryRelevance
Deep learning for antiviral discoveryq-bio.BMDrug design
Structure prediction for novel...q-bio.BMTarget modeling
标题分类相关性
深度学习用于抗病毒药物发现q-bio.BM药物设计
新型...的结构预测q-bio.BM靶点建模

5.4 Active Clinical Trials

5.4 活跃临床试验

NCT IDPhaseDrugStatus
NCT050123453EnsitrelvirRecruiting
NCT050234562VV116Recruiting
NCT050345672S-217622Active
NCT编号阶段药物状态
NCT050123453Ensitrelvir招募中
NCT050234562VV116招募中
NCT050345672S-217622进行中

5.5 Citation Analysis (High-Impact Papers)

5.5 引文分析(高影响力论文)

PMIDTitleCitationsYear
33123456Remdesivir for COVID-195,2342020
34234567Paxlovid Phase 3 results2,8762022
Source: PubMed, BioRxiv, MedRxiv, ArXiv, OpenAlex, ClinicalTrials.gov

---
PMID标题引用量年份
33123456Remdesivir用于COVID-195,2342020
34234567Paxlovid 3期结果2,8762022
来源:PubMed, BioRxiv, MedRxiv, ArXiv, OpenAlex, ClinicalTrials.gov

---

Report Template

报告模板

markdown
undefined
markdown
undefined

Outbreak Intelligence Report: [PATHOGEN]

暴发情报报告:[PATHOGEN]

Generated: [Date] | Query: [Original query] | Status: In Progress

生成时间:[日期] | 查询:[原始查询] | 状态:分析中

Executive Summary

执行摘要

[Analyzing...]

[分析中...]

1. Pathogen Profile

1. 病原体概况

1.1 Classification

1.1 分类

[Analyzing...]
[分析中...]

1.2 Related Pathogens

1.2 亲缘病原体

[Analyzing...]

[分析中...]

2. Druggable Targets

2. 可成药靶点

2.1 Prioritized Targets

2.1 优先靶点

[Analyzing...]
[分析中...]

2.2 Target Details

2.2 靶点详情

[Analyzing...]

[分析中...]

3. Target Structures

3. 靶点结构

3.1 Prediction Results

3.1 预测结果

[Analyzing...]
[分析中...]

3.2 Binding Sites

3.2 结合位点

[Analyzing...]

[分析中...]

4. Drug Repurposing Screen

4. 药物重定位筛选

4.1 Candidate Drugs

4.1 候选药物

[Analyzing...]
[分析中...]

4.2 Docking Results

4.2 对接结果

[Analyzing...]
[分析中...]

4.3 Top Candidates

4.3 顶级候选药物

[Analyzing...]

[分析中...]

5. Literature Intelligence

5. 文献情报

5.1 Recent Findings

5.1 最新发现

[Analyzing...]
[分析中...]

5.2 Clinical Trials

5.2 临床试验

[Analyzing...]

[分析中...]

6. Recommendations

6. 建议

6.1 Immediate Actions

6.1 立即行动项

[Analyzing...]
[分析中...]

6.2 Clinical Trial Opportunities

6.2 临床试验机会

[Analyzing...]
[分析中...]

6.3 Research Priorities

6.3 研究优先级

[Analyzing...]

[分析中...]

7. Data Gaps & Limitations

7. 数据缺口与局限性

[Analyzing...]

[分析中...]

8. Data Sources

8. 数据来源

[Will be populated...]

---
[将填充...]

---

Evidence Grading

证据分级

TierSymbolCriteriaExample
T1★★★FDA approved for this pathogenRemdesivir for COVID
T2★★☆Clinical trial evidence OR approved for related pathogenFavipiravir
T3★☆☆In vitro activity OR strong docking + mechanismSofosbuvir
T4☆☆☆Computational prediction onlyNovel docking hits

层级符号标准示例
T1★★★FDA批准用于该病原体Remdesivir用于COVID-19
T2★★☆临床试验证据 或 批准用于亲缘病原体Favipiravir
T3★☆☆体外活性 或 高对接得分+作用机制Sofosbuvir
T4☆☆☆仅计算预测新型对接命中物

Completeness Checklist

完整性检查清单

Phase 1: Pathogen ID

阶段1:病原体识别

  • Taxonomic classification complete
  • Related pathogens identified
  • Genome/proteome availability noted
  • 分类学分类完成
  • 亲缘病原体已识别
  • 基因组/蛋白质组可用性已记录

Phase 2: Targets

阶段2:靶点

  • ≥5 targets identified
  • Essentiality documented
  • Conservation assessed
  • Drug precedent checked
  • 识别≥5个靶点
  • 必需性已记录
  • 保守性已评估
  • 药物先例已检查

Phase 3: Structures

阶段3:结构

  • Structures predicted for top 3 targets
  • pLDDT confidence reported
  • Binding sites identified
  • 已为前3个靶点预测结构
  • 已报告pLDDT置信度
  • 已识别结合位点

Phase 4: Drug Screen

阶段4:药物筛选

  • ≥20 candidates screened
  • FDA-approved drugs prioritized
  • Docking scores reported
  • Top 5 candidates detailed
  • 已筛选≥20个候选药物
  • 已优先考虑FDA批准药物
  • 已报告对接得分
  • 已详细说明前5个候选药物

Phase 5: Literature

阶段5:文献

  • Recent papers summarized
  • Active trials listed
  • Resistance data noted
  • 已总结最新论文
  • 已列出活跃试验
  • 已记录耐药性数据

Phase 6: Recommendations

阶段6:建议

  • ≥3 immediate actions
  • Clinical trial opportunities
  • Research priorities

  • ≥3个立即行动项
  • 临床试验机会
  • 研究优先级

Fallback Chains

备选工具链

Primary ToolFallback 1Fallback 2
NvidiaNIM_alphafold2
alphafold_get_prediction
NvidiaNIM_esmfold
NvidiaNIM_diffdock
NvidiaNIM_boltz2
Manual docking
NCBI_Taxonomy_search
UniProt_taxonomy
Manual classification
ChEMBL_search_drugs
DrugBank_search
PubChem bioassays

主工具备选工具1备选工具2
NvidiaNIM_alphafold2
alphafold_get_prediction
NvidiaNIM_esmfold
NvidiaNIM_diffdock
NvidiaNIM_boltz2
手动对接
NCBI_Taxonomy_search
UniProt_taxonomy
手动分类
ChEMBL_search_drugs
DrugBank_search
PubChem生物测定

Tool Reference

工具参考

See TOOLS_REFERENCE.md for complete tool documentation.
完整工具文档请参见TOOLS_REFERENCE.md