tooluniverse-infectious-disease
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseInfectious Disease Outbreak Intelligence
传染病暴发智能分析系统
Rapid response system for emerging pathogens using taxonomy analysis, target identification, structure prediction, and computational drug repurposing.
KEY PRINCIPLES:
- Speed is critical - Optimize for rapid actionable intelligence
- Target essential proteins - Focus on conserved, essential viral/bacterial proteins
- Leverage existing drugs - Prioritize FDA-approved compounds for repurposing
- Structure-guided - Use NvidiaNIM for rapid structure prediction and docking
- Evidence-graded - Grade repurposing candidates by evidence strength
- Actionable output - Prioritized drug candidates with rationale
- English-first queries - Always use English terms in tool calls (pathogen names, protein names, drug names), even if the user writes in another language. Only try original-language terms as a fallback. Respond in the user's language
针对新发病原体的快速响应系统,采用分类分析、靶点识别、结构预测和计算药物重定位技术。
核心原则:
- 速度至关重要 - 优化以获取可快速落地的情报
- 靶向关键蛋白 - 聚焦保守、必需的病毒/细菌蛋白
- 利用现有药物 - 优先考虑FDA批准的可重定位化合物
- 结构导向 - 使用NvidiaNIM进行快速结构预测与分子对接
- 证据分级 - 根据证据强度对重定位候选药物分级
- 可落地输出 - 附带依据的优先推荐药物候选
- 优先使用英文查询 - 工具调用中始终使用英文术语(病原体名称、蛋白名称、药物名称),即使用户使用其他语言提问。仅在必要时尝试使用原语言术语作为备选。以用户使用的语言回复
When to Use
适用场景
Apply when user asks:
- "New pathogen detected - what drugs might work?"
- "Emerging virus [X] - therapeutic options?"
- "Drug repurposing candidates for [pathogen]"
- "What do we know about [novel coronavirus/bacteria]?"
- "Essential targets in [pathogen] for drug development"
- "Can we repurpose [drug] against [pathogen]?"
当用户提出以下问题时适用:
- "检测到新型病原体,哪些药物可能有效?"
- "新发病毒[X]的治疗方案有哪些?"
- "[病原体]的药物重定位候选物"
- "我们对[新型冠状病毒/细菌]了解多少?"
- "[病原体]中用于药物开发的关键靶点"
- "我们能否将[药物]重定位用于对抗[病原体]?"
Critical Workflow Requirements
关键工作流要求
1. Report-First Approach (MANDATORY)
1. 报告优先方法(强制要求)
-
Create the report file FIRST:
- File name:
[PATHOGEN]_outbreak_intelligence.md - Initialize with section headers
- Add placeholder:
[Analyzing...]
- File name:
-
Progressively update as you gather data
-
Output separate files:
- - Ranked repurposing candidates
[PATHOGEN]_drug_candidates.csv - - Druggable targets
[PATHOGEN]_target_proteins.csv
-
首先创建报告文件:
- 文件名:
[PATHOGEN]_outbreak_intelligence.md - 初始化时添加章节标题
- 添加占位符:
[分析中...]
- 文件名:
-
收集数据时逐步更新
-
输出独立文件:
- - 排序后的重定位候选药物
[PATHOGEN]_drug_candidates.csv - - 可成药靶点
[PATHOGEN]_target_proteins.csv
2. Citation Requirements (MANDATORY)
2. 引用要求(强制要求)
markdown
undefinedmarkdown
undefinedTarget: RNA-dependent RNA polymerase (RdRp)
靶点:RNA依赖的RNA聚合酶(RdRp)
- UniProt: P0DTD1 (NSP12)
- Essentiality: Required for replication
- Conservation: >95% across variants
- Drug precedent: Remdesivir targets RdRp
Source: UniProt via , literature review
UniProt_search
---- UniProt:P0DTD1(NSP12)
- 必需性:复制必需
- 保守性:在各变异株中保守性>95%
- 药物先例:Remdesivir靶向RdRp
来源:UniProt via , 文献综述
UniProt_search
---Phase 0: Tool Verification
阶段0:工具验证
Known Parameter Corrections
已知参数修正
| Tool | WRONG Parameter | CORRECT Parameter |
|---|---|---|
| | |
| | |
| | |
| | |
| 工具 | 错误参数 | 正确参数 |
|---|---|---|
| | |
| | |
| | |
| | |
Workflow Overview
工作流概览
Phase 1: Pathogen Identification
├── Taxonomic classification
├── Closest relatives (for knowledge transfer)
├── Genome/proteome availability
└── OUTPUT: Pathogen profile
↓
Phase 2: Target Identification
├── Essential genes/proteins
├── Conserved across strains
├── Druggability assessment
└── OUTPUT: Prioritized target list
↓
Phase 3: Structure Prediction (NvidiaNIM)
├── AlphaFold2/ESMFold for targets
├── Binding site identification
├── Quality assessment (pLDDT)
└── OUTPUT: Target structures
↓
Phase 4: Drug Repurposing Screen
├── Approved drugs for related pathogens
├── Broad-spectrum antivirals/antibiotics
├── Docking screen (NvidiaNIM_diffdock)
└── OUTPUT: Candidate drugs
↓
Phase 4.5: Pathway Analysis (NEW)
├── KEGG: Pathogen metabolism pathways
├── Essential metabolic targets
├── Host-pathogen interaction pathways
└── OUTPUT: Pathway-based drug targets
↓
Phase 5: Literature Intelligence (ENHANCED)
├── PubMed: Published outbreak reports
├── BioRxiv/MedRxiv: Recent preprints (CRITICAL for outbreaks)
├── ArXiv: Computational/ML preprints
├── OpenAlex: Citation tracking
└── OUTPUT: Evidence synthesis
↓
Phase 6: Report Synthesis
├── Top drug candidates
├── Clinical trial opportunities
├── Recommended immediate actions
└── OUTPUT: Final report阶段1:病原体识别
├── 分类学分类
├── 亲缘病原体(用于知识迁移)
├── 基因组/蛋白质组可用性
└── 输出:病原体概况
↓
阶段2:靶点识别
├── 必需基因/蛋白
├── 菌株间保守性
├── 成药性评估
└── 输出:优先靶点列表
↓
阶段3:结构预测(NvidiaNIM)
├── 针对靶点使用AlphaFold2/ESMFold
├── 结合位点识别
├── 质量评估(pLDDT)
└── 输出:靶点结构
↓
阶段4:药物重定位筛选
├── 针对亲缘病原体的已批准药物
├── 广谱抗病毒/抗生素
├── 分子对接筛选(NvidiaNIM_diffdock)
└── 输出:候选药物
↓
阶段4.5:通路分析(新增)
├── KEGG:病原体代谢通路
├── 必需代谢靶点
├── 宿主-病原体互作通路
└── 输出:基于通路的药物靶点
↓
阶段5:文献情报(增强版)
├── PubMed:已发表的暴发报告
├── BioRxiv/MedRxiv:最新预印本(暴发期间至关重要)
├── ArXiv:计算/机器学习预印本
├── OpenAlex:引文追踪
└── 输出:证据综合
↓
阶段6:报告合成
├── 顶级药物候选物
├── 临床试验机会
├── 推荐立即行动项
└── 输出:最终报告Phase 1: Pathogen Identification
阶段1:病原体识别
1.1 Taxonomic Classification
1.1 分类学分类
python
def identify_pathogen(tu, pathogen_query):
"""Classify pathogen taxonomically."""
# NCBI Taxonomy search
taxonomy = tu.tools.NCBI_Taxonomy_search(query=pathogen_query)
return {
'taxid': taxonomy.get('taxid'),
'scientific_name': taxonomy.get('scientific_name'),
'rank': taxonomy.get('rank'),
'lineage': taxonomy.get('lineage'),
'type': classify_type(taxonomy) # virus, bacteria, fungus, parasite
}python
def identify_pathogen(tu, pathogen_query):
"""对病原体进行分类学分类。"""
# NCBI分类搜索
taxonomy = tu.tools.NCBI_Taxonomy_search(query=pathogen_query)
return {
'taxid': taxonomy.get('taxid'),
'scientific_name': taxonomy.get('scientific_name'),
'rank': taxonomy.get('rank'),
'lineage': taxonomy.get('lineage'),
'type': classify_type(taxonomy) # virus, bacteria, fungus, parasite
}1.2 Related Pathogens (Knowledge Transfer)
1.2 亲缘病原体(知识迁移)
python
def find_related_pathogens(tu, taxid):
"""Find related pathogens for drug knowledge transfer."""
# Get family/genus level relatives
relatives = tu.tools.NCBI_Taxonomy_get_children(
taxid=taxid,
rank="genus"
)
# Find relatives with approved drugs
related_with_drugs = []
for rel in relatives:
drugs = tu.tools.ChEMBL_search_targets(
query=rel['scientific_name'],
organism_contains=True
)
if drugs:
related_with_drugs.append({
'pathogen': rel,
'drugs': drugs
})
return related_with_drugspython
def find_related_pathogens(tu, taxid):
"""寻找亲缘病原体以进行药物知识迁移。"""
# 获取科/属级别的亲缘病原体
relatives = tu.tools.NCBI_Taxonomy_get_children(
taxid=taxid,
rank="genus"
)
# 寻找有已批准药物的亲缘病原体
related_with_drugs = []
for rel in relatives:
drugs = tu.tools.ChEMBL_search_targets(
query=rel['scientific_name'],
organism_contains=True
)
if drugs:
related_with_drugs.append({
'pathogen': rel,
'drugs': drugs
})
return related_with_drugs1.3 Output for Report
1.3 报告输出示例
markdown
undefinedmarkdown
undefined1. Pathogen Profile
1. 病原体概况
1.1 Taxonomic Classification
1.1 分类学分类
| Property | Value |
|---|---|
| Organism | SARS-CoV-2 |
| Taxonomy ID | 2697049 |
| Type | RNA virus (positive-sense, single-stranded) |
| Family | Coronaviridae |
| Genus | Betacoronavirus |
| Lineage | Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales |
| 属性 | 值 |
|---|---|
| 生物 | SARS-CoV-2 |
| 分类ID | 2697049 |
| 类型 | RNA病毒(正链单链) |
| 科 | 冠状病毒科 |
| 属 | β冠状病毒属 |
| 谱系 | Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales |
1.2 Related Pathogens with Drug Precedent
1.2 有药物先例的亲缘病原体
| Relative | Similarity | Approved Drugs | Relevance |
|---|---|---|---|
| SARS-CoV | 79% genome | Remdesivir (EUA) | High |
| MERS-CoV | 50% genome | None approved | Medium |
| HCoV-229E | 45% genome | None specific | Low |
Knowledge Transfer Opportunity: SARS-CoV drug development data highly relevant.
Source: NCBI Taxonomy, ChEMBL
---| 亲缘病原体 | 相似度 | 已批准药物 | 相关性 |
|---|---|---|---|
| SARS-CoV | 基因组相似度79% | Remdesivir(紧急使用授权) | 高 |
| MERS-CoV | 基因组相似度50% | 无已批准药物 | 中 |
| HCoV-229E | 基因组相似度45% | 无特异性药物 | 低 |
知识迁移机会:SARS-CoV的药物开发数据高度相关。
来源:NCBI分类学, ChEMBL
---Phase 2: Target Identification
阶段2:靶点识别
2.1 Essential Protein Identification
2.1 关键蛋白识别
python
def identify_targets(tu, pathogen_name):
"""Identify essential druggable targets."""
# Search UniProt for pathogen proteins
proteins = tu.tools.UniProt_search(
query=f"organism:{pathogen_name}",
reviewed=True
)
# Prioritize by essentiality and druggability
targets = []
for protein in proteins:
# Check for known drug interactions
chembl_target = tu.tools.ChEMBL_search_targets(
query=protein['gene_name']
)
targets.append({
'uniprot': protein['accession'],
'name': protein['protein_name'],
'function': protein['function'],
'has_drug_precedent': len(chembl_target) > 0,
'druggability': assess_druggability(protein)
})
return rank_targets(targets)python
def identify_targets(tu, pathogen_name):
"""识别必需的可成药靶点。"""
# 搜索UniProt获取病原体蛋白
proteins = tu.tools.UniProt_search(
query=f"organism:{pathogen_name}",
reviewed=True
)
# 根据必需性和成药性优先排序
targets = []
for protein in proteins:
# 检查已知药物相互作用
chembl_target = tu.tools.ChEMBL_search_targets(
query=protein['gene_name']
)
targets.append({
'uniprot': protein['accession'],
'name': protein['protein_name'],
'function': protein['function'],
'has_drug_precedent': len(chembl_target) > 0,
'druggability': assess_druggability(protein)
})
return rank_targets(targets)2.2 Target Prioritization Criteria
2.2 靶点优先排序标准
| Criterion | Weight | Description |
|---|---|---|
| Essentiality | 30% | Required for replication/survival |
| Conservation | 25% | Conserved across strains/variants |
| Druggability | 25% | Structural features amenable to binding |
| Drug precedent | 20% | Existing drugs for homologous targets |
| 标准 | 权重 | 描述 |
|---|---|---|
| 必需性 | 30% | 复制/存活必需 |
| 保守性 | 25% | 在菌株/变异株间保守 |
| 成药性 | 25% | 具有适合结合的结构特征 |
| 药物先例 | 20% | 针对同源靶点的现有药物 |
2.3 Output for Report
2.3 报告输出示例
markdown
undefinedmarkdown
undefined2. Druggable Targets
2. 可成药靶点
2.1 Prioritized Target List
2.1 优先靶点列表
| Rank | Target | UniProt | Function | Score | Drug Precedent |
|---|---|---|---|---|---|
| 1 | RdRp (NSP12) | P0DTD1 | RNA replication | 92 | Remdesivir |
| 2 | Main protease (Mpro) | P0DTD1 | Polyprotein cleavage | 88 | Nirmatrelvir |
| 3 | Papain-like protease | P0DTD1 | Polyprotein cleavage | 75 | GRL0617 (preclinical) |
| 4 | Spike protein | P0DTC2 | Host cell entry | 70 | Antibodies |
| 5 | Helicase (NSP13) | P0DTD1 | RNA unwinding | 65 | None approved |
| 排名 | 靶点 | UniProt | 功能 | 得分 | 药物先例 |
|---|---|---|---|---|---|
| 1 | RdRp(NSP12) | P0DTD1 | RNA复制 | 92 | Remdesivir |
| 2 | 主蛋白酶(Mpro) | P0DTD1 | 多蛋白切割 | 88 | Nirmatrelvir |
| 3 | 木瓜样蛋白酶 | P0DTD1 | 多蛋白切割 | 75 | GRL0617(临床前) |
| 4 | 刺突蛋白 | P0DTC2 | 宿主细胞进入 | 70 | 抗体 |
| 5 | 解旋酶(NSP13) | P0DTD1 | RNA解旋 | 65 | 无已批准药物 |
2.2 Target Details
2.2 靶点详情
Target 1: RNA-dependent RNA polymerase (RdRp/NSP12)
靶点1:RNA依赖的RNA聚合酶(RdRp/NSP12)
| Property | Value |
|---|---|
| UniProt | P0DTD1 (polyprotein position 4393-5324) |
| Length | 932 amino acids |
| Function | Catalyzes RNA synthesis from RNA template |
| Essentiality | Absolute (no replication without RdRp) |
| Conservation | >99% across all SARS-CoV-2 variants |
| Binding site | Nucleotide binding pocket |
| Drug precedent | Remdesivir (FDA approved), Favipiravir |
Source: UniProt, ChEMBL
---| 属性 | 值 |
|---|---|
| UniProt | P0DTD1(多蛋白位置4393-5324) |
| 长度 | 932个氨基酸 |
| 功能 | 催化以RNA为模板的RNA合成 |
| 必需性 | 绝对必需(无RdRp则无法复制) |
| 保守性 | 在所有SARS-CoV-2变异株中保守性>99% |
| 结合位点 | 核苷酸结合口袋 |
| 药物先例 | Remdesivir(FDA批准)、Favipiravir |
来源:UniProt, ChEMBL
---Phase 3: Structure Prediction
阶段3:结构预测
3.1 AlphaFold2 Structure Prediction (NVIDIA NIM)
3.1 AlphaFold2结构预测(NVIDIA NIM)
python
def predict_target_structure(tu, sequence, target_name):
"""Predict structure for target protein."""
# Use AlphaFold2 for high accuracy
structure = tu.tools.NvidiaNIM_alphafold2(
sequence=sequence,
algorithm="mmseqs2",
relax_prediction=False
)
# Parse pLDDT confidence
plddt_scores = parse_plddt(structure)
return {
'structure': structure['structure'],
'mean_plddt': np.mean(plddt_scores),
'high_confidence_regions': get_high_confidence(plddt_scores),
'predicted_binding_site': identify_binding_site(structure)
}python
def predict_target_structure(tu, sequence, target_name):
"""预测靶点蛋白的结构。"""
# 使用AlphaFold2以获取高精度结果
structure = tu.tools.NvidiaNIM_alphafold2(
sequence=sequence,
algorithm="mmseqs2",
relax_prediction=False
)
# 解析pLDDT置信度
plddt_scores = parse_plddt(structure)
return {
'structure': structure['structure'],
'mean_plddt': np.mean(plddt_scores),
'high_confidence_regions': get_high_confidence(plddt_scores),
'predicted_binding_site': identify_binding_site(structure)
}3.2 Structure Quality Assessment
3.2 结构质量评估
| pLDDT Range | Confidence | Use for Docking |
|---|---|---|
| >90 | Very High | Excellent |
| 70-90 | High | Good |
| 50-70 | Medium | Use caution |
| <50 | Low | Not recommended |
| pLDDT范围 | 置信度 | 对接适用性 |
|---|---|---|
| >90 | 极高 | 极佳 |
| 70-90 | 高 | 良好 |
| 50-70 | 中 | 谨慎使用 |
| <50 | 低 | 不推荐 |
3.3 Output for Report
3.3 报告输出示例
markdown
undefinedmarkdown
undefined3. Target Structures
3. 靶点结构
3.1 Structure Prediction Results
3.1 结构预测结果
| Target | Method | Length | Mean pLDDT | Docking Ready |
|---|---|---|---|---|
| RdRp (NSP12) | AlphaFold2 | 932 aa | 91.2 | ✓ Yes |
| Mpro | AlphaFold2 | 306 aa | 93.5 | ✓ Yes |
| PLpro | AlphaFold2 | 315 aa | 88.7 | ✓ Yes |
| 靶点 | 方法 | 长度 | 平均pLDDT | 可用于对接 |
|---|---|---|---|---|
| RdRp(NSP12) | AlphaFold2 | 932 aa | 91.2 | ✓ 是 |
| Mpro | AlphaFold2 | 306 aa | 93.5 | ✓ 是 |
| PLpro | AlphaFold2 | 315 aa | 88.7 | ✓ 是 |
3.2 RdRp Structure Quality
3.2 RdRp结构质量
| Region | Residues | pLDDT | Functional Role |
|---|---|---|---|
| Palm domain | 582-620 | 94.2 | Catalytic site |
| Fingers domain | 397-581 | 91.8 | NTP entry |
| Thumb domain | 621-815 | 89.4 | RNA binding |
| Active site | D760, D761 | 96.1 | Catalysis |
Docking Recommendation: Structure suitable for docking; active site highly confident.
Source: NVIDIA NIM via
NvidiaNIM_alphafold2
---| 区域 | 残基 | pLDDT | 功能角色 |
|---|---|---|---|
| 手掌结构域 | 582-620 | 94.2 | 催化位点 |
| 手指结构域 | 397-581 | 91.8 | NTP进入 |
| 拇指结构域 | 621-815 | 89.4 | RNA结合 |
| 活性位点 | D760, D761 | 96.1 | 催化 |
对接建议:结构适用于对接;活性位点置信度极高。
来源:NVIDIA NIM via
NvidiaNIM_alphafold2
---Phase 4: Drug Repurposing Screen
阶段4:药物重定位筛选
4.1 Identify Repurposing Candidates
4.1 识别重定位候选药物
python
def get_repurposing_candidates(tu, target_name, pathogen_family):
"""Find approved drugs to repurpose."""
candidates = []
# 1. Drugs approved for related pathogens
related_drugs = tu.tools.ChEMBL_search_drugs(
query=pathogen_family,
max_phase=4
)
candidates.extend(related_drugs)
# 2. Broad-spectrum antivirals
antivirals = tu.tools.ChEMBL_search_drugs(
query="broad spectrum antiviral",
max_phase=4
)
candidates.extend(antivirals)
# 3. Drugs with known activity against target class
target_class_drugs = tu.tools.DGIdb_get_drug_gene_interactions(
genes=[target_name]
)
candidates.extend(target_class_drugs)
return deduplicate(candidates)python
def get_repurposing_candidates(tu, target_name, pathogen_family):
"""寻找可重定位的已批准药物。"""
candidates = []
# 1. 针对亲缘病原体的已批准药物
related_drugs = tu.tools.ChEMBL_search_drugs(
query=pathogen_family,
max_phase=4
)
candidates.extend(related_drugs)
# 2. 广谱抗病毒药物
antivirals = tu.tools.ChEMBL_search_drugs(
query="broad spectrum antiviral",
max_phase=4
)
candidates.extend(antivirals)
# 3. 针对靶点类别的药物
target_class_drugs = tu.tools.DGIdb_get_drug_gene_interactions(
genes=[target_name]
)
candidates.extend(target_class_drugs)
return deduplicate(candidates)4.2 Docking Screen (NVIDIA NIM)
4.2 分子对接筛选(NVIDIA NIM)
python
def dock_candidates(tu, target_structure, candidate_smiles_list):
"""Dock candidate drugs against target."""
results = []
for smiles in candidate_smiles_list:
docking = tu.tools.NvidiaNIM_diffdock(
protein=target_structure,
ligand=smiles,
num_poses=5
)
results.append({
'smiles': smiles,
'top_score': docking['poses'][0]['confidence'],
'poses': docking['poses']
})
return sorted(results, key=lambda x: x['top_score'], reverse=True)python
def dock_candidates(tu, target_structure, candidate_smiles_list):
"""将候选药物与靶点进行分子对接。"""
results = []
for smiles in candidate_smiles_list:
docking = tu.tools.NvidiaNIM_diffdock(
protein=target_structure,
ligand=smiles,
num_poses=5
)
results.append({
'smiles': smiles,
'top_score': docking['poses'][0]['confidence'],
'poses': docking['poses']
})
return sorted(results, key=lambda x: x['top_score'], reverse=True)4.3 Output for Report
4.3 报告输出示例
markdown
undefinedmarkdown
undefined4. Drug Repurposing Screen
4. 药物重定位筛选
4.1 Candidate Identification
4.1 候选药物识别
| Source | Candidates | FDA Approved |
|---|---|---|
| Related pathogen drugs | 12 | 8 |
| Broad-spectrum antivirals | 15 | 11 |
| Target class drugs | 8 | 5 |
| Total unique | 28 | 19 |
| 来源 | 候选药物数量 | FDA批准 |
|---|---|---|
| 亲缘病原体药物 | 12 | 8 |
| 广谱抗病毒药物 | 15 | 11 |
| 靶点类别药物 | 8 | 5 |
| 总计去重后 | 28 | 19 |
4.2 Docking Results (RdRp Target)
4.2 对接结果(RdRp靶点)
| Rank | Drug | Indication | Docking Score | Evidence |
|---|---|---|---|---|
| 1 | Remdesivir | COVID-19 | 0.92 | ★★★ FDA approved |
| 2 | Favipiravir | Influenza | 0.87 | ★★☆ Phase 3 COVID |
| 3 | Sofosbuvir | HCV | 0.84 | ★★☆ In vitro active |
| 4 | Ribavirin | RSV, HCV | 0.78 | ★☆☆ Mixed results |
| 5 | Molnupiravir | COVID-19 | 0.76 | ★★★ FDA approved |
| 排名 | 药物 | 适应症 | 对接得分 | 证据 |
|---|---|---|---|---|
| 1 | Remdesivir | COVID-19 | 0.92 | ★★★ FDA批准 |
| 2 | Favipiravir | 流感 | 0.87 | ★★☆ COVID-19 3期临床 |
| 3 | Sofosbuvir | HCV | 0.84 | ★★☆ 体外活性 |
| 4 | Ribavirin | RSV, HCV | 0.78 | ★☆☆ 结果不一 |
| 5 | Molnupiravir | COVID-19 | 0.76 | ★★★ FDA批准 |
4.3 Top Candidate: Remdesivir
4.3 顶级候选药物:Remdesivir
| Property | Value |
|---|---|
| Docking score | 0.92 (excellent) |
| Mechanism | RdRp inhibitor (nucleotide analog) |
| FDA status | Approved for COVID-19 |
| Clinical evidence | ACTT-1: Reduced recovery time |
| Binding mode | Active site, chain termination |
Source: NVIDIA NIM via , ChEMBL
NvidiaNIM_diffdock
---| 属性 | 值 |
|---|---|
| 对接得分 | 0.92(极佳) |
| 作用机制 | RdRp抑制剂(核苷酸类似物) |
| FDA状态 | 批准用于COVID-19 |
| 临床证据 | ACTT-1:缩短恢复时间 |
| 结合模式 | 结合活性位点,终止链合成 |
来源:NVIDIA NIM via , ChEMBL
NvidiaNIM_diffdock
---Phase 4.5: Pathway Analysis (NEW)
阶段4.5:通路分析(新增)
4.5.1 Pathogen Metabolism Pathways
4.5.1 病原体代谢通路
python
def analyze_pathogen_pathways(tu, pathogen_name, pathogen_type):
"""Identify druggable metabolic pathways in pathogen."""
# KEGG pathogen pathways
pathways = tu.tools.kegg_search_pathway(
query=f"{pathogen_name} metabolism"
)
# Essential metabolic genes
essential_genes = tu.tools.kegg_get_pathway_genes(
pathway_id=pathways[0]['pathway_id']
)
# Host-pathogen interaction pathways
host_pathogen = tu.tools.kegg_search_pathway(
query=f"{pathogen_name} host interaction"
)
return {
'metabolic_pathways': pathways,
'essential_genes': essential_genes,
'host_interaction': host_pathogen
}python
def analyze_pathogen_pathways(tu, pathogen_name, pathogen_type):
"""识别病原体中可成药的代谢通路。"""
# KEGG病原体通路
pathways = tu.tools.kegg_search_pathway(
query=f"{pathogen_name} metabolism"
)
# 必需代谢基因
essential_genes = tu.tools.kegg_get_pathway_genes(
pathway_id=pathways[0]['pathway_id']
)
# 宿主-病原体互作通路
host_pathogen = tu.tools.kegg_search_pathway(
query=f"{pathogen_name} host interaction"
)
return {
'metabolic_pathways': pathways,
'essential_genes': essential_genes,
'host_interaction': host_pathogen
}4.5.2 Output for Report
4.5.2 报告输出示例
markdown
undefinedmarkdown
undefined4.5 Pathway Analysis
4.5 通路分析
Pathogen Metabolic Pathways (KEGG)
病原体代谢通路(KEGG)
| Pathway | Essentiality | Drug Targets |
|---|---|---|
| Viral replication (ko03030) | Essential | RdRp, Helicase |
| Viral protein processing | Essential | Mpro, PLpro |
| Host membrane interaction | Essential | Spike, ACE2 |
| 通路 | 必需性 | 药物靶点 |
|---|---|---|
| 病毒复制(ko03030) | 必需 | RdRp, 解旋酶 |
| 病毒蛋白加工 | 必需 | Mpro, PLpro |
| 宿主膜互作 | 必需 | 刺突蛋白, ACE2 |
Druggable Pathway Targets
可成药通路靶点
| Target | Pathway | Known Drugs | Evidence |
|---|---|---|---|
| RdRp | Viral replication | Remdesivir | ★★★ |
| 3CLpro | Protein processing | Nirmatrelvir | ★★★ |
| PLpro | Protein processing | GRL-0617 | ★★☆ |
| 靶点 | 通路 | 已知药物 | 证据 |
|---|---|---|---|
| RdRp | 病毒复制 | Remdesivir | ★★★ |
| 3CLpro | 蛋白加工 | Nirmatrelvir | ★★★ |
| PLpro | 蛋白加工 | GRL-0617 | ★★☆ |
Host-Pathogen Interaction Points
宿主-病原体互作位点
| Interaction | Host Protein | Pathway | Druggability |
|---|---|---|---|
| Entry | ACE2 | Cell surface | ★★☆ |
| Fusion | TMPRSS2 | Protease | ★★★ |
| Replication | Host ribosomes | Translation | ★☆☆ |
Source: KEGG, Reactome
---| 互作 | 宿主蛋白 | 通路 | 成药性 |
|---|---|---|---|
| 进入 | ACE2 | 细胞表面 | ★★☆ |
| 融合 | TMPRSS2 | 蛋白酶 | ★★★ |
| 复制 | 宿主核糖体 | 翻译 | ★☆☆ |
来源:KEGG, Reactome
---Phase 5: Literature Intelligence (ENHANCED)
阶段5:文献情报(增强版)
5.1 Comprehensive Literature Search
5.1 综合文献搜索
python
def comprehensive_outbreak_literature(tu, pathogen_name):
"""Search all literature sources for outbreak intelligence."""
# PubMed: Peer-reviewed
pubmed = tu.tools.PubMed_search_articles(
query=f"{pathogen_name} AND (outbreak OR treatment OR drug)",
limit=50,
sort="date"
)
# BioRxiv: CRITICAL for outbreaks - newest findings
biorxiv = tu.tools.BioRxiv_search_preprints(
query=f"{pathogen_name} treatment mechanism",
limit=20
)
# MedRxiv: Clinical preprints
medrxiv = tu.tools.MedRxiv_search_preprints(
query=f"{pathogen_name} clinical trial",
limit=20
)
# ArXiv: Computational/ML papers
arxiv = tu.tools.ArXiv_search_papers(
query=f"{pathogen_name} drug discovery",
category="q-bio",
limit=10
)
# Clinical trials
trials = tu.tools.search_clinical_trials(
condition=pathogen_name,
status="Recruiting"
)
# Citation analysis
key_papers = pubmed[:10]
for paper in key_papers:
citation = tu.tools.openalex_search_works(
query=paper['title'],
limit=1
)
paper['citations'] = citation[0].get('cited_by_count', 0) if citation else 0
return {
'pubmed': pubmed,
'biorxiv': biorxiv,
'medrxiv': medrxiv,
'arxiv': arxiv,
'trials': trials,
'key_papers': key_papers
}python
def comprehensive_outbreak_literature(tu, pathogen_name):
"""搜索所有文献源以获取暴发情报。"""
# PubMed:同行评审文献
pubmed = tu.tools.PubMed_search_articles(
query=f"{pathogen_name} AND (outbreak OR treatment OR drug)",
limit=50,
sort="date"
)
# BioRxiv:暴发期间至关重要 - 最新发现
biorxiv = tu.tools.BioRxiv_search_preprints(
query=f"{pathogen_name} treatment mechanism",
limit=20
)
# MedRxiv:临床预印本
medrxiv = tu.tools.MedRxiv_search_preprints(
query=f"{pathogen_name} clinical trial",
limit=20
)
# ArXiv:计算/机器学习论文
arxiv = tu.tools.ArXiv_search_papers(
query=f"{pathogen_name} drug discovery",
category="q-bio",
limit=10
)
# 临床试验
trials = tu.tools.search_clinical_trials(
condition=pathogen_name,
status="Recruiting"
)
# 引文分析
key_papers = pubmed[:10]
for paper in key_papers:
citation = tu.tools.openalex_search_works(
query=paper['title'],
limit=1
)
paper['citations'] = citation[0].get('cited_by_count', 0) if citation else 0
return {
'pubmed': pubmed,
'biorxiv': biorxiv,
'medrxiv': medrxiv,
'arxiv': arxiv,
'trials': trials,
'key_papers': key_papers
}5.2 Output for Report
5.2 报告输出示例
markdown
undefinedmarkdown
undefined5. Literature Intelligence
5. 文献情报
5.1 Published Literature (Peer-Reviewed)
5.1 已发表文献(同行评审)
| Topic | Papers | Key Finding |
|---|---|---|
| Treatment | 234 | Paxlovid remains effective |
| Resistance | 45 | Nirmatrelvir resistance mutations identified |
| Variants | 189 | XBB variants maintain drug sensitivity |
| Vaccines | 312 | Updated boosters protective |
| 主题 | 论文数量 | 关键发现 |
|---|---|---|
| 治疗 | 234 | Paxlovid仍保持有效性 |
| 耐药性 | 45 | 已识别Nirmatrelvir耐药突变 |
| 变异株 | 189 | XBB变异株对药物保持敏感性 |
| 疫苗 | 312 | 更新后的加强针具有保护性 |
5.2 Preprints (CRITICAL for Emerging Outbreaks)
5.2 预印本(暴发期间至关重要)
⚠️ Note: Preprints are NOT peer-reviewed. Critical for rapid intelligence but use with caution.
| Source | Title | Posted | Key Finding |
|---|---|---|---|
| BioRxiv | Novel RdRp inhibitor shows activity... | 2024-02-01 | New candidate |
| MedRxiv | Real-world effectiveness of... | 2024-01-28 | Paxlovid 85% effective |
| BioRxiv | Resistance mutations in... | 2024-01-25 | Monitor L50F mutation |
⚠️ 注意:预印本未经过同行评审。对快速获取情报至关重要,但需谨慎使用。
| 来源 | 标题 | 发布日期 | 关键发现 |
|---|---|---|---|
| BioRxiv | 新型RdRp抑制剂显示出活性... | 2024-02-01 | 新候选药物 |
| MedRxiv | ...的真实世界有效性 | 2024-01-28 | Paxlovid有效性达85% |
| BioRxiv | ...中的耐药突变 | 2024-01-25 | 需监测L50F突变 |
5.3 Computational/ML Preprints (ArXiv)
5.3 计算/机器学习预印本(ArXiv)
| Title | Category | Relevance |
|---|---|---|
| Deep learning for antiviral discovery | q-bio.BM | Drug design |
| Structure prediction for novel... | q-bio.BM | Target modeling |
| 标题 | 分类 | 相关性 |
|---|---|---|
| 深度学习用于抗病毒药物发现 | q-bio.BM | 药物设计 |
| 新型...的结构预测 | q-bio.BM | 靶点建模 |
5.4 Active Clinical Trials
5.4 活跃临床试验
| NCT ID | Phase | Drug | Status |
|---|---|---|---|
| NCT05012345 | 3 | Ensitrelvir | Recruiting |
| NCT05023456 | 2 | VV116 | Recruiting |
| NCT05034567 | 2 | S-217622 | Active |
| NCT编号 | 阶段 | 药物 | 状态 |
|---|---|---|---|
| NCT05012345 | 3 | Ensitrelvir | 招募中 |
| NCT05023456 | 2 | VV116 | 招募中 |
| NCT05034567 | 2 | S-217622 | 进行中 |
5.5 Citation Analysis (High-Impact Papers)
5.5 引文分析(高影响力论文)
| PMID | Title | Citations | Year |
|---|---|---|---|
| 33123456 | Remdesivir for COVID-19 | 5,234 | 2020 |
| 34234567 | Paxlovid Phase 3 results | 2,876 | 2022 |
Source: PubMed, BioRxiv, MedRxiv, ArXiv, OpenAlex, ClinicalTrials.gov
---| PMID | 标题 | 引用量 | 年份 |
|---|---|---|---|
| 33123456 | Remdesivir用于COVID-19 | 5,234 | 2020 |
| 34234567 | Paxlovid 3期结果 | 2,876 | 2022 |
来源:PubMed, BioRxiv, MedRxiv, ArXiv, OpenAlex, ClinicalTrials.gov
---Report Template
报告模板
markdown
undefinedmarkdown
undefinedOutbreak Intelligence Report: [PATHOGEN]
暴发情报报告:[PATHOGEN]
Generated: [Date] | Query: [Original query] | Status: In Progress
生成时间:[日期] | 查询:[原始查询] | 状态:分析中
Executive Summary
执行摘要
[Analyzing...]
[分析中...]
1. Pathogen Profile
1. 病原体概况
1.1 Classification
1.1 分类
[Analyzing...]
[分析中...]
1.2 Related Pathogens
1.2 亲缘病原体
[Analyzing...]
[分析中...]
2. Druggable Targets
2. 可成药靶点
2.1 Prioritized Targets
2.1 优先靶点
[Analyzing...]
[分析中...]
2.2 Target Details
2.2 靶点详情
[Analyzing...]
[分析中...]
3. Target Structures
3. 靶点结构
3.1 Prediction Results
3.1 预测结果
[Analyzing...]
[分析中...]
3.2 Binding Sites
3.2 结合位点
[Analyzing...]
[分析中...]
4. Drug Repurposing Screen
4. 药物重定位筛选
4.1 Candidate Drugs
4.1 候选药物
[Analyzing...]
[分析中...]
4.2 Docking Results
4.2 对接结果
[Analyzing...]
[分析中...]
4.3 Top Candidates
4.3 顶级候选药物
[Analyzing...]
[分析中...]
5. Literature Intelligence
5. 文献情报
5.1 Recent Findings
5.1 最新发现
[Analyzing...]
[分析中...]
5.2 Clinical Trials
5.2 临床试验
[Analyzing...]
[分析中...]
6. Recommendations
6. 建议
6.1 Immediate Actions
6.1 立即行动项
[Analyzing...]
[分析中...]
6.2 Clinical Trial Opportunities
6.2 临床试验机会
[Analyzing...]
[分析中...]
6.3 Research Priorities
6.3 研究优先级
[Analyzing...]
[分析中...]
7. Data Gaps & Limitations
7. 数据缺口与局限性
[Analyzing...]
[分析中...]
8. Data Sources
8. 数据来源
[Will be populated...]
---[将填充...]
---Evidence Grading
证据分级
| Tier | Symbol | Criteria | Example |
|---|---|---|---|
| T1 | ★★★ | FDA approved for this pathogen | Remdesivir for COVID |
| T2 | ★★☆ | Clinical trial evidence OR approved for related pathogen | Favipiravir |
| T3 | ★☆☆ | In vitro activity OR strong docking + mechanism | Sofosbuvir |
| T4 | ☆☆☆ | Computational prediction only | Novel docking hits |
| 层级 | 符号 | 标准 | 示例 |
|---|---|---|---|
| T1 | ★★★ | FDA批准用于该病原体 | Remdesivir用于COVID-19 |
| T2 | ★★☆ | 临床试验证据 或 批准用于亲缘病原体 | Favipiravir |
| T3 | ★☆☆ | 体外活性 或 高对接得分+作用机制 | Sofosbuvir |
| T4 | ☆☆☆ | 仅计算预测 | 新型对接命中物 |
Completeness Checklist
完整性检查清单
Phase 1: Pathogen ID
阶段1:病原体识别
- Taxonomic classification complete
- Related pathogens identified
- Genome/proteome availability noted
- 分类学分类完成
- 亲缘病原体已识别
- 基因组/蛋白质组可用性已记录
Phase 2: Targets
阶段2:靶点
- ≥5 targets identified
- Essentiality documented
- Conservation assessed
- Drug precedent checked
- 识别≥5个靶点
- 必需性已记录
- 保守性已评估
- 药物先例已检查
Phase 3: Structures
阶段3:结构
- Structures predicted for top 3 targets
- pLDDT confidence reported
- Binding sites identified
- 已为前3个靶点预测结构
- 已报告pLDDT置信度
- 已识别结合位点
Phase 4: Drug Screen
阶段4:药物筛选
- ≥20 candidates screened
- FDA-approved drugs prioritized
- Docking scores reported
- Top 5 candidates detailed
- 已筛选≥20个候选药物
- 已优先考虑FDA批准药物
- 已报告对接得分
- 已详细说明前5个候选药物
Phase 5: Literature
阶段5:文献
- Recent papers summarized
- Active trials listed
- Resistance data noted
- 已总结最新论文
- 已列出活跃试验
- 已记录耐药性数据
Phase 6: Recommendations
阶段6:建议
- ≥3 immediate actions
- Clinical trial opportunities
- Research priorities
- ≥3个立即行动项
- 临床试验机会
- 研究优先级
Fallback Chains
备选工具链
| Primary Tool | Fallback 1 | Fallback 2 |
|---|---|---|
| | |
| | Manual docking |
| | Manual classification |
| | PubChem bioassays |
| 主工具 | 备选工具1 | 备选工具2 |
|---|---|---|
| | |
| | 手动对接 |
| | 手动分类 |
| | PubChem生物测定 |
Tool Reference
工具参考
See TOOLS_REFERENCE.md for complete tool documentation.
完整工具文档请参见TOOLS_REFERENCE.md。