tooluniverse-chemical-safety
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseChemical Safety & Toxicology Assessment
化学安全与毒理学评估
Comprehensive chemical safety and toxicology analysis integrating predictive AI models, curated toxicogenomics databases, regulatory safety data, and chemical-biological interaction networks. Generates structured risk assessment reports with evidence grading.
整合预测AI模型、经过整理的毒理基因组学数据库、监管安全数据以及化学生物相互作用网络的综合性化学安全与毒理学分析。可生成带有证据分级的结构化风险评估报告。
When to Use This Skill
何时使用该技能
Triggers:
- "Is this chemical toxic?" / "What are the toxicity endpoints for [compound]?"
- "Assess the safety profile of [drug/chemical]"
- "What are the ADMET properties of [SMILES]?"
- "What genes does [chemical] interact with?"
- "What diseases are linked to [chemical] exposure?"
- "Predict toxicity for these molecules"
- "Drug safety assessment for [drug name]"
- "Environmental health risk of [chemical]"
- "Chemical hazard profiling"
- "Toxicogenomic analysis of [compound]"
Use Cases:
- Predictive Toxicology: AI-predicted toxicity endpoints (AMES mutagenicity, DILI, LD50, carcinogenicity, skin reactions) for novel compounds via SMILES
- ADMET Profiling: Full absorption, distribution, metabolism, excretion, toxicity characterization
- Toxicogenomics: Chemical-gene interaction mapping, gene-disease associations from CTD
- Regulatory Safety: FDA label warnings, boxed warnings, contraindications, adverse reactions
- Drug Safety Assessment: Combined DrugBank safety + FDA labels + adverse event data
- Chemical-Protein Interactions: STITCH-based chemical-protein binding and interaction networks
- Environmental Toxicology: Chemical-disease associations for environmental contaminants
触发场景:
- "该化学物质有毒吗?" / "[化合物]的毒性终点是什么?"
- "评估[药物/化学物质]的安全档案"
- "[SMILES]的ADMET特性是什么?"
- "[化学物质]与哪些基因相互作用?"
- "接触[化学物质]会关联哪些疾病?"
- "预测这些分子的毒性"
- "[药物名称]的药物安全评估"
- "[化学物质]的环境健康风险"
- "化学危害分析"
- "[化合物]的毒理基因组学分析"
适用场景:
- 预测毒理学: 通过SMILES对新型化合物进行AI预测的毒性终点分析(AMES致突变性、DILI肝毒性、LD50半数致死量、致癌性、皮肤反应)
- ADMET分析: 完整的吸收、分布、代谢、排泄、毒性特征分析
- 毒理基因组学: 化学-基因相互作用映射、CTD数据库中的基因-疾病关联分析
- 监管安全: FDA标签警告、黑框警告、禁忌症、不良反应
- 药物安全评估: 结合DrugBank安全数据、FDA标签数据及不良事件数据
- 化学-蛋白质相互作用: 基于STITCH的化学-蛋白质结合及相互作用网络分析
- 环境毒理学: 环境污染物的化学-疾病关联分析
KEY PRINCIPLES
核心原则
- Report-first approach - Create report file FIRST, then populate progressively
- Tool parameter verification - Verify params via before calling unfamiliar tools
get_tool_info - Evidence grading - Grade all safety claims by evidence strength (T1-T4)
- Citation requirements - Every toxicity finding must have inline source attribution
- Mandatory completeness - All sections must exist with data minimums or explicit "No data" notes
- Disambiguation first - Resolve compound identity (name -> SMILES, CID, ChEMBL ID) before analysis
- Negative results documented - "No toxicity signals found" is data; empty sections are failures
- Conservative risk assessment - When evidence is ambiguous, flag as "requires further investigation"
- English-first queries - Always use English chemical/drug names in tool calls
- 报告优先原则 - 先创建报告文件,再逐步填充内容
- 工具参数验证 - 在调用不熟悉的工具前,通过验证参数
get_tool_info - 证据分级 - 所有安全结论需按证据强度分级(T1-T4)
- 引用要求 - 每一项毒性发现必须有内联来源标注
- 完整性要求 - 所有章节必须存在,至少包含基础数据或明确标注“无数据”
- 先消歧义 - 在分析前先确认化合物身份(名称→SMILES、CID、ChEMBL ID)
- 记录阴性结果 - “未发现毒性信号”属于有效数据;空章节视为失败
- 保守风险评估 - 当证据不明确时,标记为“需进一步研究”
- 英文优先查询 - 在工具调用中始终使用英文化学/药物名称
Evidence Grading System (MANDATORY)
证据分级系统(强制要求)
Grade every toxicity claim by evidence strength:
| Tier | Symbol | Criteria | Examples |
|---|---|---|---|
| T1 | [T1] | Direct human evidence, regulatory finding | FDA boxed warning, clinical trial toxicity, human case reports |
| T2 | [T2] | Animal studies, validated in vitro | Nonclinical toxicology, AMES positive, animal LD50 |
| T3 | [T3] | Computational prediction, association data | ADMET-AI prediction, CTD association, QSAR model |
| T4 | [T4] | Database annotation, text-mined | Literature mention, database entry without validation |
所有毒性结论需按证据强度分级:
| 等级 | 标识 | 标准 | 示例 |
|---|---|---|---|
| T1 | [T1] | 直接人体证据、监管机构结论 | FDA黑框警告、临床试验毒性、人体病例报告 |
| T2 | [T2] | 动物研究、已验证的体外实验 | 非临床毒理学、AMES阳性、动物LD50数据 |
| T3 | [T3] | 计算预测、关联数据 | ADMET-AI预测、CTD关联、QSAR模型 |
| T4 | [T4] | 数据库注释、文本挖掘结果 | 文献提及、未经验证的数据库条目 |
Required Evidence Grading Locations
证据分级必填位置
Evidence grades MUST appear in:
- Executive Summary - Key toxicity findings graded
- Toxicity Predictions - Every ADMET-AI endpoint with confidence note
- Regulatory Safety - FDA findings marked [T1]
- Chemical-Gene Interactions - CTD data marked by curation status
- Risk Assessment - Final risk classification with supporting evidence tiers
证据分级必须出现在:
- 执行摘要 - 关键毒性发现需分级
- 毒性预测 - 每个ADMET-AI终点需标注置信度
- 监管安全 - FDA发现标记为[T1]
- 化学-基因相互作用 - CTD数据需标注整理状态
- 风险评估 - 最终风险分类需附带支持证据等级
Core Strategy: 8 Research Dimensions
核心策略:8个研究维度
Chemical/Drug Query
|
+-- PHASE 0: Compound Disambiguation (ALWAYS FIRST)
| +-- Resolve name -> SMILES, PubChem CID, ChEMBL ID
| +-- Get molecular formula, weight, canonical structure
|
+-- PHASE 1: Predictive Toxicology (ADMET-AI)
| +-- Mutagenicity (AMES)
| +-- Hepatotoxicity (DILI, ClinTox)
| +-- Carcinogenicity
| +-- Acute toxicity (LD50)
| +-- Skin reactions
| +-- Stress response pathways
| +-- Nuclear receptor activity
|
+-- PHASE 2: ADMET Properties
| +-- Absorption: BBB penetrance, bioavailability
| +-- Distribution: clearance, volume of distribution
| +-- Metabolism: CYP interactions (1A2, 2C9, 2C19, 2D6, 3A4)
| +-- Physicochemical: solubility, lipophilicity, pKa
|
+-- PHASE 3: Toxicogenomics (CTD)
| +-- Chemical-gene interactions
| +-- Chemical-disease associations
| +-- Affected biological pathways
|
+-- PHASE 4: Regulatory Safety (FDA Labels)
| +-- Boxed warnings (Black Box)
| +-- Contraindications
| +-- Adverse reactions
| +-- Warnings and precautions
| +-- Nonclinical toxicology
|
+-- PHASE 5: Drug Safety Profile (DrugBank)
| +-- Toxicity data
| +-- Contraindications
| +-- Drug interactions affecting safety
|
+-- PHASE 6: Chemical-Protein Interactions (STITCH)
| +-- Direct chemical-protein binding
| +-- Interaction confidence scores
| +-- Off-target effects
|
+-- PHASE 7: Structural Alerts (ChEMBL)
| +-- Known toxic substructures (PAINS, Brenk)
| +-- Structural alert flags
|
+-- SYNTHESIS: Integrated Risk Assessment
+-- Aggregate all evidence tiers
+-- Risk classification (Low/Medium/High/Critical)
+-- Data gaps and recommendationsChemical/Drug Query
|
+-- PHASE 0: Compound Disambiguation (ALWAYS FIRST)
| +-- Resolve name -> SMILES, PubChem CID, ChEMBL ID
| +-- Get molecular formula, weight, canonical structure
|
+-- PHASE 1: Predictive Toxicology (ADMET-AI)
| +-- Mutagenicity (AMES)
| +-- Hepatotoxicity (DILI, ClinTox)
| +-- Carcinogenicity
| +-- Acute toxicity (LD50)
| +-- Skin reactions
| +-- Stress response pathways
| +-- Nuclear receptor activity
|
+-- PHASE 2: ADMET Properties
| +-- Absorption: BBB penetrance, bioavailability
| +-- Distribution: clearance, volume of distribution
| +-- Metabolism: CYP interactions (1A2, 2C9, 2C19, 2D6, 3A4)
| +-- Physicochemical: solubility, lipophilicity, pKa
|
+-- PHASE 3: Toxicogenomics (CTD)
| +-- Chemical-gene interactions
| +-- Chemical-disease associations
| +-- Affected biological pathways
|
+-- PHASE 4: Regulatory Safety (FDA Labels)
| +-- Boxed warnings (Black Box)
| +-- Contraindications
| +-- Adverse reactions
| +-- Warnings and precautions
| +-- Nonclinical toxicology
|
+-- PHASE 5: Drug Safety Profile (DrugBank)
| +-- Toxicity data
| +-- Contraindications
| +-- Drug interactions affecting safety
|
+-- PHASE 6: Chemical-Protein Interactions (STITCH)
| +-- Direct chemical-protein binding
| +-- Interaction confidence scores
| +-- Off-target effects
|
+-- PHASE 7: Structural Alerts (ChEMBL)
| +-- Known toxic substructures (PAINS, Brenk)
| +-- Structural alert flags
|
+-- SYNTHESIS: Integrated Risk Assessment
+-- Aggregate all evidence tiers
+-- Risk classification (Low/Medium/High/Critical)
+-- Data gaps and recommendationsPhase 0: Compound Disambiguation (ALWAYS FIRST)
阶段0:化合物消歧义(必须优先执行)
CRITICAL: Resolve compound identity before any analysis.
关键:在任何分析前先确认化合物身份。
Input Types Handled
支持的输入类型
| Input Format | Resolution Strategy |
|---|---|
| Drug name (e.g., "Aspirin") | PubChem_get_CID_by_compound_name -> get SMILES from properties |
| SMILES string | Use directly for ADMET-AI; resolve to CID for other tools |
| PubChem CID | PubChem_get_compound_properties_by_CID -> get SMILES + name |
| ChEMBL ID | ChEMBL_get_molecule -> get SMILES + properties |
| 输入格式 | 解析策略 |
|---|---|
| 药物名称(如“Aspirin”) | PubChem_get_CID_by_compound_name → 从属性中获取SMILES |
| SMILES字符串 | 直接用于ADMET-AI;解析为CID供其他工具使用 |
| PubChem CID | PubChem_get_compound_properties_by_CID → 获取SMILES + 名称 |
| ChEMBL ID | ChEMBL_get_molecule → 获取SMILES + 属性 |
Resolution Steps
解析步骤
- Input detection: Determine if input is name, SMILES, CID, or ChEMBL ID
- SMILES: contains typical SMILES characters (=, #, [, ], (, ), c, n, o and no spaces in middle)
- CID: numeric only
- ChEMBL: starts with "CHEMBL"
- Otherwise: treat as compound name
- Name to CID:
PubChem_get_CID_by_compound_name(name=<compound_name>) - CID to properties:
PubChem_get_compound_properties_by_CID(cid=<cid>) - Extract SMILES: Get SMILES from PubChem properties (field: ,
ConnectivitySMILES, orCanonicalSMILESdepending on response format)IsomericSMILES - Store resolved IDs: Maintain dict with ,
name,smiles,cid,formula,weightinchi
- 输入检测: 判断输入是名称、SMILES、CID还是ChEMBL ID
- SMILES: 包含典型SMILES字符(=, #, [, ], (, ), c, n, o,且中间无空格)
- CID: 仅包含数字
- ChEMBL: 以“CHEMBL”开头
- 其他情况: 视为化合物名称
- 名称转CID:
PubChem_get_CID_by_compound_name(name=<compound_name>) - CID转属性:
PubChem_get_compound_properties_by_CID(cid=<cid>) - 提取SMILES: 从PubChem属性中获取SMILES(字段: 、
ConnectivitySMILES或CanonicalSMILES,取决于返回格式)IsomericSMILES - 存储解析后的ID: 维护包含、
name、smiles、cid、formula、weight的字典inchi
Disambiguation Output
消歧输出
markdown
undefinedmarkdown
undefinedCompound Identity
化合物身份信息
| Property | Value |
|---|---|
| Name | Acetaminophen |
| PubChem CID | 1983 |
| SMILES | CC(=O)Nc1ccc(O)cc1 |
| Formula | C8H9NO2 |
| Molecular Weight | 151.16 |
| InChI | InChI=1S/C8H9NO2/... |
---| 属性 | 值 |
|---|---|
| 名称 | 对乙酰氨基酚 |
| PubChem CID | 1983 |
| SMILES | CC(=O)Nc1ccc(O)cc1 |
| 分子式 | C8H9NO2 |
| 分子量 | 151.16 |
| InChI | InChI=1S/C8H9NO2/... |
---Phase 1: Predictive Toxicology (ADMET-AI)
阶段1:预测毒理学(ADMET-AI)
When: SMILES is available (from Phase 0 or provided directly)
Objective: Run comprehensive AI-predicted toxicity endpoints
适用场景: 已获取SMILES(来自阶段0或直接提供)
目标: 运行全面的AI预测毒性终点
Tools Used
使用的工具
All ADMET-AI tools take the same parameter format:
| Tool | Predicted Endpoints | Parameter |
|---|---|---|
| AMES, Carcinogens_Lagunin, ClinTox, DILI, LD50_Zhu, Skin_Reaction, hERG | |
| Stress response pathway activation (ARE, ATAD5, HSE, MMP, p53) | |
| AhR, AR, ER, PPARg, Aromatase nuclear receptor activity | |
所有ADMET-AI工具采用相同的参数格式:
| 工具 | 预测终点 | 参数 |
|---|---|---|
| AMES、Carcinogens_Lagunin、ClinTox、DILI、LD50_Zhu、Skin_Reaction、hERG | |
| 应激反应通路激活(ARE、ATAD5、HSE、MMP、p53) | |
| AhR、AR、ER、PPARg、芳香化酶核受体活性 | |
Workflow
工作流程
- Call
ADMETAI_predict_toxicity(smiles=[resolved_smiles]) - Call
ADMETAI_predict_stress_response(smiles=[resolved_smiles]) - Call
ADMETAI_predict_nuclear_receptor_activity(smiles=[resolved_smiles]) - For each endpoint, interpret prediction:
- Classification endpoints: Active (1) = toxic signal, Inactive (0) = no signal
- Regression endpoints (LD50): Report numerical value with context
- All predictions graded [T3] (computational prediction)
- 调用
ADMETAI_predict_toxicity(smiles=[resolved_smiles]) - 调用
ADMETAI_predict_stress_response(smiles=[resolved_smiles]) - 调用
ADMETAI_predict_nuclear_receptor_activity(smiles=[resolved_smiles]) - 解读每个终点的预测结果:
- 分类终点: 活性(1)= 有毒性信号,非活性(0)= 无毒性信号
- 回归终点(LD50): 报告数值及相关背景
- 所有预测结果分级为[T3](计算预测)
Decision Logic
决策逻辑
- Multiple SMILES: Can batch up to ~10 SMILES in single call
- Failed prediction: If ADMET-AI fails, note "prediction unavailable" (don't fail entire report)
- Confidence: Note that AI predictions are [T3] evidence, not definitive
- hERG flag: If hERG = Active, flag prominently (cardiac safety risk)
- AMES flag: If AMES = Active, flag prominently (mutagenicity concern)
- DILI flag: If DILI = Active, flag prominently (liver toxicity concern)
- 多SMILES: 单次调用最多可批量处理约10个SMILES
- 预测失败: 如果ADMET-AI调用失败,标注“预测不可用”(不终止整个报告)
- 置信度: 需注明AI预测为[T3]证据,并非确定性结论
- hERG标记: 如果hERG=活性,需突出标记(心脏安全风险)
- AMES标记: 如果AMES=活性,需突出标记(致突变性风险)
- DILI标记: 如果DILI=活性,需突出标记(肝毒性风险)
Output Table
输出表格
markdown
undefinedmarkdown
undefinedToxicity Predictions [T3]
毒性预测结果 [T3]
| Endpoint | Prediction | Interpretation | Concern Level |
|---|---|---|---|
| AMES Mutagenicity | Inactive | No mutagenic signal | Low |
| Carcinogenicity | Inactive | No carcinogenic signal | Low |
| ClinTox | Active | Clinical toxicity signal | HIGH |
| DILI | Active | Drug-induced liver injury risk | HIGH |
| LD50 (Zhu) | 2.45 log(mg/kg) | ~282 mg/kg (moderate) | Medium |
| Skin Reaction | Inactive | No skin sensitization signal | Low |
| hERG Inhibition | Active | Cardiac arrhythmia risk | HIGH |
All predictions from ADMET-AI. Evidence tier: [T3] (computational prediction)
---| 终点 | 预测结果 | 解读 | 风险等级 |
|---|---|---|---|
| AMES致突变性 | 非活性 | 无致突变信号 | 低 |
| 致癌性 | 非活性 | 无致癌信号 | 低 |
| ClinTox临床毒性 | 活性 | 存在临床毒性信号 | 高 |
| DILI肝毒性 | 活性 | 存在药物诱导肝损伤风险 | 高 |
| LD50 (Zhu模型) | 2.45 log(mg/kg) | ~282 mg/kg(中等) | 中等 |
| 皮肤反应 | 非活性 | 无皮肤致敏信号 | 低 |
| hERG抑制 | 活性 | 存在心律失常风险 | 高 |
所有预测结果来自ADMET-AI。证据等级: [T3](计算预测)
---Phase 2: ADMET Properties
阶段2:ADMET特性分析
When: SMILES is available
Objective: Full ADMET characterization beyond toxicity
适用场景: 已获取SMILES
目标: 完成毒性之外的完整ADMET特征分析
Tools Used
使用的工具
| Tool | Properties Predicted | Parameter |
|---|---|---|
| Blood-brain barrier crossing probability | |
| Oral bioavailability (F20%, F30%) | |
| Clearance, VDss, half-life, PPB | |
| CYP1A2, 2C9, 2C19, 2D6, 3A4 inhibition/substrate | |
| LogP, LogD, LogS, MW, pKa | |
| Aqueous solubility, lipophilicity, hydration free energy | |
| 工具 | 预测特性 | 参数 |
|---|---|---|
| 血脑屏障穿透概率 | |
| 口服生物利用度(F20%、F30%) | |
| 清除率、VDss稳态分布容积、半衰期、PPB血浆蛋白结合率 | |
| CYP1A2、2C9、2C19、2D6、3A4的抑制/底物特性 | |
| LogP、LogD、LogS、分子量、pKa | |
| 水溶性、亲脂性、水合自由能 | |
Workflow
工作流程
- Call all 6 ADMET tools in parallel (independent calls)
- Compile results into Absorption / Distribution / Metabolism / Excretion sections
- Assess Lipinski Rule of 5 compliance from physicochemical properties
- Flag drug-drug interaction risks from CYP inhibition profiles
- 并行调用所有6个ADMET工具(独立调用)
- 将结果整理为吸收/分布/代谢/排泄章节
- 根据理化特性评估Lipinski五规则合规性
- 根据CYP抑制特征标记药物相互作用风险
Decision Logic
决策逻辑
- BBB penetrant + toxicity: If BBB = Yes and any CNS toxicity endpoint active, flag as neurotoxicity risk
- Low bioavailability: If F20% = Low, note absorption concerns
- CYP inhibitor: If CYP3A4 inhibitor = Yes, flag high DDI risk
- Lipinski violations: Count violations and report drug-likeness assessment
- 血脑屏障穿透+毒性: 如果BBB=是且任何中枢神经系统毒性终点为活性,标记为神经毒性风险
- 低生物利用度: 如果F20%=低,标注吸收问题
- CYP抑制剂: 如果CYP3A4抑制剂=是,标记高药物相互作用风险
- Lipinski规则违反: 统计违反次数并报告药物相似性评估结果
Output Format
输出格式
markdown
undefinedmarkdown
undefinedADMET Profile [T3]
ADMET分析结果 [T3]
Absorption
吸收
| Property | Value | Interpretation |
|---|---|---|
| BBB Penetrance | Yes | Crosses blood-brain barrier |
| Bioavailability (F20%) | 85% | Good oral absorption |
| 属性 | 值 | 解读 |
|---|---|---|
| 血脑屏障穿透 | 是 | 可穿过血脑屏障 |
| 生物利用度(F20%) | 85% | 口服吸收良好 |
Distribution
分布
| Property | Value | Interpretation |
|---|---|---|
| VDss | 1.2 L/kg | Moderate tissue distribution |
| PPB | 92% | Highly protein bound |
| 属性 | 值 | 解读 |
|---|---|---|
| VDss稳态分布容积 | 1.2 L/kg | 组织分布中等 |
| PPB血浆蛋白结合率 | 92% | 蛋白结合率高 |
Metabolism
代谢
| CYP Enzyme | Substrate | Inhibitor |
|---|---|---|
| CYP1A2 | No | No |
| CYP2C9 | Yes | No |
| CYP2C19 | No | No |
| CYP2D6 | No | No |
| CYP3A4 | Yes | Yes (DDI risk) |
| CYP酶 | 底物 | 抑制剂 |
|---|---|---|
| CYP1A2 | 否 | 否 |
| CYP2C9 | 是 | 否 |
| CYP2C19 | 否 | 否 |
| CYP2D6 | 否 | 否 |
| CYP3A4 | 是 | 是(存在药物相互作用风险) |
Excretion
排泄
| Property | Value | Interpretation |
|---|---|---|
| Clearance | 8.5 mL/min/kg | Moderate clearance |
| Half-life | 6.2 h | Moderate half-life |
---| 属性 | 值 | 解读 |
|---|---|---|
| 清除率 | 8.5 mL/min/kg | 清除率中等 |
| 半衰期 | 6.2 h | 半衰期中等 |
---Phase 3: Toxicogenomics (CTD)
阶段3:毒理基因组学(CTD)
When: Compound name is resolved
Objective: Map chemical-gene-disease relationships from curated CTD data
适用场景: 已解析化合物名称
目标: 从经过整理的CTD数据中映射化学-基因-疾病关系
Tools Used
使用的工具
| Tool | Function | Parameter |
|---|---|---|
| Genes affected by chemical | |
| Diseases linked to chemical exposure | |
| 工具 | 功能 | 参数 |
|---|---|---|
| 受化学物质影响的基因 | |
| 与化学物质暴露相关的疾病 | |
Workflow
工作流程
- Call
CTD_get_chemical_gene_interactions(input_terms=compound_name) - Call
CTD_get_chemical_diseases(input_terms=compound_name) - Parse gene interactions: extract gene symbols, interaction types (increases/decreases expression, binding, etc.)
- Parse disease associations: extract disease names, evidence types (marker/mechanism/therapeutic)
- Identify most affected biological processes from gene list
- 调用
CTD_get_chemical_gene_interactions(input_terms=compound_name) - 调用
CTD_get_chemical_diseases(input_terms=compound_name) - 解析基因相互作用: 提取基因符号、相互作用类型(上调/下调表达、结合等)
- 解析疾病关联: 提取疾病名称、关联类型(标记/机制/治疗)
- 从基因列表中识别受影响最显著的生物通路
Decision Logic
决策逻辑
- Direct evidence vs inferred: CTD separates curated direct evidence from inferred associations
- Therapeutic vs toxic: Disease associations can be therapeutic (drug treats disease) or adverse (chemical causes disease)
- Gene interaction types: Distinguish between expression changes, binding, and activity modulation
- Prioritize marker/mechanism: These indicate stronger causal evidence than simple associations
- Grade curated as [T2]: Direct curated CTD evidence from literature
- Grade inferred as [T3]: Computationally inferred associations
- 直接证据vs推断证据: CTD区分经过整理的直接证据与推断关联
- 治疗vs毒性: 疾病关联可能是治疗性(药物治疗疾病)或不良性(化学物质导致疾病)
- 基因相互作用类型: 区分表达变化、结合及活性调节
- 优先标记/机制: 此类关联表示更强的因果证据,优于简单关联
- 整理数据分级为[T2]: 来自文献的CTD直接整理证据
- 推断数据分级为[T3]: 计算推断的关联
Output Format
输出格式
markdown
undefinedmarkdown
undefinedToxicogenomics (CTD) [T2/T3]
毒理基因组学(CTD) [T2/T3]
Chemical-Gene Interactions (Top 20)
化学-基因相互作用(前20个)
| Gene | Interaction | Type | Evidence |
|---|---|---|---|
| CYP1A2 | increases expression | mRNA | [T2] curated |
| TP53 | affects activity | protein | [T2] curated |
| ... | ... | ... | ... |
Total interactions found: 156
Top affected pathways: Xenobiotic metabolism, Apoptosis, DNA damage response
| 基因 | 相互作用 | 类型 | 证据 |
|---|---|---|---|
| CYP1A2 | 上调表达 | mRNA | [T2] 经过整理 |
| TP53 | 影响活性 | 蛋白质 | [T2] 经过整理 |
| ... | ... | ... | ... |
发现的相互作用总数: 156
受影响最显著的通路: 外源性物质代谢、细胞凋亡、DNA损伤应答
Chemical-Disease Associations (Top 10)
化学-疾病关联(前10个)
| Disease | Association Type | Evidence |
|---|---|---|
| Liver Neoplasms | marker/mechanism | [T2] curated |
| Contact Dermatitis | therapeutic | [T2] curated |
| ... | ... | ... |
---| 疾病 | 关联类型 | 证据 |
|---|---|---|
| 肝肿瘤 | 标记/机制 | [T2] 经过整理 |
| 接触性皮炎 | 治疗性 | [T2] 经过整理 |
| ... | ... | ... |
---Phase 4: Regulatory Safety (FDA Labels)
阶段4:监管安全(FDA标签)
When: Compound has an approved drug name
Objective: Extract regulatory safety information from FDA drug labels
适用场景: 化合物为已获批药物名称
目标: 从FDA药物标签中提取监管安全信息
Tools Used
使用的工具
| Tool | Information Retrieved | Parameter |
|---|---|---|
| Black box warnings (most serious) | |
| Absolute contraindications | |
| Known adverse reactions | |
| Warnings and precautions | |
| Animal toxicology data | |
| Carcinogenicity/mutagenicity/fertility data | |
| 工具 | 提取的信息 | 参数 |
|---|---|---|
| 黑框警告(最严重) | |
| 绝对禁忌症 | |
| 已知不良反应 | |
| 警告与注意事项 | |
| 动物毒理学数据 | |
| 致癌性/致突变性/生育力数据 | |
Workflow
工作流程
- Call all 6 FDA tools in parallel (independent queries by drug name)
- Parse and structure each response
- Prioritize: Boxed Warnings > Contraindications > Warnings > Adverse Reactions
- All FDA label data is [T1] evidence (regulatory finding based on human/animal data)
- 并行调用所有6个FDA工具(按药物名称独立查询)
- 解析并结构化每个工具的返回结果
- 优先级: 黑框警告 > 禁忌症 > 警告 > 不良反应
- 所有FDA标签数据为[T1]证据(基于人体/动物数据的监管结论)
Decision Logic
决策逻辑
- Boxed warning present: Flag as CRITICAL safety concern in executive summary
- No FDA data: Chemical may not be an approved drug; note "Not an FDA-approved drug" and continue with other phases
- Multiple warnings: Categorize by organ system (hepatic, cardiac, renal, CNS, etc.)
- Nonclinical toxicology: Grade as [T2] (animal data supporting human risk)
- 存在黑框警告: 在执行摘要中标记为严重安全风险
- 无FDA数据: 该化学物质可能未获批为药物;标注“非FDA获批药物”并继续其他阶段分析
- 多个警告: 按器官系统分类(肝、心脏、肾、中枢神经系统等)
- 非临床毒理学: 分级为[T2](支持人体风险的动物数据)
Output Format
输出格式
markdown
undefinedmarkdown
undefinedRegulatory Safety (FDA) [T1]
监管安全(FDA) [T1]
Boxed Warning
黑框警告
PRESENT - Hepatotoxicity risk with doses >4g/day. Liver failure reported. [T1]
存在 - 日剂量超过4g时存在肝毒性风险,已有肝衰竭病例报告。[T1]
Contraindications
禁忌症
- Severe hepatic impairment [T1]
- Known hypersensitivity [T1]
- 严重肝功能不全 [T1]
- 已知过敏反应 [T1]
Adverse Reactions (by frequency)
不良反应(按发生频率)
| Reaction | Frequency | Severity |
|---|---|---|
| Nausea | Common (>1%) | Mild |
| Hepatotoxicity | Rare (<0.1%) | Severe |
| ... | ... | ... |
| 反应 | 发生频率 | 严重程度 |
|---|---|---|
| 恶心 | 常见(>1%) | 轻度 |
| 肝毒性 | 罕见(<0.1%) | 重度 |
| ... | ... | ... |
Nonclinical Toxicology [T2]
非临床毒理学 [T2]
- Carcinogenicity: No carcinogenic potential in 2-year rat/mouse studies
- Mutagenicity: Negative in Ames assay and in vivo micronucleus test
- Fertility: No effects on fertility at doses up to 10x human dose
---- 致癌性: 2年大鼠/小鼠研究未发现致癌潜力
- 致突变性: Ames试验及体内微核试验结果为阴性
- 生育力: 剂量达人体剂量10倍时未发现生育力影响
---Phase 5: Drug Safety Profile (DrugBank)
阶段5:药物安全档案(DrugBank)
When: Compound is a known drug
Objective: Retrieve curated drug safety data from DrugBank
适用场景: 化合物为已知药物
目标: 从DrugBank获取经过整理的药物安全数据
Tools Used
使用的工具
| Tool | Information | Parameters |
|---|---|---|
| Toxicity, contraindications | |
| 工具 | 提取的信息 | 参数 |
|---|---|---|
| 毒性、禁忌症 | |
Workflow
工作流程
- Call
drugbank_get_safety_by_drug_name_or_drugbank_id(query=drug_name, case_sensitive=False, exact_match=False, limit=5) - Parse toxicity information, overdose data, contraindications
- Cross-reference with FDA data from Phase 4
- 调用
drugbank_get_safety_by_drug_name_or_drugbank_id(query=drug_name, case_sensitive=False, exact_match=False, limit=5) - 解析毒性信息、过量数据、禁忌症
- 与阶段4的FDA数据进行交叉验证
Decision Logic
决策逻辑
- Toxicity field: Contains LD50 values, overdose symptoms, organ toxicity data
- DrugBank ID: Note if found for cross-referencing
- Conflict with FDA: If DrugBank and FDA disagree, note discrepancy and defer to FDA [T1]
- Not found: Chemical may not be in DrugBank; continue with other phases
- 毒性字段: 包含LD50值、过量症状、器官毒性数据
- DrugBank ID: 如果找到,标注以便交叉引用
- 与FDA数据冲突: 如果DrugBank与FDA数据不一致,标注差异并以FDA[T1]数据为准
- 未找到数据: 该化学物质可能未收录于DrugBank;继续其他阶段分析
Phase 6: Chemical-Protein Interactions (STITCH)
阶段6:化学-蛋白质相互作用(STITCH)
When: Compound can be identified by name or SMILES
Objective: Map chemical-protein interaction network for off-target assessment
适用场景: 可通过名称或SMILES识别化合物
目标: 绘制化学-蛋白质相互作用网络以评估脱靶效应
Tools Used
使用的工具
| Tool | Function | Parameters |
|---|---|---|
| Resolve chemical name to STITCH ID | |
| Get chemical-protein interactions | |
| Get interaction network | |
| 工具 | 功能 | 参数 |
|---|---|---|
| 将化学物质名称解析为STITCH ID | |
| 获取化学-蛋白质相互作用 | |
| 获取相互作用网络 | |
Workflow
工作流程
- Resolve compound:
STITCH_resolve_identifier(identifier=compound_name, species=9606) - Get interactions:
STITCH_get_chemical_protein_interactions(identifiers=[stitch_id], species=9606, required_score=700) - Identify off-target proteins (not the intended drug target)
- Flag safety-relevant targets: hERG (cardiac), CYP enzymes (metabolism), nuclear receptors (endocrine)
- 解析化合物:
STITCH_resolve_identifier(identifier=compound_name, species=9606) - 获取相互作用:
STITCH_get_chemical_protein_interactions(identifiers=[stitch_id], species=9606, required_score=700) - 识别脱靶蛋白质(非预期药物靶点)
- 标记与安全相关的靶点: hERG(心脏)、CYP酶(代谢)、核受体(内分泌)
Decision Logic
决策逻辑
- High confidence (>900): Well-established interaction [T2]
- Medium confidence (700-900): Probable interaction [T3]
- Low confidence (400-700): Possible interaction, needs validation [T4]
- Safety-relevant targets: Flag interactions with known safety targets
- No STITCH data: Chemical may be too novel; note and continue
- 高置信度(>900): 已充分验证的相互作用 [T2]
- 中等置信度(700-900): 可能存在的相互作用 [T3]
- 低置信度(400-700): 可能存在的相互作用,需验证 [T4]
- 安全相关靶点: 标记与已知安全靶点的相互作用
- 无STITCH数据: 该化学物质可能过于新颖;标注并继续其他分析
Phase 7: Structural Alerts (ChEMBL)
阶段7:结构警报(ChEMBL)
When: ChEMBL molecule ID is available (from Phase 0)
Objective: Check for known toxic substructures
适用场景: 已获取ChEMBL分子ID(来自阶段0)
目标: 检查已知毒性子结构
Tools Used
使用的工具
| Tool | Function | Parameters |
|---|---|---|
| Find structural alert matches | |
| 工具 | 功能 | 参数 |
|---|---|---|
| 查找结构警报匹配 | |
Workflow
工作流程
- If ChEMBL ID available:
ChEMBL_search_compound_structural_alerts(molecule_chembl_id=chembl_id, limit=20) - Parse alert types: PAINS (pan-assay interference), Brenk (medicinal chemistry), Glaxo (GSK structural alerts)
- Categorize severity: Some alerts are informational, others indicate likely toxicity
- 如果有ChEMBL ID:
ChEMBL_search_compound_structural_alerts(molecule_chembl_id=chembl_id, limit=20) - 解析警报类型: PAINS(泛试验干扰)、Brenk(药物化学)、Glaxo(GSK结构警报)
- 分类严重程度: 部分警报为信息性,部分则提示潜在毒性
Decision Logic
决策逻辑
- PAINS alerts: May cause false positives in screening; note for medicinal chemistry
- Brenk alerts: Known problematic substructures; flag if present
- No alerts: Good sign but not definitive proof of safety
- No ChEMBL ID: Skip this phase gracefully; note "structural alert analysis not available"
- PAINS警报: 可能导致筛选假阳性;为药物化学研究标注
- Brenk警报: 已知问题子结构;如果存在则标记
- 无警报: 是良好信号,但不能作为安全性的确定性证明
- 无ChEMBL ID: 优雅跳过该阶段;标注“结构警报分析不可用”
Synthesis: Integrated Risk Assessment (MANDATORY)
合成阶段:综合风险评估(强制要求)
Always the final section. Integrates all evidence into actionable risk classification.
必须为最后一个章节。将所有证据整合为可执行的风险分类。
Risk Classification Matrix
风险分类矩阵
| Risk Level | Criteria |
|---|---|
| CRITICAL | FDA boxed warning present OR multiple [T1] toxicity findings OR active DILI + active hERG |
| HIGH | FDA warnings present OR [T2] animal toxicity OR multiple active ADMET endpoints |
| MEDIUM | Some [T3] predictions positive OR CTD disease associations OR structural alerts |
| LOW | All ADMET endpoints negative AND no FDA/DrugBank safety flags AND no CTD concerns |
| INSUFFICIENT DATA | Fewer than 3 phases returned data; cannot make confident assessment |
| 风险等级 | 标准 |
|---|---|
| 严重 | 存在FDA黑框警告 或 多个[T1]毒性发现 或 活性DILI+活性hERG |
| 高 | 存在FDA警告 或 [T2]动物毒性 或 多个活性ADMET终点 |
| 中等 | 部分[T3]预测为阳性 或 CTD疾病关联 或 存在结构警报 |
| 低 | 所有ADMET终点为阴性 且 无FDA/DrugBank安全标记 且 无CTD风险 |
| 数据不足 | 少于3个阶段返回数据;无法做出可信评估 |
Synthesis Template
合成模板
markdown
undefinedmarkdown
undefinedIntegrated Risk Assessment
综合风险评估
Overall Risk Classification: [HIGH]
整体风险分类: [高]
Evidence Summary
证据摘要
| Dimension | Finding | Evidence Tier | Concern |
|---|---|---|---|
| ADMET Toxicity | DILI active, hERG active | [T3] | HIGH |
| FDA Label | Boxed warning for hepatotoxicity | [T1] | CRITICAL |
| CTD Toxicogenomics | 156 gene interactions, liver neoplasms | [T2] | HIGH |
| DrugBank | Known hepatotoxicity at high doses | [T2] | HIGH |
| STITCH | Binds CYP3A4, hERG | [T3] | MEDIUM |
| Structural Alerts | 2 Brenk alerts | [T3] | MEDIUM |
| 维度 | 发现 | 证据等级 | 风险 |
|---|---|---|---|
| ADMET毒性 | DILI活性、hERG活性 | [T3] | 高 |
| FDA标签 | 存在肝毒性黑框警告 | [T1] | 严重 |
| CTD毒理基因组学 | 156个基因相互作用、肝肿瘤关联 | [T2] | 高 |
| DrugBank | 高剂量下存在已知肝毒性 | [T2] | 高 |
| STITCH | 与CYP3A4、hERG结合 | [T3] | 中等 |
| 结构警报 | 2个Brenk警报 | [T3] | 中等 |
Key Safety Concerns
关键安全风险
- Hepatotoxicity [T1]: FDA boxed warning + ADMET-AI DILI prediction + CTD liver disease associations
- Cardiac Risk [T3]: ADMET-AI hERG prediction + STITCH hERG interaction
- Drug Interactions [T3]: CYP3A4 substrate/inhibitor, potential DDI risk
- 肝毒性 [T1]: FDA黑框警告 + ADMET-AI DILI预测 + CTD肝疾病关联
- 心脏风险 [T3]: ADMET-AI hERG预测 + STITCH hERG相互作用
- 药物相互作用 [T3]: CYP3A4底物/抑制剂,存在潜在药物相互作用风险
Data Gaps
数据缺口
- No in vivo genotoxicity data available
- STITCH interaction scores moderate (700-900)
- No environmental exposure data
- 无体内遗传毒性数据
- STITCH相互作用评分中等(700-900)
- 无环境暴露数据
Recommendations
建议
- Avoid doses >4g/day (hepatotoxicity threshold) [T1]
- Monitor liver function in chronic use [T1]
- Screen for CYP3A4 interactions before co-administration [T3]
- Consider cardiac monitoring for at-risk patients [T3]
---- 避免日剂量超过4g(肝毒性阈值) [T1]
- 长期使用时监测肝功能 [T1]
- 联合用药前筛查CYP3A4相互作用 [T3]
- 对高危患者考虑心脏监测 [T3]
---Mandatory Completeness Checklist
强制完整性检查清单
Before finalizing any report, verify:
- Phase 0: Compound fully disambiguated (SMILES + CID at minimum)
- Phase 1: At least 5 toxicity endpoints reported or "prediction unavailable" noted
- Phase 2: ADMET profile with A/D/M/E sections or "not available" noted
- Phase 3: CTD queried; gene interactions and disease associations reported or "no data in CTD"
- Phase 4: FDA labels queried; results or "not an FDA-approved drug" noted
- Phase 5: DrugBank queried; results or "not found in DrugBank" noted
- Phase 6: STITCH queried; results or "no STITCH data available" noted
- Phase 7: Structural alerts checked or "ChEMBL ID not available" noted
- Synthesis: Risk classification provided with evidence summary
- Evidence Grading: All findings have [T1]-[T4] annotations
- Data Gaps: Explicitly listed in synthesis section
在最终确定报告前,需验证:
- 阶段0: 化合物已完全消歧义(至少包含SMILES + CID)
- 阶段1: 至少报告5个毒性终点或标注“预测不可用”
- 阶段2: ADMET分析包含吸收/分布/代谢/排泄章节或标注“不可用”
- 阶段3: 已查询CTD;报告基因相互作用和疾病关联或标注“CTD无数据”
- 阶段4: 已查询FDA标签;报告结果或标注“非FDA获批药物”
- 阶段5: 已查询DrugBank;报告结果或标注“未收录于DrugBank”
- 阶段6: 已查询STITCH;报告结果或标注“无STITCH数据”
- 阶段7: 已检查结构警报或标注“ChEMBL ID不可用”
- 合成阶段: 提供风险分类及证据摘要
- 证据分级: 所有发现均有[T1]-[T4]标注
- 数据缺口: 在合成阶段明确列出
Tool Parameter Reference
工具参数参考
Critical Parameter Notes (verified from source code):
| Tool | Parameter Name | Type | Notes |
|---|---|---|---|
| All ADMETAI tools | | | Always a list, even for single compound |
| All CTD tools | | | Chemical name, MeSH name, CAS RN, or MeSH ID |
| All FDA tools | | | Brand or generic drug name |
| drugbank_get_safety_* | | str, bool, bool, int | All 4 required |
| STITCH_resolve_identifier | | str, int | species=9606 for human |
| STITCH_get_chemical_protein_interactions | | list[str], int, int | required_score=400 default |
| PubChem_get_CID_by_compound_name | | | Compound name (not SMILES) |
| PubChem_get_compound_properties_by_CID | | | Numeric CID |
| ChEMBL_search_compound_structural_alerts | | | ChEMBL ID (e.g., "CHEMBL112") |
关键参数说明(已从源代码验证):
| 工具 | 参数名称 | 类型 | 说明 |
|---|---|---|---|
| 所有ADMETAI工具 | | | 始终为列表,即使单个化合物 |
| 所有CTD工具 | | | 化学物质名称、MeSH名称、CAS号或MeSH ID |
| 所有FDA工具 | | | 商品名或通用药物名称 |
| drugbank_get_safety_* | | str, bool, bool, int | 4个参数均为必填 |
| STITCH_resolve_identifier | | str, int | species=9606代表人类 |
| STITCH_get_chemical_protein_interactions | | list[str], int, int | required_score默认值为400 |
| PubChem_get_CID_by_compound_name | | | 化合物名称(非SMILES) |
| PubChem_get_compound_properties_by_CID | | | 数字CID |
| ChEMBL_search_compound_structural_alerts | | | ChEMBL ID(如“CHEMBL112”) |
Response Format Notes
返回格式说明
- ADMET-AI: Returns with prediction values
{status: "success", data: {...}} - CTD: Returns list of interaction/association objects
- FDA: Returns with label text
{status, data} - DrugBank: Returns with drug records
{data: [...]} - STITCH: Returns list of interaction objects with scores
- PubChem CID lookup: Returns (may or may not have
{IdentifierList: {CID: [...]}}wrapper)data - PubChem properties: Returns dict with ,
CID,MolecularWeight,ConnectivitySMILESIUPACName
- ADMET-AI: 返回包含预测值
{status: "success", data: {...}} - CTD: 返回相互作用/关联对象列表
- FDA: 返回包含标签文本
{status, data} - DrugBank: 返回包含药物记录
{data: [...]} - STITCH: 返回带评分的相互作用对象列表
- PubChem CID查询: 返回(可能包含或不包含
{IdentifierList: {CID: [...]}}包装)data - PubChem属性查询: 返回包含、
CID、MolecularWeight、ConnectivitySMILES的字典IUPACName
Fallback Strategies
回退策略
Compound Resolution
化合物解析
- Primary: PubChem by name -> CID -> properties -> SMILES
- Fallback 1: ChEMBL search by name -> molecule -> SMILES
- Fallback 2: If SMILES provided directly, skip name resolution
- 主策略: PubChem按名称查询→CID→属性→SMILES
- 回退1: ChEMBL按名称查询→分子→SMILES
- 回退2: 如果直接提供SMILES,跳过名称解析
Toxicity Prediction
毒性预测
- Primary: All 9 ADMET-AI endpoints
- Fallback: If ADMET-AI fails for a compound, note "prediction failed" and continue with database evidence
- Note: ADMET-AI may fail for very large or unusual SMILES
- 主策略: 所有9个ADMET-AI终点
- 回退: 如果ADMET-AI对某化合物调用失败,标注“预测失败”并继续使用数据库证据
- 注意: ADMET-AI可能对超大或特殊SMILES调用失败
Regulatory Data
监管数据
- Primary: FDA labels by drug name
- Fallback: If FDA returns no data, try alternative drug names (brand vs generic)
- Note: Non-drug chemicals (pesticides, industrial) will not have FDA labels
- 主策略: 按药物名称查询FDA标签
- 回退: 如果FDA未返回数据,尝试替代药物名称(商品名vs通用名)
- 注意: 非药物化学物质(农药、工业化学品)无FDA标签
CTD Data
CTD数据
- Primary: Search by common chemical name
- Fallback: Try MeSH name if common name fails
- Note: Novel compounds may not be in CTD
- 主策略: 按常用化学物质名称查询
- 回退: 如果常用名称失败,尝试MeSH名称
- 注意: 新型化合物可能未收录于CTD
Common Use Patterns
常见使用模式
Pattern 1: Novel Compound Assessment
模式1:新型化合物评估
Input: SMILES string for new molecule
Workflow: Phase 0 (SMILES->CID) -> Phase 1 (toxicity) -> Phase 2 (ADMET) -> Phase 7 (structural alerts) -> Synthesis
Output: Predictive safety profile for novel compound输入: 新型分子的SMILES字符串
工作流程: 阶段0(SMILES→CID)→阶段1(毒性预测)→阶段2(ADMET分析)→阶段7(结构警报)→合成阶段
输出: 新型化合物的预测安全档案Pattern 2: Approved Drug Safety Review
模式2:获批药物安全审查
Input: Drug name (e.g., "Acetaminophen")
Workflow: All phases (0-7 + Synthesis)
Output: Complete safety dossier with regulatory + predictive + database evidence输入: 药物名称(如“Acetaminophen”)
工作流程: 所有阶段(0-7 + 合成阶段)
输出: 包含监管、预测、数据库证据的完整安全档案Pattern 3: Environmental Chemical Risk
模式3:环境化学风险评估
Input: Chemical name (e.g., "Bisphenol A")
Workflow: Phase 0 -> Phase 1 -> Phase 2 -> Phase 3 (CTD, key for env chemicals) -> Phase 6 -> Synthesis
Output: Environmental health risk assessment focused on gene-disease associations输入: 化学物质名称(如“Bisphenol A”)
工作流程: 阶段0→阶段1→阶段2→阶段3(CTD,对环境化学品至关重要)→阶段6→合成阶段
输出: 聚焦基因-疾病关联的环境健康风险评估Pattern 4: Batch Toxicity Screening
模式4:批量毒性筛选
Input: Multiple SMILES strings
Workflow: Phase 0 -> Phase 1 (batch) -> Phase 2 (batch) -> Comparative table -> Synthesis
Output: Comparative toxicity table ranking compounds by safety输入: 多个SMILES字符串
工作流程: 阶段0→阶段1(批量)→阶段2(批量)→对比表格→合成阶段
输出: 按安全性排序的化合物毒性对比表格Pattern 5: Toxicogenomic Deep-Dive
模式5:毒理基因组学深度分析
Input: Chemical name + specific gene or disease interest
Workflow: Phase 0 -> Phase 3 (CTD expanded) -> Literature search -> Synthesis
Output: Detailed chemical-gene-disease mechanistic analysis输入: 化学物质名称 + 特定基因或疾病研究需求
工作流程: 阶段0→阶段3(扩展CTD分析)→文献检索→合成阶段
输出: 详细的化学-基因-疾病机制分析Output Report Structure
输出报告结构
All analyses generate a structured markdown report with progressive sections:
markdown
undefined所有分析生成结构化markdown报告,包含以下递进章节:
markdown
undefinedChemical Safety & Toxicology Report: [Compound Name]
化学安全与毒理学报告: [化合物名称]
Generated: YYYY-MM-DD HH:MM
Compound: [Name] | SMILES: [SMILES] | CID: [CID]
生成时间: YYYY-MM-DD HH:MM
化合物: [名称] | SMILES: [SMILES] | CID: [CID]
Executive Summary
执行摘要
[2-3 sentence overview with risk classification and key findings, all graded]
[2-3句话概述,包含风险分类和关键发现,均需分级]
1. Compound Identity
1. 化合物身份信息
[Phase 0 results - disambiguation table]
[阶段0结果 - 消歧表格]
2. Predictive Toxicology
2. 预测毒理学
[Phase 1 results - ADMET-AI toxicity endpoints]
[阶段1结果 - ADMET-AI毒性终点]
3. ADMET Profile
3. ADMET分析结果
[Phase 2 results - absorption, distribution, metabolism, excretion]
[阶段2结果 - 吸收、分布、代谢、排泄]
4. Toxicogenomics
4. 毒理基因组学
[Phase 3 results - CTD chemical-gene-disease relationships]
[阶段3结果 - CTD化学-基因-疾病关系]
5. Regulatory Safety
5. 监管安全
[Phase 4 results - FDA label information]
[阶段4结果 - FDA标签信息]
6. Drug Safety Profile
6. 药物安全档案
[Phase 5 results - DrugBank data]
[阶段5结果 - DrugBank数据]
7. Chemical-Protein Interactions
7. 化学-蛋白质相互作用
[Phase 6 results - STITCH network]
[阶段6结果 - STITCH网络]
8. Structural Alerts
8. 结构警报
[Phase 7 results - ChEMBL alerts]
[阶段7结果 - ChEMBL警报]
9. Integrated Risk Assessment
9. 综合风险评估
[Synthesis - risk classification, evidence summary, data gaps, recommendations]
[合成阶段 - 风险分类、证据摘要、数据缺口、建议]
Appendix: Methods and Data Sources
附录: 方法与数据来源
[Tool versions, databases queried, date of access]
---[工具版本、查询的数据库、访问日期]
---Limitations & Known Issues
局限性与已知问题
Tool-Specific
工具特定
- ADMET-AI: Predictions are computational [T3]; should not replace experimental testing
- CTD: Curated but may lag behind latest literature by 6-12 months
- FDA: Only covers FDA-approved drugs; not applicable to environmental chemicals or supplements
- DrugBank: Primarily drugs; limited coverage of industrial chemicals
- STITCH: Score thresholds affect sensitivity; lower scores increase false positives
- ChEMBL: Structural alerts require ChEMBL ID; not all compounds have one
- ADMET-AI: 预测为计算得出的[T3]证据;不能替代实验测试
- CTD: 经过整理但可能滞后于最新文献6-12个月
- FDA: 仅覆盖FDA获批药物;不适用于环境化学品或补充剂
- DrugBank: 主要收录药物;工业化学品覆盖有限
- STITCH: 评分阈值影响灵敏度;低评分会增加假阳性
- ChEMBL: 结构警报需要ChEMBL ID;并非所有化合物都有
Analysis
分析相关
- Novel compounds: May only have ADMET-AI predictions (no database evidence)
- Environmental chemicals: FDA/DrugBank phases will be empty; rely on CTD and ADMET-AI
- Batch mode: ADMET-AI can handle batches; other tools require individual queries
- Species specificity: Most data is human-centric; animal data noted where applicable
- 新型化合物: 可能仅能获取ADMET-AI预测结果(无数据库证据)
- 环境化学品: FDA/DrugBank阶段将无数据;依赖CTD和ADMET-AI
- 批量模式: ADMET-AI支持批量处理;其他工具需单独查询
- 物种特异性: 大多数数据以人类为中心;动物数据会单独标注
Technical
技术相关
- SMILES validity: Invalid SMILES will cause ADMET-AI failures
- Name ambiguity: Chemical names can be ambiguous; always verify with CID
- Rate limits: Some FDA endpoints may rate-limit for rapid queries
- SMILES有效性: 无效SMILES会导致ADMET-AI调用失败
- 名称歧义: 化学物质名称可能存在歧义;始终用CID验证
- 速率限制: 部分FDA端点可能对快速查询进行速率限制
Summary
总结
Chemical Safety & Toxicology Assessment Skill provides comprehensive safety evaluation by integrating:
- Predictive toxicology (ADMET-AI) - 9 tools covering toxicity, ADMET, physicochemical properties
- Toxicogenomics (CTD) - Chemical-gene-disease relationship mapping
- Regulatory safety (FDA) - 6 tools for label-based safety extraction
- Drug safety (DrugBank) - Curated toxicity and contraindication data
- Chemical interactions (STITCH) - Chemical-protein interaction networks
- Structural alerts (ChEMBL) - Known toxic substructure detection
Outputs: Structured markdown report with risk classification, evidence grading, and actionable recommendations
Best for: Drug safety assessment, chemical hazard profiling, environmental toxicology, ADMET characterization, toxicogenomic analysis
Total tools integrated: 25+ tools across 6 databases
化学安全与毒理学评估技能通过整合以下内容提供全面的安全评估:
- 预测毒理学(ADMET-AI)- 9个工具覆盖毒性、ADMET、理化特性
- 毒理基因组学(CTD)- 化学-基因-疾病关系映射
- 监管安全(FDA)- 6个工具提取标签安全信息
- 药物安全(DrugBank)- 经过整理的毒性和禁忌症数据
- 化学相互作用(STITCH)- 化学-蛋白质相互作用网络
- 结构警报(ChEMBL)- 已知毒性子结构检测
输出: 带有风险分类、证据分级和可执行建议的结构化markdown报告
最佳适用场景: 药物安全评估、化学危害分析、环境毒理学、ADMET特征分析、毒理基因组学分析
整合的工具总数: 6个数据库中的25+个工具