tooluniverse-crispr-screen-analysis
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseCRISPR Screen Analysis Workflow
CRISPR筛选分析工作流
Systematic analysis of CRISPR knockout/activation/interference screens to identify essential genes, synthetic lethal interactions, and therapeutic targets.
KEY PRINCIPLES:
- Report-first approach - Create comprehensive analysis report FIRST, then populate progressively
- Evidence grading - Grade all findings by confidence level (H/M/L based on statistical significance and validation data)
- Multi-dimensional analysis - Integrate essentiality, pathway context, druggability, and clinical relevance
- Citation requirements - Every conclusion must trace to source data (DepMap, literature, pathways)
- Mandatory completeness - All analysis sections must exist with data or explicit "No data" notes
- Context-aware interpretation - Consider cell line context, screen type, and biological pathway redundancy
对CRISPR敲除/激活/干扰筛选进行系统性分析,以识别必需基因、合成致死相互作用及治疗靶点。
核心原则:
- 报告优先法 - 先创建完整的分析报告,再逐步填充内容
- 证据分级 - 所有发现按置信度分级(基于统计显著性和验证数据分为高/中/低,对应★★★/★★☆/★☆☆)
- 多维度分析 - 整合必需性、通路背景、成药性及临床相关性
- 引用要求 - 每个结论必须可追溯至源数据(DepMap、文献、通路数据库)
- 完整性强制要求 - 所有分析章节必须存在,无数据时需明确标注“无数据”
- 背景感知解读 - 考虑细胞系背景、筛选类型及生物通路冗余性
When to Use This Skill
何时使用本技能
Apply when users:
- Have CRISPR screen hit lists (genes with significant phenotypes)
- Need to prioritize CRISPR hits for validation
- Want to identify essential genes for a specific cancer type
- Need synthetic lethal interaction analysis
- Ask "what are the top hits from my CRISPR screen?"
- Need drug target prioritization from functional genomics data
- Want pathway-level interpretation of screen results
当用户有以下需求时适用:
- 拥有CRISPR筛选的候选基因列表(具有显著表型的基因)
- 需要对CRISPR候选基因进行验证优先级排序
- 想要识别特定癌症类型的必需基因
- 需要进行合成致死相互作用分析
- 提问“我的CRISPR筛选中排名靠前的候选靶点是什么?”
- 需要从功能基因组数据中进行药物靶点优先级排序
- 想要对筛选结果进行通路层面的解读
⚠️ Known Issues & Workarounds
⚠️ 已知问题与解决方案
DepMap API Unavailability (2026-02-09)
DepMap API不可用(2026-02-09)
Issue: DepMap REST APIs (Sanger Cell Model Passports and Broad Institute) are currently non-operational.
Impact:
- PATH 0 (Gene Validation): DepMap gene registry unavailable
- PATH 1 (Essentiality Analysis): CRISPR dependency scores unavailable
Workaround: This skill now uses Pharos as fallback:
- Gene validation via
Pharos_get_target() - Druggability assessment via TDL (Target Development Level) classification
- TDL used as proxy for essentiality (Tclin targets are often essential)
- Evidence grading: Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆
Data Quality Trade-off:
- ✅ Gene validation: 100% success rate (Pharos has comprehensive drug target coverage)
- ⚠️ Essentiality scores: Druggability-based proxy (TDL classification)
- ℹ️ All findings labeled with source (Pharos vs DepMap)
Timeline: Permanent fix (CSV download) estimated 1-2 weeks. See for details.
DEPMAP_ISSUE_ANALYSIS.md问题: DepMap REST API(Sanger细胞模型数据库和Broad研究所)当前无法使用。
影响:
- 路径0(基因验证):DepMap基因注册表不可用
- 路径1(必需性分析):CRISPR依赖性评分不可用
解决方案: 本技能现在使用Pharos作为备选方案:
- 通过进行基因验证
Pharos_get_target() - 通过TDL(靶点开发层级)分类评估成药性
- 将TDL用作必需性的替代指标(Tclin靶点通常为必需基因)
- 证据分级:Tclin=★★★,Tchem=★★☆,Tbio/Tdark=★☆☆
数据质量权衡:
- ✅ 基因验证:100%成功率(Pharos覆盖全面的药物靶点)
- ⚠️ 必需性评分:基于成药性的替代指标(TDL分类)
- ℹ️ 所有发现均标注数据源(Pharos或DepMap)
时间线: 永久修复方案(CSV下载)预计1-2周完成。详情见
DEPMAP_ISSUE_ANALYSIS.mdInput Types Supported
支持的输入类型
1. Gene List from User's Screen
1. 用户筛选得到的基因列表
- Format: Gene symbols (e.g., EGFR, KRAS, TP53)
- Minimum: 5 genes (for meaningful enrichment)
- Optimal: 20-100 genes (hits from primary screen)
- Context needed: Cancer type, screen type (dropout/enrichment), cell line used
- 格式: 基因符号(如EGFR、KRAS、TP53)
- 最低要求: 5个基因(以获得有意义的富集结果)
- 最优: 20-100个基因(初筛得到的候选靶点)
- 所需背景信息: 癌症类型、筛选类型(dropout/enrichment)、使用的细胞系
2. Cancer Type Query
2. 癌症类型查询
- Format: Cancer type name (e.g., "non-small cell lung cancer", "breast cancer")
- Workflow: Retrieve top essential genes for that cancer from DepMap
- Output: Ranked target list with essentiality scores
- 格式: 癌症类型名称(如“非小细胞肺癌”、“乳腺癌”)
- 工作流: 从DepMap检索该癌症的顶级必需基因
- 输出: 带必需性评分的靶点排名列表
3. Gene of Interest
3. 目标基因查询
- Format: Single gene symbol
- Workflow: Analyze essentiality across cancer types, identify selective dependencies
- Output: Target validation report with tissue specificity
- 格式: 单个基因符号
- 工作流: 分析该基因在不同癌症类型中的必需性,识别选择性依赖性
- 输出: 带组织特异性的靶点验证报告
Critical Workflow Requirements
关键工作流要求
1. Report-First Approach (MANDATORY)
1. 报告优先法(强制要求)
DO NOT show intermediate tool outputs. Instead:
-
Create report file FIRST before any analysis:
- File name:
CRISPR_screen_analysis_[CONTEXT].md - Initialize with all section headers
- Add placeholder: in each section
[Analyzing...]
- File name:
-
Progressively update as data arrives:
- Replace with findings
[Analyzing...] - Include "No significant enrichment" when appropriate
- Document failed analyses explicitly
- Replace
-
Final deliverable: Complete markdown report + optional plots (if user requests)
禁止展示中间工具输出,应遵循以下步骤:
-
先创建报告文件再进行任何分析:
- 文件名:
CRISPR_screen_analysis_[CONTEXT].md - 初始化所有章节标题
- 在每个章节添加占位符:
[分析中...]
- 文件名:
-
随着数据获取逐步更新:
- 用发现结果替换
[分析中...] - 适当时标注“无显著富集”
- 明确记录失败的分析
- 用发现结果替换
-
最终交付物: 完整的markdown报告 + 可选的图表(若用户要求)
2. Evidence Grading System (MANDATORY)
2. 证据分级系统(强制要求)
Grade every finding by confidence level:
| Level | Symbol | Criteria | Examples |
|---|---|---|---|
| HIGH | ★★★ | DepMap score <-1.0, p<0.01, validated in literature | Strong essential gene, clinical drug target |
| MEDIUM | ★★☆ | DepMap score -0.5 to -1.0, p<0.05, pathway coherence | Moderate dependency, pathway member |
| LOW | ★☆☆ | DepMap score >-0.5, marginal significance, weak validation | Weak hit, potential off-target |
为每个发现按置信度分级:
| 级别 | 符号 | 标准 | 示例 |
|---|---|---|---|
| 高 | ★★★ | DepMap评分<-1.0,p<0.01,有文献验证 | 强必需基因、临床药物靶点 |
| 中 | ★★☆ | DepMap评分-0.5至-1.0,p<0.05,通路一致性良好 | 中度依赖性、通路成员 |
| 低 | ★☆☆ | DepMap评分>-0.5,显著性较弱,验证证据不足 | 弱候选靶点、潜在脱靶效应 |
3. Contextualization Requirements
3. 背景关联要求
Every gene-level finding must include:
- Essentiality score (DepMap gene effect)
- Pan-cancer vs selective (is it essential in all cancers or specific subset?)
- Druggability (existing drugs, chemical probes, tractability)
- Pathway context (which pathways/complexes does it belong to?)
- Clinical relevance (approved targets, ongoing trials, biomarkers)
每个基因层面的发现必须包含:
- 必需性评分(DepMap基因效应值)
- 泛癌vs选择性(是否在所有癌症中必需,还是仅在特定亚群中必需?)
- 成药性(现有药物、化学探针、可靶向性)
- 通路背景(属于哪些通路/复合物?)
- 临床相关性(已获批靶点、正在进行的临床试验、生物标志物)
Core Analysis Strategy: 7 Research Paths
核心分析策略:7个研究路径
User Input (gene list OR cancer type OR single gene)
│
├─ PATH 0: Input Processing & Validation
│ ├─ Validate gene symbols
│ ├─ Determine analysis mode
│ └─ Set context parameters
│
├─ PATH 1: Gene Essentiality Analysis (DepMap)
│ ├─ Query gene dependencies for each hit
│ ├─ Retrieve essentiality scores across cell lines
│ ├─ Calculate pan-cancer vs selective essentiality
│ └─ Rank genes by dependency strength
│
├─ PATH 2: Pathway & Functional Enrichment
│ ├─ GO enrichment (biological process, molecular function)
│ ├─ Pathway enrichment (Reactome, WikiPathways, KEGG)
│ ├─ Hallmark gene set enrichment (MSigDB)
│ └─ Identify pathway-level vulnerabilities
│
├─ PATH 3: Protein-Protein Interaction Networks
│ ├─ Build PPI network for hit genes
│ ├─ Identify protein complexes
│ ├─ Find synthetic lethal candidates
│ └─ Hub gene analysis
│
├─ PATH 4: Druggability & Target Assessment
│ ├─ Check existing drugs (DGIdb, ChEMBL)
│ ├─ Assess chemical tractability (Pharos TDL)
│ ├─ Find chemical probes (Open Targets)
│ └─ Clinical trial status (ClinicalTrials.gov)
│
├─ PATH 5: Disease Association & Clinical Relevance
│ ├─ Gene-disease associations (Open Targets)
│ ├─ Somatic mutations in cancer (COSMIC, cBioPortal)
│ ├─ Expression in patient samples (GTEx, TCGA)
│ └─ Prognostic/predictive biomarker status
│
└─ PATH 6: Hit Prioritization & Validation Guidance
├─ Integrate all evidence dimensions
├─ Calculate priority score (essentiality + druggability + clinical relevance)
├─ Recommend validation experiments
└─ Identify top 5-10 targets for follow-up用户输入(基因列表或癌症类型或单个基因)
│
├─ 路径0: 输入处理与验证
│ ├─ 验证基因符号
│ ├─ 确定分析模式
│ └─ 设置背景参数
│
├─ 路径1: 基因必需性分析(DepMap)
│ ├─ 查询每个候选靶点的基因依赖性
│ ├─ 检索跨细胞系的必需性评分
│ ├─ 计算泛癌vs选择性必需性
│ └─ 按依赖性强度对基因排名
│
├─ 路径2: 通路与功能富集
│ ├─ GO富集(生物过程、分子功能)
│ ├─ 通路富集(Reactome、WikiPathways、KEGG)
│ ├─ 特征基因集富集(MSigDB Hallmark)
│ └─ 识别通路层面的脆弱性
│
├─ 路径3: 蛋白质-蛋白质相互作用网络
│ ├─ 为候选基因构建PPI网络
│ ├─ 识别蛋白质复合物
│ ├─ 寻找合成致死候选靶点
│ └─ 枢纽基因分析
│
├─ 路径4: 成药性与靶点评估
│ ├─ 检查现有药物(DGIdb、ChEMBL)
│ ├─ 评估化学可靶向性(Pharos TDL)
│ ├─ 寻找化学探针(Open Targets)
│ └─ 临床试验状态(ClinicalTrials.gov)
│
├─ 路径5: 疾病关联与临床相关性
│ ├─ 基因-疾病关联(Open Targets)
│ ├─ 癌症中的体细胞突变(COSMIC、cBioPortal)
│ ├─ 患者样本中的表达情况(GTEx、TCGA)
│ └─ 预后/预测生物标志物状态
│
└─ 路径6: 候选靶点优先级排序与验证指导
├─ 整合所有证据维度
├─ 计算优先级评分(必需性+成药性+临床相关性)
├─ 推荐验证实验
└─ 确定前5-10个待跟进的靶点PATH 0: Input Processing & Validation
路径0: 输入处理与验证
Determine Analysis Mode
确定分析模式
python
def determine_analysis_mode(user_input):
"""
Figure out what type of analysis to run.
Returns: 'gene_list', 'cancer_type', or 'single_gene'
"""
if isinstance(user_input, list) and len(user_input) >= 5:
return 'gene_list' # User provided hits from their screen
elif isinstance(user_input, str) and len(user_input.split()) > 1:
return 'cancer_type' # User asks about a cancer type
else:
return 'single_gene' # Single gene target validationpython
def determine_analysis_mode(user_input):
"""
Figure out what type of analysis to run.
Returns: 'gene_list', 'cancer_type', or 'single_gene'
"""
if isinstance(user_input, list) and len(user_input) >= 5:
return 'gene_list' # User provided hits from their screen
elif isinstance(user_input, str) and len(user_input.split()) > 1:
return 'cancer_type' # User asks about a cancer type
else:
return 'single_gene' # Single gene target validationGene Symbol Validation
基因符号验证
CRITICAL: Validate gene symbols with fallback to Open Targets if DepMap unavailable.
python
def validate_gene_symbols(tu, gene_list):
"""
Validate gene symbols with DepMap fallback to Open Targets.
Returns: dict with valid_genes, invalid_genes, suggestions, data_source
"""
validated = {
'valid': [],
'invalid': [],
'suggestions': {},
'data_source': None
}
# Try DepMap first
depmap_available = False
test_result = tu.tools.DepMap_search_genes(query="KRAS")
if (test_result.get('status') == 'success' and
not test_result.get('error', '').startswith('DepMap API')):
depmap_available = True
validated['data_source'] = 'DepMap (primary)'
if depmap_available:
# Use original DepMap validation logic
for gene in gene_list:
result = tu.tools.DepMap_search_genes(query=gene)
if result.get('status') == 'success':
genes = result.get('data', {}).get('genes', [])
exact_matches = [g for g in genes
if g.get('symbol', '').upper() == gene.upper()]
if exact_matches:
validated['valid'].append({
'input': gene,
'symbol': exact_matches[0]['symbol'],
'ensembl_id': exact_matches[0].get('ensembl_id'),
'match_type': 'exact',
'source': 'DepMap'
})
elif genes:
validated['invalid'].append(gene)
validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
else:
validated['invalid'].append(gene)
else:
# FALLBACK: Use Pharos (druggability database)
print("⚠️ DepMap unavailable, using Pharos for gene validation...")
validated['data_source'] = 'Pharos (fallback - ★★☆)'
for gene in gene_list:
# Query Pharos to check if gene exists
result = tu.tools.Pharos_get_target(gene=gene)
if result.get('status') == 'success' and result.get('data'):
target_data = result.get('data', {})
validated['valid'].append({
'input': gene,
'symbol': target_data.get('name', gene),
'tdl': target_data.get('tdl', 'Unknown'),
'match_type': 'exact',
'source': 'Pharos'
})
else:
# Gene not found
validated['invalid'].append(gene)
return validatedOutput for Report:
markdown
undefined关键要求: 验证基因符号,若DepMap不可用则使用Open Targets作为备选。
python
def validate_gene_symbols(tu, gene_list):
"""
Validate gene symbols with DepMap fallback to Open Targets.
Returns: dict with valid_genes, invalid_genes, suggestions, data_source
"""
validated = {
'valid': [],
'invalid': [],
'suggestions': {},
'data_source': None
}
# Try DepMap first
depmap_available = False
test_result = tu.tools.DepMap_search_genes(query="KRAS")
if (test_result.get('status') == 'success' and
not test_result.get('error', '').startswith('DepMap API')):
depmap_available = True
validated['data_source'] = 'DepMap (primary)'
if depmap_available:
# Use original DepMap validation logic
for gene in gene_list:
result = tu.tools.DepMap_search_genes(query=gene)
if result.get('status') == 'success':
genes = result.get('data', {}).get('genes', [])
exact_matches = [g for g in genes
if g.get('symbol', '').upper() == gene.upper()]
if exact_matches:
validated['valid'].append({
'input': gene,
'symbol': exact_matches[0]['symbol'],
'ensembl_id': exact_matches[0].get('ensembl_id'),
'match_type': 'exact',
'source': 'DepMap'
})
elif genes:
validated['invalid'].append(gene)
validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
else:
validated['invalid'].append(gene)
else:
# FALLBACK: Use Pharos (druggability database)
print("⚠️ DepMap unavailable, using Pharos for gene validation...")
validated['data_source'] = 'Pharos (fallback - ★★☆)'
for gene in gene_list:
# Query Pharos to check if gene exists
result = tu.tools.Pharos_get_target(gene=gene)
if result.get('status') == 'success' and result.get('data'):
target_data = result.get('data', {})
validated['valid'].append({
'input': gene,
'symbol': target_data.get('name', gene),
'tdl': target_data.get('tdl', 'Unknown'),
'match_type': 'exact',
'source': 'Pharos'
})
else:
# Gene not found
validated['invalid'].append(gene)
return validated报告输出示例:
markdown
undefinedInput Validation
输入验证
Genes Provided: 25 gene symbols
Valid Genes: 23 (92%)
Invalid/Ambiguous: 2
Data Source: {data_source from validated dict}
Invalid Genes:
- → Gene symbol not recognized (mutation-specific identifier)
EGFRVIII - → Did you mean
P53? (use official gene symbol)TP53
Proceeding with 23 valid gene symbols for analysis.
Source: {DepMap gene registry OR Pharos (fallback)}
Note: If using Pharos fallback due to DepMap unavailability, validation provides gene symbols and TDL classification (druggability level). Validation is ★★☆ reliable.
---提供的基因: 25个基因符号
有效基因: 23个(92%)
无效/模糊基因: 2个
数据源: {validated字典中的data_source}
无效基因:
- → 基因符号未被识别(突变特异性标识符)
EGFRVIII - → 是否指
P53?(请使用官方基因符号)TP53
将基于23个有效基因符号进行分析
来源: {DepMap基因注册表或Pharos(备选)}
注意: 若因DepMap不可用而使用Pharos备选方案,验证将提供基因符号和TDL分类(成药层级)。验证置信度为★★☆。
---PATH 1: Gene Essentiality Analysis (DepMap with Open Targets Fallback)
路径1: 基因必需性分析(DepMap + Open Targets备选)
Retrieve Essentiality Scores
检索必需性评分
python
def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
"""
Get gene essentiality data with DepMap fallback to Open Targets.
DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
"""
essentiality_data = []
# Check if DepMap is available
test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
depmap_available = (
test_result.get('status') == 'success' and
not test_result.get('error', '').startswith('DepMap API')
)
if depmap_available:
# Use original DepMap logic (optimal - ★★★)
for gene in gene_list:
dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
if dep_result.get('status') == 'success':
gene_data = dep_result.get('data', {})
essentiality_data.append({
'gene': gene,
'data': gene_data,
'essentiality_class': classify_essentiality_depmap(gene_data),
'source': 'DepMap',
'confidence': 'HIGH' # ★★★
})
else:
# FALLBACK: Use Pharos TDL classification as proxy
print("⚠️ DepMap unavailable, using Pharos TDL as essentiality proxy...")
for gene in gene_list:
pharos_result = tu.tools.Pharos_get_target(gene=gene)
if pharos_result.get('status') == 'success' and pharos_result.get('data'):
target_data = pharos_result.get('data', {})
# Use TDL (Target Development Level) as proxy for essentiality
essentiality_class = classify_essentiality_pharos(target_data)
essentiality_data.append({
'gene': gene,
'data': target_data,
'essentiality_class': essentiality_class,
'source': 'Pharos',
'confidence': essentiality_class['confidence'],
'note': 'Essentiality inferred from TDL classification'
})
return essentiality_data
def classify_essentiality_depmap(gene_data):
"""Classify gene essentiality based on DepMap CRISPR scores."""
# Original DepMap classification logic
return {
'pan_cancer': False,
'selective': True,
'non_essential': False
}
def classify_essentiality_pharos(target_data):
"""
Infer essentiality from Pharos TDL classification (fallback method).
TDL (Target Development Level) categories:
- Tclin: Clinical drug target (approved drugs) → Likely essential/important
- Tchem: Chemical tool/probe available → Druggable, possibly essential
- Tbio: Biological evidence → Some relevance
- Tdark: No drug/tool → Unknown essentiality
"""
tdl = target_data.get('tdl', 'Unknown')
if tdl == 'Tclin':
return {
'classification': 'LIKELY_ESSENTIAL',
'confidence': 'HIGH', # ★★★
'tdl': tdl,
'rationale': (
'Approved drug target (Tclin). '
'Clinically validated targets are often essential. '
'For cell-line-specific scores, await DepMap restoration.'
)
}
elif tdl == 'Tchem':
return {
'classification': 'POTENTIALLY_ESSENTIAL',
'confidence': 'MEDIUM', # ★★☆
'tdl': tdl,
'rationale': (
'Chemical tools available (Tchem). '
'Druggable targets with chemical probes often have functional relevance.'
)
}
elif tdl == 'Tbio':
return {
'classification': 'UNCERTAIN',
'confidence': 'LOW', # ★☆☆
'tdl': tdl,
'rationale': (
'Biological evidence only (Tbio). '
'Limited druggability data. Essentiality unclear.'
)
}
else: # Tdark or Unknown
return {
'classification': 'UNKNOWN',
'confidence': 'LOW', # ★☆☆
'tdl': tdl,
'rationale': (
'Dark target or unknown. '
'No drug/tool data. Essentiality cannot be inferred.'
)
}python
def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
"""
Get gene essentiality data with DepMap fallback to Open Targets.
DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
"""
essentiality_data = []
# Check if DepMap is available
test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
depmap_available = (
test_result.get('status') == 'success' and
not test_result.get('error', '').startswith('DepMap API')
)
if depmap_available:
# Use original DepMap logic (optimal - ★★★)
for gene in gene_list:
dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
if dep_result.get('status') == 'success':
gene_data = dep_result.get('data', {})
essentiality_data.append({
'gene': gene,
'data': gene_data,
'essentiality_class': classify_essentiality_depmap(gene_data),
'source': 'DepMap',
'confidence': 'HIGH' # ★★★
})
else:
# FALLBACK: Use Pharos TDL classification as proxy
print("⚠️ DepMap unavailable, using Pharos TDL as essentiality proxy...")
for gene in gene_list:
pharos_result = tu.tools.Pharos_get_target(gene=gene)
if pharos_result.get('status') == 'success' and pharos_result.get('data'):
target_data = pharos_result.get('data', {})
# Use TDL (Target Development Level) as proxy for essentiality
essentiality_class = classify_essentiality_pharos(target_data)
essentiality_data.append({
'gene': gene,
'data': target_data,
'essentiality_class': essentiality_class,
'source': 'Pharos',
'confidence': essentiality_class['confidence'],
'note': 'Essentiality inferred from TDL classification'
})
return essentiality_data
def classify_essentiality_depmap(gene_data):
"""Classify gene essentiality based on DepMap CRISPR scores."""
# Original DepMap classification logic
return {
'pan_cancer': False,
'selective': True,
'non_essential': False
}
def classify_essentiality_pharos(target_data):
"""
Infer essentiality from Pharos TDL classification (fallback method).
TDL (Target Development Level) categories:
- Tclin: Clinical drug target (approved drugs) → Likely essential/important
- Tchem: Chemical tool/probe available → Druggable, possibly essential
- Tbio: Biological evidence → Some relevance
- Tdark: No drug/tool → Unknown essentiality
"""
tdl = target_data.get('tdl', 'Unknown')
if tdl == 'Tclin':
return {
'classification': 'LIKELY_ESSENTIAL',
'confidence': 'HIGH', # ★★★
'tdl': tdl,
'rationale': (
'Approved drug target (Tclin). '
'Clinically validated targets are often essential. '
'For cell-line-specific scores, await DepMap restoration.'
)
}
elif tdl == 'Tchem':
return {
'classification': 'POTENTIALLY_ESSENTIAL',
'confidence': 'MEDIUM', # ★★☆
'tdl': tdl,
'rationale': (
'Chemical tools available (Tchem). '
'Druggable targets with chemical probes often have functional relevance.'
)
}
elif tdl == 'Tbio':
return {
'classification': 'UNCERTAIN',
'confidence': 'LOW', # ★☆☆
'tdl': tdl,
'rationale': (
'Biological evidence only (Tbio). '
'Limited druggability data. Essentiality unclear.'
)
}
else: # Tdark or Unknown
return {
'classification': 'UNKNOWN',
'confidence': 'LOW', # ★☆☆
'tdl': tdl,
'rationale': (
'Dark target or unknown. '
'No drug/tool data. Essentiality cannot be inferred.'
)
}Cancer Type-Specific Analysis
癌症类型特异性分析
For cancer type queries, retrieve top essential genes:
python
def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
"""
Retrieve top essential genes for a specific cancer type from DepMap.
"""
# Get cell lines for this cancer type
cell_lines = tu.tools.DepMap_get_cell_lines(
cancer_type=cancer_type,
page_size=100
)
if not cell_lines.get('data', {}).get('cell_lines'):
return {'error': f'No cell lines found for {cancer_type}'}
# For each cell line, would need to query dependencies
# Note: DepMap API may not support direct "top genes by cancer type" query
# May need to aggregate manually or use different approach
return {
'cancer_type': cancer_type,
'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
'note': 'Full analysis requires per-cell-line dependency data aggregation'
}Output for Report (when DepMap available):
markdown
undefined针对癌症类型查询,检索顶级必需基因:
python
def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
"""
Retrieve top essential genes for a specific cancer type from DepMap.
"""
# Get cell lines for this cancer type
cell_lines = tu.tools.DepMap_get_cell_lines(
cancer_type=cancer_type,
page_size=100
)
if not cell_lines.get('data', {}).get('cell_lines'):
return {'error': f'No cell lines found for {cancer_type}'}
# For each cell line, would need to query dependencies
# Note: DepMap API may not support direct "top genes by cancer type" query
# May need to aggregate manually or use different approach
return {
'cancer_type': cancer_type,
'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
'note': 'Full analysis requires per-cell-line dependency data aggregation'
}DepMap可用时的报告输出:
markdown
undefined1. Gene Essentiality Analysis
1. 基因必需性分析
Data Source: DepMap CRISPR (24Q2) ✅
Confidence: ★★★ HIGH
数据源: DepMap CRISPR (24Q2) ✅
置信度: ★★★ 高
Strongly Essential Genes (DepMap Score < -1.0)
强必需基因(DepMap评分 < -1.0)
| Gene | Mean Effect | Essential Cell Lines (%) | Selectivity | Evidence |
|---|---|---|---|---|
| RPL5 | -1.45 | 98% (1,042/1,063) | Pan-cancer | ★★★ |
| RPS6 | -1.32 | 96% (1,019/1,063) | Pan-cancer | ★★★ |
| POLR2A | -1.28 | 95% (1,010/1,063) | Pan-cancer | ★★★ |
Interpretation: These genes are essential for cell survival across nearly all cancer types. They are core fitness genes (ribosomal proteins, RNA polymerase) and likely not selective therapeutic targets.
Source: DepMap via
DepMap_get_gene_dependencies
**Output for Report (when using Pharos fallback)**:
```markdown| 基因 | 平均效应值 | 必需细胞系占比 | 选择性 | 证据 |
|---|---|---|---|---|
| RPL5 | -1.45 | 98% (1,042/1,063) | 泛癌 | ★★★ |
| RPS6 | -1.32 | 96% (1,019/1,063) | 泛癌 | ★★★ |
| POLR2A | -1.28 | 95% (1,010/1,063) | 泛癌 | ★★★ |
解读: 这些基因在几乎所有癌症类型中对细胞存活都是必需的。它们是核心 fitness 基因(核糖体蛋白、RNA聚合酶),可能不适合作为选择性治疗靶点。
来源: DepMap via
DepMap_get_gene_dependencies
**使用Pharos备选方案时的报告输出**:
```markdown1. Gene Essentiality Analysis
1. 基因必需性分析
⚠️ Data Source: Pharos (DepMap CRISPR temporarily unavailable)
Analysis Method: Essentiality inferred from TDL (Target Development Level) classification
Confidence: Varies by TDL (Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆)
⚠️ 数据源: Pharos(DepMap CRISPR暂时不可用)
分析方法: 基于TDL(靶点开发层级)分类推断必需性
置信度: 随TDL而异(Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆)
Clinically Validated Targets (Tclin - Likely Essential)
临床验证靶点(Tclin - 可能必需)
| Gene | TDL | Clinical Status | Inference | Evidence |
|---|---|---|---|---|
| KRAS | Tclin | Approved drugs (sotorasib, adagrasib) | Likely essential in KRAS-mutant cancers | ★★★ |
| EGFR | Tclin | Multiple approved inhibitors | Likely essential in EGFR-mutant cancers | ★★★ |
Interpretation: Tclin targets have approved drugs, indicating clinical validation. These are likely essential in specific contexts (mutation-dependent).
| 基因 | TDL | 临床状态 | 推断结果 | 证据 |
|---|---|---|---|---|
| KRAS | Tclin | 已获批药物(sotorasib、adagrasib) | 在KRAS突变型癌症中可能必需 | ★★★ |
| EGFR | Tclin | 多种获批抑制剂 | 在EGFR突变型癌症中可能必需 | ★★★ |
解读: Tclin靶点拥有获批药物,表明已通过临床验证。这些靶点在特定背景下(依赖突变)可能是必需的。
Chemical Probe Available (Tchem - Potentially Essential)
有化学探针可用的靶点(Tchem - 潜在必需)
| Gene | TDL | Tool Status | Inference | Evidence |
|---|---|---|---|---|
| CDK2 | Tchem | Chemical probes available | Potentially essential (cell cycle) | ★★☆ |
| WEE1 | Tchem | Chemical inhibitors available | Potentially essential (DNA damage) | ★★☆ |
Interpretation: Tchem targets are druggable with chemical tools. Druggability suggests functional importance.
Note: TDL classification is a proxy for essentiality. For definitive CRISPR dependency scores, DepMap data required.
Source: Pharos via (fallback method)
Pharos_get_target| 基因 | TDL | 工具状态 | 推断结果 | 证据 |
|---|---|---|---|---|
| CDK2 | Tchem | 有化学探针可用 | 潜在必需(细胞周期相关) | ★★☆ |
| WEE1 | Tchem | 有化学抑制剂可用 | 潜在必需(DNA损伤相关) | ★★☆ |
解读: Tchem靶点可被化学工具靶向。成药性表明其具有功能相关性。
注意: TDL分类是必需性的替代指标。如需明确的CRISPR依赖性评分,需等待DepMap数据恢复。
来源: Pharos via (备选方法)
Pharos_get_targetSelectively Essential Genes (Tissue/Context-Specific)
选择性必需基因(组织/背景特异性)
| Gene | Mean Effect | Essential in | Non-Essential in | Selectivity Score | Evidence |
|---|---|---|---|---|---|
| KRAS | -0.85 | Pancreatic (95%), Lung (78%), Colon (82%) | Breast (12%), Glioma (8%) | High | ★★★ |
| EGFR | -0.72 | Lung (85%), Glioblastoma (76%) | Most others (<20%) | High | ★★★ |
| ESR1 | -0.68 | ER+ Breast (92%) | ER- Breast (5%), Other (<3%) | Very High | ★★★ |
Interpretation: Selectively essential genes show strong context-dependency and represent high-value therapeutic targets with potential for tissue-selective toxicity profiles.
Source: DepMap via
DepMap_get_gene_dependencies| 基因 | 平均效应值 | 在以下癌症中必需 | 在以下癌症中非必需 | 选择性评分 | 证据 |
|---|---|---|---|---|---|
| KRAS | -0.85 | 胰腺癌(95%)、肺癌(78%)、结肠癌(82%) | 乳腺癌(12%)、胶质瘤(8%) | 高 | ★★★ |
| EGFR | -0.72 | 肺癌(85%)、胶质母细胞瘤(76%) | 大多数其他癌症(<20%) | 高 | ★★★ |
| ESR1 | -0.68 | ER+乳腺癌(92%) | ER-乳腺癌(5%)、其他(<3%) | 极高 | ★★★ |
解读: 选择性必需基因表现出强背景依赖性,是具有组织选择性毒性潜力的高价值治疗靶点。
来源: DepMap via
DepMap_get_gene_dependenciesNon-Essential/Weak Hits (Score > -0.5)
弱必需/非必需候选靶点(评分 > -0.5)
| Gene | Mean Effect | % Essential | Interpretation |
|---|---|---|---|
| GENE1 | -0.25 | 15% | Weak dependency, potential off-target or passenger |
| GENE2 | -0.12 | 8% | Non-essential in most contexts |
Note: These genes may still be biologically relevant (e.g., synthetic lethal interactions, drug targets for specific contexts) but show weak essentiality in CRISPR screens.
Essentiality Summary:
- Pan-cancer essential: 12 genes (↓ deprioritize for selective targeting)
- Selectively essential: 18 genes (★ HIGH PRIORITY for validation)
- Weakly essential: 15 genes (context-dependent, requires further investigation)
All essentiality data from DepMap Portal (DepMap Public 24Q2 release)
---| 基因 | 平均效应值 | 必需占比 | 解读 |
|---|---|---|---|
| GENE1 | -0.25 | 15% | 弱依赖性,潜在脱靶或乘客基因 |
| GENE2 | -0.12 | 8% | 在大多数背景下非必需 |
注意: 这些基因仍可能具有生物学相关性(如合成致死相互作用、特定背景下的药物靶点),但在CRISPR筛选中显示出弱必需性。
必需性总结:
- 泛癌必需: 12个基因(↓ 不优先作为选择性靶点)
- 选择性必需: 18个基因(★ 验证高优先级)
- 弱必需: 15个基因(背景依赖性,需进一步研究)
所有必需性数据来自DepMap Portal(DepMap Public 24Q2版本)
---PATH 2: Pathway & Functional Enrichment
路径2: 通路与功能富集
Gene Set Enrichment Analysis
基因集富集分析
python
def perform_pathway_enrichment(tu, gene_list):
"""
Run enrichment analysis across multiple libraries.
"""
# Enrichr libraries to query
libraries = [
"WikiPathways_2024_Human",
"Reactome_Pathways_2024",
"MSigDB_Hallmark_2020",
"GO_Biological_Process_2023",
"GO_Molecular_Function_2023",
"GO_Cellular_Component_2023",
"KEGG_2024_Human"
]
result = tu.tools.enrichr_gene_enrichment_analysis(
gene_list=gene_list,
libs=libraries
)
# Parse results - Enrichr returns pathway rankings with p-values
return resultOutput for Report:
markdown
undefinedpython
def perform_pathway_enrichment(tu, gene_list):
"""
Run enrichment analysis across multiple libraries.
"""
# Enrichr libraries to query
libraries = [
"WikiPathways_2024_Human",
"Reactome_Pathways_2024",
"MSigDB_Hallmark_2020",
"GO_Biological_Process_2023",
"GO_Molecular_Function_2023",
"GO_Cellular_Component_2023",
"KEGG_2024_Human"
]
result = tu.tools.enrichr_gene_enrichment_analysis(
gene_list=gene_list,
libs=libraries
)
# Parse results - Enrichr returns pathway rankings with p-values
return result报告输出:
markdown
undefined2. Pathway & Functional Enrichment
2. 通路与功能富集
Top Enriched Pathways (p < 0.01, FDR < 0.05)
顶级富集通路(p < 0.01, FDR < 0.05)
Reactome Pathways
Reactome通路
| Pathway | Genes | p-value | FDR | Odds Ratio | Evidence |
|---|---|---|---|---|---|
| Cell Cycle Checkpoints | 12/18 | 1.2e-8 | 3.4e-6 | 15.3 | ★★★ |
| DNA Replication | 8/18 | 3.5e-6 | 4.2e-4 | 12.1 | ★★★ |
| G1/S Transition | 7/18 | 5.1e-5 | 2.1e-3 | 9.8 | ★★☆ |
Genes in pathway: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2
Interpretation: Strong enrichment in cell cycle control pathways suggests the screen identified proliferation-essential genes. These represent core cell cycle machinery.
| 通路 | 基因数 | p值 | FDR | 优势比 | 证据 |
|---|---|---|---|---|---|
| 细胞周期检查点 | 12/18 | 1.2e-8 | 3.4e-6 | 15.3 | ★★★ |
| DNA复制 | 8/18 | 3.5e-6 | 4.2e-4 | 12.1 | ★★★ |
| G1/S转换 | 7/18 | 5.1e-5 | 2.1e-3 | 9.8 | ★★☆ |
通路中的基因: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2
解读: 细胞周期调控通路的强富集表明,筛选识别出了增殖必需基因。这些基因代表核心细胞周期机制。
GO Biological Process
GO生物过程
| Term | Genes | p-value | FDR | Evidence |
|---|---|---|---|---|
| DNA replication initiation | 6/18 | 2.1e-7 | 1.5e-5 | ★★★ |
| G1/S transition of mitotic cell cycle | 8/18 | 8.3e-7 | 3.2e-5 | ★★★ |
| regulation of cyclin-dependent protein kinase activity | 5/18 | 1.2e-4 | 8.9e-3 | ★★☆ |
| 术语 | 基因数 | p值 | FDR | 证据 |
|---|---|---|---|---|
| DNA复制起始 | 6/18 | 2.1e-7 | 1.5e-5 | ★★★ |
| 有丝分裂细胞周期的G1/S转换 | 8/18 | 8.3e-7 | 3.2e-5 | ★★★ |
| 细胞周期蛋白依赖性激酶活性调控 | 5/18 | 1.2e-4 | 8.9e-3 | ★★☆ |
MSigDB Hallmark Gene Sets
MSigDB特征基因集
| Hallmark | Genes | p-value | FDR | Evidence |
|---|---|---|---|---|
| E2F Targets | 10/18 | 6.7e-10 | 1.2e-8 | ★★★ |
| G2M Checkpoint | 9/18 | 3.4e-8 | 2.1e-6 | ★★★ |
| MYC Targets V1 | 7/18 | 2.1e-5 | 9.8e-4 | ★★☆ |
Key Finding: Hits converge on E2F/RB pathway, suggesting screen successfully identified proliferation machinery. This is expected for dropout screens in proliferating cancer cells.
Source: Enrichr via
enrichr_gene_enrichment_analysis| 特征集 | 基因数 | p值 | FDR | 证据 |
|---|---|---|---|---|
| E2F靶点 | 10/18 | 6.7e-10 | 1.2e-8 | ★★★ |
| G2M检查点 | 9/18 | 3.4e-8 | 2.1e-6 | ★★★ |
| MYC靶点V1 | 7/18 | 2.1e-5 | 9.8e-4 | ★★☆ |
关键发现: 候选靶点集中在E2F/RB通路,表明筛选成功识别出增殖机制相关基因。这在增殖性癌细胞的dropout筛选中符合预期。
来源: Enrichr via
enrichr_gene_enrichment_analysisNo Significant Enrichment
无显著富集
GO Molecular Function: No terms pass FDR < 0.05
KEGG Pathways: Marginal enrichment (p < 0.05) but does not survive multiple testing correction
Interpretation: Gene list may be heterogeneous or represent diverse biological processes. Consider sub-clustering analysis.
---GO分子功能: 无术语通过FDR < 0.05的阈值
KEGG通路: 存在边缘富集(p < 0.05)但未通过多重检验校正
解读: 基因列表可能具有异质性,或代表多种生物学过程。建议考虑亚聚类分析。
---PATH 3: Protein-Protein Interaction Networks
路径3: 蛋白质-蛋白质相互作用网络
Build PPI Network
构建PPI网络
python
def build_ppi_network(tu, gene_list):
"""
Construct protein interaction network for hit genes.
"""
# Use STRING for comprehensive PPI data
ppi_result = tu.tools.STRING_get_protein_interactions(
protein_ids=gene_list,
species=9606 # Human
)
# Also check IntAct for curated interactions
interactions = []
for gene in gene_list:
# Get UniProt ID first
uniprot = resolve_gene_to_uniprot(tu, gene)
if uniprot:
intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
interactions.append(intact_result)
return {
'string': ppi_result,
'intact': interactions
}
def identify_protein_complexes(ppi_data):
"""
Identify protein complexes from PPI network.
Could use complex detection algorithms or query Complex Portal.
"""
# Implementation for complex detection
passOutput for Report:
markdown
undefinedpython
def build_ppi_network(tu, gene_list):
"""
Construct protein interaction network for hit genes.
"""
# Use STRING for comprehensive PPI data
ppi_result = tu.tools.STRING_get_protein_interactions(
protein_ids=gene_list,
species=9606 # Human
)
# Also check IntAct for curated interactions
interactions = []
for gene in gene_list:
# Get UniProt ID first
uniprot = resolve_gene_to_uniprot(tu, gene)
if uniprot:
intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
interactions.append(intact_result)
return {
'string': ppi_result,
'intact': interactions
}
def identify_protein_complexes(ppi_data):
"""
Identify protein complexes from PPI network.
Could use complex detection algorithms or query Complex Portal.
"""
# Implementation for complex detection
pass报告输出:
markdown
undefined3. Protein Interaction Network Analysis
3. 蛋白质相互作用网络分析
Network Statistics
网络统计
- Nodes: 45 proteins (from 45 input genes)
- Edges: 128 interactions (STRING combined score > 0.4)
- Network Density: 0.063
- Average Clustering Coefficient: 0.45
- Hub Genes (>10 interactions): CDK2, RB1, E2F1, CCNE1
Interpretation: High clustering coefficient indicates genes are functionally related and form coherent protein complexes.
Source: STRING via
STRING_get_protein_interactions- 节点: 45个蛋白质(来自45个输入基因)
- 边: 128个相互作用(STRING综合评分 > 0.4)
- 网络密度: 0.063
- 平均聚类系数: 0.45
- 枢纽基因(>10个相互作用): CDK2, RB1, E2F1, CCNE1
解读: 高聚类系数表明这些基因功能相关,形成连贯的蛋白质复合物。
来源: STRING via
STRING_get_protein_interactionsProtein Complexes Identified
识别出的蛋白质复合物
| Complex | Members | Function | Essential? |
|---|---|---|---|
| MCM Complex | MCM2, MCM3, MCM4, MCM5, MCM6, MCM7 | DNA replication helicase | Yes (pan-cancer) |
| Cyclin E-CDK2 | CCNE1, CCNE2, CDK2 | G1/S transition kinase | Yes (selective) |
| E2F/DP/RB | E2F1, E2F2, E2F3, RB1, TFDP1 | Transcription regulation | Yes (context-dependent) |
Key Finding: Screen hit multiple members of the same essential complexes. This provides validation (independent hits in same pathway) and suggests complex-level vulnerability.
Source: Complex Portal annotations + STRING clustering
| 复合物 | 成员 | 功能 | 是否必需? |
|---|---|---|---|
| MCM复合物 | MCM2, MCM3, MCM4, MCM5, MCM6, MCM7 | DNA复制解旋酶 | 是(泛癌) |
| Cyclin E-CDK2 | CCNE1, CCNE2, CDK2 | G1/S转换激酶 | 是(选择性) |
| E2F/DP/RB | E2F1, E2F2, E2F3, RB1, TFDP1 | 转录调控 | 是(背景依赖性) |
关键发现: 筛选命中了同一必需复合物的多个成员。这为结果提供了验证(同一通路中的独立命中),表明复合物层面存在脆弱性。
来源: Complex Portal注释 + STRING聚类
Synthetic Lethal Candidates
合成致死候选靶点
Based on PPI network and literature:
| Gene A (Hit) | Gene B (Candidate) | Relationship | Evidence | Source |
|---|---|---|---|---|
| RB1 | ARID1A | Synthetic lethal | ★★☆ | PMID:29534788 |
| KRAS | STK11 | Synthetic lethal | ★★★ | PMID:31010833 |
Recommendation: Test synthetic lethal candidates (Gene B) for combination therapy with inhibitors of Gene A.
---基于PPI网络和文献:
| 候选靶点A(命中基因) | 候选靶点B(待验证) | 关系 | 证据 | 来源 |
|---|---|---|---|---|
| RB1 | ARID1A | 合成致死 | ★★☆ | PMID:29534788 |
| KRAS | STK11 | 合成致死 | ★★★ | PMID:31010833 |
建议: 测试合成致死候选靶点(基因B)与基因A抑制剂的联合治疗效果。
---PATH 4: Druggability & Target Assessment
路径4: 成药性与靶点评估
Assess Drug Target Potential
评估药物靶点潜力
python
def assess_druggability(tu, gene_list):
"""
Evaluate druggability of hit genes.
"""
drug_targets = []
for gene in gene_list:
# Check Pharos for target development level
pharos = tu.tools.Pharos_get_target(gene=gene)
# Check DGIdb for existing drugs
dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])
# Check Open Targets for chemical probes
ensembl_id = resolve_gene_to_ensembl(tu, gene)
if ensembl_id:
probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
ensemblId=ensembl_id
)
# Check clinical trials
trials = tu.tools.search_clinical_trials(
intervention=gene,
pageSize=20
)
drug_targets.append({
'gene': gene,
'pharos_tdl': pharos.get('data', {}).get('tdl'),
'existing_drugs': dgidb,
'chemical_probes': probes,
'clinical_trials': trials
})
return drug_targetsOutput for Report:
markdown
undefinedpython
def assess_druggability(tu, gene_list):
"""
Evaluate druggability of hit genes.
"""
drug_targets = []
for gene in gene_list:
# Check Pharos for target development level
pharos = tu.tools.Pharos_get_target(gene=gene)
# Check DGIdb for existing drugs
dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])
# Check Open Targets for chemical probes
ensembl_id = resolve_gene_to_ensembl(tu, gene)
if ensembl_id:
probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
ensemblId=ensembl_id
)
# Check clinical trials
trials = tu.tools.search_clinical_trials(
intervention=gene,
pageSize=20
)
drug_targets.append({
'gene': gene,
'pharos_tdl': pharos.get('data', {}).get('tdl'),
'existing_drugs': dgidb,
'chemical_probes': probes,
'clinical_trials': trials
})
return drug_targets报告输出:
markdown
undefined4. Druggability & Clinical Target Assessment
4. 成药性与临床靶点评估
Target Development Level Classification (Pharos)
靶点开发层级分类(Pharos)
| TDL | Count | Genes | Interpretation |
|---|---|---|---|
| Tclin | 5 | EGFR, KRAS, CDK2, HDAC1, AURKA | Approved drug targets |
| Tchem | 8 | WEE1, PLK1, CHEK1, ... | Chemical matter available, druggable |
| Tbio | 12 | E2F1, RB1, ... | Biologically characterized, may need novel modalities |
| Tdark | 3 | GENE_X, GENE_Y, GENE_Z | Understudied, limited tool compounds |
Priority Ranking: Tclin > Tchem > Tbio for near-term drug development feasibility.
Source: Pharos/TCRD via
Pharos_get_target| TDL | 数量 | 基因 | 解读 |
|---|---|---|---|
| Tclin | 5 | EGFR, KRAS, CDK2, HDAC1, AURKA | 已获批药物靶点 |
| Tchem | 8 | WEE1, PLK1, CHEK1, ... | 有化学物质可用,可靶向 |
| Tbio | 12 | E2F1, RB1, ... | 已进行生物学表征,可能需要新的靶向方式 |
| Tdark | 3 | GENE_X, GENE_Y, GENE_Z | 研究不足,工具化合物有限 |
优先级排序: Tclin > Tchem > Tbio(从近期药物开发可行性角度)
来源: Pharos/TCRD via
Pharos_get_targetApproved Drugs & Clinical Tools
已获批药物与临床工具
| Gene | Drug(s) | Status | Indication | Source |
|---|---|---|---|---|
| EGFR | Erlotinib, Gefitinib, Osimertinib | Approved | NSCLC | DGIdb |
| CDK2 | Dinaciclib | Phase 2 | Hematologic malignancies | ClinicalTrials.gov |
| AURKA | Alisertib | Phase 3 | Lymphoma | ClinicalTrials.gov |
| WEE1 | Adavosertib | Phase 2 | Solid tumors | ClinicalTrials.gov |
Clinical Readiness: 5 genes have approved/late-stage drugs. These represent immediate repurposing opportunities.
Sources: DGIdb via , ClinicalTrials.gov
DGIdb_get_drug_gene_interactions| 基因 | 药物 | 状态 | 适应症 | 来源 |
|---|---|---|---|---|
| EGFR | 厄洛替尼、吉非替尼、奥希替尼 | 已获批 | 非小细胞肺癌 | DGIdb |
| CDK2 | Dinaciclib | 2期 | 血液系统恶性肿瘤 | ClinicalTrials.gov |
| AURKA | Alisertib | 3期 | 淋巴瘤 | ClinicalTrials.gov |
| WEE1 | Adavosertib | 2期 | 实体瘤 | ClinicalTrials.gov |
临床就绪度: 5个基因拥有获批/后期临床试验药物。这些代表了直接的药物重定位机会。
来源: DGIdb via , ClinicalTrials.gov
DGIdb_get_drug_gene_interactionsChemical Probes Available
可用化学探针
| Gene | Probe | Potency | Selectivity | Use | Source |
|---|---|---|---|---|---|
| CDK2 | Roscovitine | IC50 ~200nM | Moderate (pan-CDK) | Tool compound | SGC/Open Targets |
| HDAC1 | SAHA (Vorinostat) | IC50 ~10nM | Pan-HDAC | Approved drug, research tool | ChEMBL |
Source: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblId| 基因 | 探针 | 效力 | 选择性 | 用途 | 来源 |
|---|---|---|---|---|---|
| CDK2 | Roscovitine | IC50 ~200nM | 中等(泛CDK) | 工具化合物 | SGC/Open Targets |
| HDAC1 | SAHA(伏立诺他) | IC50 ~10nM | 泛HDAC | 获批药物、研究工具 | ChEMBL |
来源: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblIdNon-Druggable Hits Requiring Alternative Strategies
非可靶向候选靶点的替代策略
| Gene | Challenge | Recommended Approach |
|---|---|---|
| E2F1 | Transcription factor (no catalytic domain) | PROTACs, molecular glue degraders |
| RB1 | Tumor suppressor (loss-of-function) | Synthetic lethal approach (e.g., CDK4/6i) |
| MCM2 | Part of large complex, no pockets | Indirect targeting via cell cycle inhibitors |
Validation Priority: Focus on Tclin/Tchem hits with existing tool compounds for faster validation.
---| 基因 | 挑战 | 推荐方案 |
|---|---|---|
| E2F1 | 转录因子(无催化结构域) | PROTAC、分子胶降解剂 |
| RB1 | 肿瘤抑制因子(功能缺失) | 合成致死策略(如CDK4/6抑制剂) |
| MCM2 | 大型复合物的一部分,无结合口袋 | 通过细胞周期抑制剂间接靶向 |
验证优先级: 优先关注有现有工具化合物的Tclin/Tchem靶点,以加快验证速度。
---PATH 5: Disease Association & Clinical Relevance
路径5: 疾病关联与临床相关性
Cancer Genomics Integration
癌症基因组整合
python
def assess_clinical_relevance(tu, gene_list, cancer_type):
"""
Evaluate clinical relevance of hits in target cancer type.
"""
clinical_data = []
for gene in gene_list:
ensembl_id = resolve_gene_to_ensembl(tu, gene)
if ensembl_id:
# Disease associations
diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
ensemblId=ensembl_id
)
# Mouse models
mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
ensemblId=ensembl_id
)
# COSMIC mutations (somatic alterations in cancer)
cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)
# GTEx expression (is it expressed in relevant tissue?)
gtex = tu.tools.GTEx_get_median_gene_expression(
gencode_id=ensembl_id,
operation="median"
)
clinical_data.append({
'gene': gene,
'diseases': diseases,
'mutations': cosmic,
'expression': gtex,
'mouse_models': mouse
})
return clinical_dataOutput for Report:
markdown
undefinedpython
def assess_clinical_relevance(tu, gene_list, cancer_type):
"""
Evaluate clinical relevance of hits in target cancer type.
"""
clinical_data = []
for gene in gene_list:
ensembl_id = resolve_gene_to_ensembl(tu, gene)
if ensembl_id:
# Disease associations
diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
ensemblId=ensembl_id
)
# Mouse models
mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
ensemblId=ensembl_id
)
# COSMIC mutations (somatic alterations in cancer)
cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)
# GTEx expression (is it expressed in relevant tissue?)
gtex = tu.tools.GTEx_get_median_gene_expression(
gencode_id=ensembl_id,
operation="median"
)
clinical_data.append({
'gene': gene,
'diseases': diseases,
'mutations': cosmic,
'expression': gtex,
'mouse_models': mouse
})
return clinical_data报告输出:
markdown
undefined5. Clinical Relevance & Disease Association
5. 临床相关性与疾病关联
Cancer Genomic Alterations (COSMIC)
癌症基因组改变(COSMIC)
| Gene | Mutation Frequency | Cancer Types (Top 3) | Alteration Type | Evidence |
|---|---|---|---|---|
| KRAS | 22% across all cancers | Pancreatic (90%), Colon (45%), Lung (32%) | Activating mutations | ★★★ |
| EGFR | 8% across all cancers | Lung (15%), Glioma (30%), Breast (2%) | Amplification, mutations | ★★★ |
| TP53 | 42% across all cancers | Universal | Loss-of-function | ★★★ |
Interpretation: High mutation frequency indicates gene is driver in those cancer types. CRISPR essentiality + genomic alteration = strong therapeutic rationale.
Source: COSMIC via
COSMIC_get_gene_mutations| 基因 | 突变频率 | 主要癌症类型(前3) | 改变类型 | 证据 |
|---|---|---|---|---|
| KRAS | 所有癌症中占22% | 胰腺癌(90%)、结肠癌(45%)、肺癌(32%) | 激活突变 | ★★★ |
| EGFR | 所有癌症中占8% | 肺癌(15%)、胶质瘤(30%)、乳腺癌(2%) | 扩增、突变 | ★★★ |
| TP53 | 所有癌症中占42% | 所有癌症 | 功能缺失 | ★★★ |
解读: 高突变频率表明该基因是这些癌症类型中的驱动基因。CRISPR必需性 + 基因组改变 = 强有力的治疗依据。
来源: COSMIC via
COSMIC_get_gene_mutationsExpression in Normal vs Tumor Tissue (GTEx/TCGA)
正常组织vs肿瘤组织中的表达(GTEx/TCGA)
| Gene | Normal Lung (median TPM) | Lung Tumor (TCGA) | Tumor/Normal Ratio | Therapeutic Window |
|---|---|---|---|---|
| EGFR | 8.5 | 45.3 | 5.3x | Moderate |
| AURKA | 2.1 | 18.7 | 8.9x | Good |
| RPS6 | 125.3 | 132.1 | 1.05x | Poor (housekeeping) |
Interpretation: Genes with >3x tumor/normal expression offer better therapeutic window. Housekeeping genes (e.g., ribosomal) show poor selectivity.
Sources: GTEx via , TCGA data
GTEx_get_median_gene_expression| 基因 | 正常肺组织(中位TPM) | 肺癌组织(TCGA) | 肿瘤/正常比值 | 治疗窗口 |
|---|---|---|---|---|
| EGFR | 8.5 | 45.3 | 5.3倍 | 中等 |
| AURKA | 2.1 | 18.7 | 8.9倍 | 良好 |
| RPS6 | 125.3 | 132.1 | 1.05倍 | 差(管家基因) |
解读: 肿瘤/正常表达比值>3倍的基因具有更好的治疗窗口。管家基因(如核糖体基因)选择性差。
来源: GTEx via , TCGA数据
GTEx_get_median_gene_expressionPrognostic/Predictive Biomarker Status
预后/预测生物标志物状态
| Gene | Biomarker Type | Cancer | Association | Evidence | Source |
|---|---|---|---|---|---|
| KRAS | Predictive (negative) | Colorectal | KRAS mut → anti-EGFR resistance | ★★★ | FDA label |
| EGFR | Predictive (positive) | NSCLC | EGFR mut → TKI response | ★★★ | FDA companion dx |
| ESR1 | Predictive (positive) | Breast | ESR1 expression → endocrine therapy | ★★★ | Clinical guidelines |
Clinical Impact: 3 genes are established biomarkers with FDA-approved tests. Targeting these genes has strong clinical precedent.
---| 基因 | 生物标志物类型 | 癌症 | 关联关系 | 证据 | 来源 |
|---|---|---|---|---|---|
| KRAS | 预测性(阴性) | 结直肠癌 | KRAS突变→抗EGFR治疗耐药 | ★★★ | FDA标签 |
| EGFR | 预测性(阳性) | 非小细胞肺癌 | EGFR突变→TKI应答 | ★★★ | FDA伴随诊断 |
| ESR1 | 预测性(阳性) | 乳腺癌 | ESR1表达→内分泌治疗应答 | ★★★ | 临床指南 |
临床影响: 3个基因是已确立的生物标志物,拥有FDA批准的检测方法。靶向这些基因具有坚实的临床先例。
---PATH 6: Hit Prioritization & Validation Strategy
路径6: 候选靶点优先级排序与验证策略
Integrate All Evidence Dimensions
整合所有证据维度
python
def calculate_priority_score(gene_data):
"""
Calculate multi-dimensional priority score.
Components:
- Essentiality strength (DepMap score)
- Selectivity (tissue-specific vs pan-cancer)
- Druggability (Pharos TDL, existing compounds)
- Clinical relevance (mutations, expression, biomarkers)
- Validation feasibility (tool compounds available)
Returns score 0-100
"""
score = 0
# Essentiality (0-30 points)
if gene_data['depmap_score'] < -1.0:
score += 30
elif gene_data['depmap_score'] < -0.5:
score += 20
else:
score += 10
# Selectivity (0-25 points)
if gene_data['selective']: # Tissue-specific
score += 25
elif gene_data['pan_cancer']: # Pan-cancer (deprioritize)
score += 5
# Druggability (0-25 points)
if gene_data['pharos_tdl'] == 'Tclin':
score += 25
elif gene_data['pharos_tdl'] == 'Tchem':
score += 20
elif gene_data['pharos_tdl'] == 'Tbio':
score += 10
else:
score += 5
# Clinical relevance (0-20 points)
if gene_data['mutation_frequency'] > 20:
score += 10
if gene_data['biomarker_status']:
score += 10
return scoreOutput for Report:
markdown
undefinedpython
def calculate_priority_score(gene_data):
"""
Calculate multi-dimensional priority score.
Components:
- Essentiality strength (DepMap score)
- Selectivity (tissue-specific vs pan-cancer)
- Druggability (Pharos TDL, existing compounds)
- Clinical relevance (mutations, expression, biomarkers)
- Validation feasibility (tool compounds available)
Returns score 0-100
"""
score = 0
# Essentiality (0-30 points)
if gene_data['depmap_score'] < -1.0:
score += 30
elif gene_data['depmap_score'] < -0.5:
score += 20
else:
score += 10
# Selectivity (0-25 points)
if gene_data['selective']: # Tissue-specific
score += 25
elif gene_data['pan_cancer']: # Pan-cancer (deprioritize)
score += 5
# Druggability (0-25 points)
if gene_data['pharos_tdl'] == 'Tclin':
score += 25
elif gene_data['pharos_tdl'] == 'Tchem':
score += 20
elif gene_data['pharos_tdl'] == 'Tbio':
score += 10
else:
score += 5
# Clinical relevance (0-20 points)
if gene_data['mutation_frequency'] > 20:
score += 10
if gene_data['biomarker_status']:
score += 10
return score报告输出:
markdown
undefined6. Hit Prioritization & Validation Recommendations
6. 候选靶点优先级排序与验证建议
Top 10 Priority Targets (Multi-Dimensional Scoring)
前10个优先级靶点(多维度评分)
| Rank | Gene | Essentiality | Selectivity | Druggability | Clinical | Total Score | Recommendation |
|---|---|---|---|---|---|---|---|
| 1 | KRAS | 30/30 | 25/25 | 20/25 | 20/20 | 95/100 | High priority, validated drugs available |
| 2 | EGFR | 28/30 | 24/25 | 25/25 | 18/20 | 95/100 | High priority, approved drugs |
| 3 | WEE1 | 26/30 | 23/25 | 20/25 | 12/20 | 81/100 | Medium-high, Phase 2 drug available |
| 4 | AURKA | 24/30 | 22/25 | 20/25 | 14/20 | 80/100 | Medium-high, tool compounds exist |
| 5 | CDK2 | 25/30 | 20/25 | 20/25 | 10/20 | 75/100 | Medium, multiple tool compounds |
| 6 | CHEK1 | 23/30 | 21/25 | 18/25 | 10/20 | 72/100 | Medium, chemical probes available |
| 7 | PLK1 | 22/30 | 20/25 | 18/25 | 11/20 | 71/100 | Medium, clinical tool compounds |
| 8 | E2F1 | 24/30 | 22/25 | 10/25 | 12/20 | 68/100 | Medium-low, requires degrader strategy |
| 9 | HDAC1 | 20/30 | 18/25 | 25/25 | 8/20 | 71/100 | Medium, approved HDAC inhibitors |
| 10 | MCM2 | 28/30 | 10/25 | 5/25 | 8/20 | 51/100 | Low, pan-cancer essential, not druggable |
Scoring Rubric:
- Essentiality (30 pts): DepMap gene effect score magnitude
- Selectivity (25 pts): Tissue-specific vs pan-cancer dependency
- Druggability (25 pts): Pharos TDL, existing compounds, tractability
- Clinical (20 pts): Mutation frequency, biomarker status, expression
Priority Tiers:
- Tier 1 (Score >80): Immediate validation, existing tools/drugs available
- Tier 2 (Score 60-80): Medium priority, validation feasible with chemical probes
- Tier 3 (Score <60): Lower priority or requires novel approaches (PROTACs, etc.)
| 排名 | 基因 | 必需性 | 选择性 | 成药性 | 临床相关性 | 总分 | 建议 |
|---|---|---|---|---|---|---|---|
| 1 | KRAS | 30/30 | 25/25 | 20/25 | 20/20 | 95/100 | 高优先级,有已验证药物可用 |
| 2 | EGFR | 28/30 | 24/25 | 25/25 | 18/20 | 95/100 | 高优先级,有获批药物 |
| 3 | WEE1 | 26/30 | 23/25 | 20/25 | 12/20 | 81/100 | 中高优先级,有2期药物可用 |
| 4 | AURKA | 24/30 | 22/25 | 20/25 | 14/20 | 80/100 | 中高优先级,有工具化合物可用 |
| 5 | CDK2 | 25/30 | 20/25 | 20/25 | 10/20 | 75/100 | 中优先级,有多种工具化合物 |
| 6 | CHEK1 | 23/30 | 21/25 | 18/25 | 10/20 | 72/100 | 中优先级,有化学探针可用 |
| 7 | PLK1 | 22/30 | 20/25 | 18/25 | 11/20 | 71/100 | 中优先级,有临床工具化合物 |
| 8 | E2F1 | 24/30 | 22/25 | 10/25 | 12/20 | 68/100 | 中低优先级,需要降解剂策略 |
| 9 | HDAC1 | 20/30 | 18/25 | 25/25 | 8/20 | 71/100 | 中优先级,有获批HDAC抑制剂 |
| 10 | MCM2 | 28/30 | 10/25 | 5/25 | 8/20 | 51/100 | 低优先级,泛癌必需,不可靶向 |
评分规则:
- 必需性(30分): DepMap基因效应值的绝对值
- 选择性(25分): 组织特异性vs泛癌依赖性
- 成药性(25分): Pharos TDL、现有化合物、可靶向性
- 临床相关性(20分): 突变频率、生物标志物状态、表达情况
优先级层级:
- 1级(得分>80): 立即验证,有现有工具/药物可用
- 2级(得分60-80): 中优先级,可用化学探针进行验证
- 3级(得分<60): 低优先级或需要新方法(如PROTAC等)
Validation Experiment Recommendations
验证实验建议
Tier 1 Targets (KRAS, EGFR, WEE1)
1级靶点(KRAS、EGFR、WEE1)
1. KRAS
- Essentiality: Strong selective dependency in KRAS-mutant cancers
- Validation Approach:
- Test KRAS G12C inhibitor (sotorasib/adagrasib) in KRAS G12C-mutant cell lines from screen
- Orthogonal validation: siRNA/shRNA knockdown
- Rescue experiment: Re-express WT KRAS in KO cells
- Expected Outcome: Growth inhibition/cell death in KRAS-mutant lines only
- Tool Compounds: Sotorasib (AMG 510), Adagrasib (MRTX849), MRTX1133 (pan-KRAS)
- Timeline: 2-3 weeks for cell line validation
2. EGFR
- Essentiality: Selective in EGFR-mutant/amplified NSCLC, glioblastoma
- Validation Approach:
- Test EGFR TKI panel (erlotinib, osimertinib) in screen cell lines
- Dose-response curves to establish IC50
- Combination with standard chemotherapy
- Expected Outcome: Potent inhibition in EGFR-altered lines
- Tool Compounds: Erlotinib, Gefitinib, Osimertinib (all FDA-approved)
- Timeline: 1-2 weeks
3. WEE1
- Essentiality: Synthetic lethal with TP53 loss, selective in TP53-mutant cancers
- Validation Approach:
- Test adavosertib (WEE1 inhibitor) ± DNA damaging agents
- Stratify by TP53 status (mutant vs WT)
- Cell cycle analysis (premature mitotic entry)
- Expected Outcome: Selective killing of TP53-mutant cells + synergy with chemo
- Tool Compounds: Adavosertib (AZD1775), PD-166285
- Timeline: 2-3 weeks
1. KRAS
- 必需性: 在KRAS突变型癌症中具有强选择性依赖性
- 验证方法:
- 在筛选得到的KRAS G12C突变型细胞系中测试KRAS G12C抑制剂(sotorasib/adagrasib)
- 正交验证:siRNA/shRNA敲低
- 拯救实验:在敲除细胞中重新表达野生型KRAS
- 预期结果: 仅在KRAS突变型细胞系中出现生长抑制/细胞死亡
- 工具化合物: Sotorasib(AMG 510)、Adagrasib(MRTX849)、MRTX1133(泛KRAS)
- 时间线: 2-3周的细胞系验证
2. EGFR
- 必需性: 在EGFR突变/扩增的非小细胞肺癌、胶质母细胞瘤中具有选择性
- 验证方法:
- 在筛选细胞系中测试EGFR TKI面板(厄洛替尼、奥希替尼)
- 绘制剂量反应曲线以确定IC50
- 与标准化疗联合测试
- 预期结果: 在EGFR改变的细胞系中出现强效抑制
- 工具化合物: 厄洛替尼、吉非替尼、奥希替尼(均为FDA获批)
- 时间线: 1-2周
3. WEE1
- 必需性: 与TP53缺失具有合成致死性,在TP53突变型癌症中具有选择性
- 验证方法:
- 测试adavosertib(WEE1抑制剂)± DNA损伤剂
- 按TP53状态分层(突变型vs野生型)
- 细胞周期分析(提前进入有丝分裂)
- 预期结果: 选择性杀伤TP53突变型细胞 + 与化疗协同
- 工具化合物: Adavosertib(AZD1775)、PD-166285
- 时间线: 2-3周
Tier 2 Targets (AURKA, CDK2, CHEK1, PLK1)
2级靶点(AURKA、CDK2、CHEK1、PLK1)
General Strategy:
- Pharmacological validation with 2-3 selective inhibitors per target
- Orthogonal genetic validation (CRISPRi/shRNA)
- Pathway analysis (Western blots for downstream effectors)
- Combination screens with standard-of-care agents
Recommended Tool Compounds:
- AURKA: Alisertib (MLN8237), Aurora A Inhibitor I
- CDK2: Roscovitine (seliciclib), Dinaciclib
- CHEK1: Prexasertib (LY2606368), AZD7762
- PLK1: Volasertib (BI 6727), BI 2536
通用策略:
- 每个靶点使用2-3种选择性抑制剂进行药理学验证
- 正交遗传验证(CRISPRi/shRNA)
- 通路分析(下游效应因子的Western blot)
- 与标准治疗药物的联合筛选
推荐工具化合物:
- AURKA: Alisertib(MLN8237)、Aurora A抑制剂I
- CDK2: Roscovitine(seliciclib)、Dinaciclib
- CHEK1: Prexasertib(LY2606368)、AZD7762
- PLK1: Volasertib(BI 6727)、BI 2536
Tier 3 Targets - Alternative Validation Strategies
3级靶点 - 替代验证策略
E2F1, RB1 (Transcription factors):
- Challenge: No direct small molecule inhibitors
- Strategy:
- Test PROTACs if available
- Indirect validation via upstream targets (CDK4/6 inhibitors for RB pathway)
- Genetic validation only (CRISPRko, CRISPRi)
MCM2-7 Complex (Helicase, pan-essential):
- Challenge: Pan-cancer essential, poor therapeutic window
- Strategy: Deprioritize for drug development
- Note: Interesting for understanding replication biology, but not ideal therapeutic target
E2F1、RB1(转录因子):
- 挑战: 无直接小分子抑制剂
- 策略:
- 若有PROTAC则进行测试
- 通过上游靶点间接验证(如针对RB通路的CDK4/6抑制剂)
- 仅进行遗传验证(CRISPRko、CRISPRi)
MCM2-7复合物(解旋酶,泛必需):
- 挑战: 泛癌必需,治疗窗口差
- 策略: 优先排除在药物开发之外
- 注意: 对理解复制生物学有意义,但不是理想的治疗靶点
Validation Timeline
验证时间线
| Phase | Duration | Experiments | Deliverable |
|---|---|---|---|
| Phase 1 | Weeks 1-3 | Tier 1 target validation (KRAS, EGFR, WEE1) | Dose-response curves, potency data |
| Phase 2 | Weeks 4-6 | Tier 2 target validation (AURKA, CDK2, etc.) | Hit confirmation, selectivity data |
| Phase 3 | Weeks 7-10 | Mechanism studies, pathway analysis | Western blots, cell cycle, apoptosis |
| Phase 4 | Weeks 11-14 | Combination studies, in vivo pilot (top 2-3) | Synergy matrices, xenograft data |
| 阶段 | 时长 | 实验 | 交付物 |
|---|---|---|---|
| 阶段1 | 第1-3周 | 1级靶点验证(KRAS、EGFR、WEE1) | 剂量反应曲线、效力数据 |
| 阶段2 | 第4-6周 | 2级靶点验证(AURKA、CDK2等) | 命中确认、选择性数据 |
| 阶段3 | 第7-10周 | 机制研究、通路分析 | Western blot、细胞周期、凋亡数据 |
| 阶段4 | 第11-14周 | 联合研究、体内试点(前2-3个靶点) | 协同矩阵、异种移植数据 |
Success Criteria for Validation
验证成功标准
✅ Hit Confirmed if:
- Pharmacological inhibition phenocopies CRISPR knockout (≥50% growth inhibition at ≤1 µM)
- Dose-response curve shows IC50 consistent with essentiality score
- Effect is selective (active in screen cell line, inactive in control lines)
- Orthogonal genetic methods (siRNA, CRISPRi) reproduce phenotype
❌ Hit Rejected/Deprioritized if:
- Tool compounds show no effect despite strong CRISPR score (off-target CRISPR effect)
- Pan-cancer essential with no selectivity (poor therapeutic window)
- No druggable domain/strategy (TFs, scaffolds without chemical matter)
- Cannot be validated with available reagents
✅ 命中确认 若:
- 药理学抑制模拟CRISPR敲除效果(≤1 µM时生长抑制≥50%)
- 剂量反应曲线的IC50与必需性评分一致
- 效应具有选择性(在筛选细胞系中有效,在对照细胞系中无效)
- 正交遗传方法(siRNA、CRISPRi)可重现表型
❌ 命中被拒绝/降优先级 若:
- 工具化合物无效果,尽管CRISPR评分很高(CRISPR脱靶效应)
- 泛癌必需且无选择性(治疗窗口差)
- 无可靶向结构域/策略(转录因子、无化学结合位点的支架蛋白)
- 无法用现有试剂验证
Resource Requirements
资源需求
Reagents:
- Chemical compounds: $5-10K (tool compounds, commercial inhibitors)
- CRISPRi/shRNA validation: $3-5K (vectors, reagents)
- Cell culture & assays: $8-12K (plates, reagents, media)
Equipment:
- Cell culture facility (BSL2)
- Plate readers (viability assays, luminescence)
- Flow cytometer (cell cycle, apoptosis)
- Western blot equipment
Personnel: 1 postdoc + 1 technician for 3-4 months
Validation Strategy Summary: Focus validation efforts on Tier 1 targets (KRAS, EGFR, WEE1) with approved/late-stage drugs. These offer fastest path to clinical translation and have highest probability of success based on multi-dimensional scoring.
---试剂:
- 化学化合物: 5-10千美元(工具化合物、商用抑制剂)
- CRISPRi/shRNA验证: 3-5千美元(载体、试剂)
- 细胞培养与实验: 8-12千美元(培养板、试剂、培养基)
设备:
- 细胞培养设施(BSL2)
- 酶标仪(活力实验、发光检测)
- 流式细胞仪(细胞周期、凋亡)
- Western blot设备
人员: 1名博士后 + 1名技术员,耗时3-4个月
验证策略总结: 集中验证资源在1级靶点(KRAS、EGFR、WEE1),这些靶点拥有获批/后期临床试验药物。基于多维度评分,这些靶点转化为临床应用的速度最快,成功概率最高。
---Report Template (Initial File)
报告模板(初始文件)
File:
CRISPR_screen_analysis_[CONTEXT].mdmarkdown
undefined文件:
CRISPR_screen_analysis_[CONTEXT].mdmarkdown
undefinedCRISPR Screen Analysis Report: [CONTEXT]
CRISPR筛选分析报告: [背景信息]
Generated: [Date]
Input: [Gene list / Cancer type / Single gene]
Context: [Screen type, cell line, experimental details]
Status: In Progress
生成日期: [日期]
输入: [基因列表 / 癌症类型 / 单个基因]
背景: [筛选类型、细胞系、实验细节]
状态: 进行中
Executive Summary
执行摘要
[Analyzing...]
<!-- Will contain: key findings, top hits, recommended priorities -->
[分析中...]
<!-- 将包含:关键发现、顶级候选靶点、推荐优先级 -->
Input Validation
输入验证
[Analyzing...]
<!-- Gene symbol validation, invalid genes, suggestions -->
[分析中...]
<!-- 基因符号验证、无效基因、建议 -->
1. Gene Essentiality Analysis
1. 基因必需性分析
1.1 Strongly Essential Genes
1.1 强必需基因
[Analyzing...]
[分析中...]
1.2 Selectively Essential Genes
1.2 选择性必需基因
[Analyzing...]
[分析中...]
1.3 Weakly Essential / Non-Essential
1.3 弱必需 / 非必需基因
[Analyzing...]
[分析中...]
2. Pathway & Functional Enrichment
2. 通路与功能富集
2.1 Pathway Enrichment (Reactome, KEGG)
2.1 通路富集(Reactome、KEGG)
[Analyzing...]
[分析中...]
2.2 GO Enrichment (BP, MF, CC)
2.2 GO富集(生物过程、分子功能、细胞组分)
[Analyzing...]
[分析中...]
2.3 Hallmark Gene Sets (MSigDB)
2.3 特征基因集(MSigDB)
[Analyzing...]
[分析中...]
2.4 Pathway-Level Interpretation
2.4 通路层面解读
[Analyzing...]
[分析中...]
3. Protein Interaction Network
3. 蛋白质相互作用网络
3.1 Network Statistics
3.1 网络统计
[Analyzing...]
[分析中...]
3.2 Protein Complexes
3.2 蛋白质复合物
[Analyzing...]
[分析中...]
3.3 Hub Genes
3.3 枢纽基因
[Analyzing...]
[分析中...]
3.4 Synthetic Lethal Candidates
3.4 合成致死候选靶点
[Analyzing...]
[分析中...]
4. Druggability Assessment
4. 成药性评估
4.1 Target Development Level (Pharos)
4.1 靶点开发层级(Pharos)
[Analyzing...]
[分析中...]
4.2 Approved Drugs & Clinical Candidates
4.2 已获批药物与临床候选药物
[Analyzing...]
[分析中...]
4.3 Chemical Probes
4.3 化学探针
[Analyzing...]
[分析中...]
4.4 Non-Druggable Hits (Alternative Strategies)
4.4 非可靶向候选靶点(替代策略)
[Analyzing...]
[分析中...]
5. Clinical Relevance
5. 临床相关性
5.1 Cancer Genomic Alterations (COSMIC)
5.1 癌症基因组改变(COSMIC)
[Analyzing...]
[分析中...]
5.2 Expression in Tumor vs Normal
5.2 肿瘤vs正常组织中的表达
[Analyzing...]
[分析中...]
5.3 Prognostic/Predictive Biomarkers
5.3 预后/预测生物标志物
[Analyzing...]
[分析中...]
5.4 Mouse Models & Genetic Evidence
5.4 小鼠模型与遗传学证据
[Analyzing...]
[分析中...]
6. Hit Prioritization & Validation
6. 候选靶点优先级排序与验证
6.1 Multi-Dimensional Scoring
6.1 多维度评分
[Analyzing...]
[分析中...]
6.2 Top 10 Priority Targets
6.2 前10个优先级靶点
[Analyzing...]
[分析中...]
6.3 Validation Experiment Recommendations
6.3 验证实验建议
[Analyzing...]
[分析中...]
6.4 Validation Timeline & Resources
6.4 验证时间线与资源
[Analyzing...]
[分析中...]
7. Data Sources & Quality Control
7. 数据源与质量控制
7.1 Primary Data Sources
7.1 主要数据源
[Will be populated...]
[将填充...]
7.2 Tool Call Summary
7.2 工具调用汇总
[Will be populated...]
[将填充...]
7.3 Limitations & Caveats
7.3 局限性与注意事项
[Will be populated...]
[将填充...]
Appendix: Full Hit List
附录: 完整候选靶点列表
[Complete gene list with all scores...]
---[带所有评分的完整基因列表...]
---When NOT to Use This Skill
何时不使用本技能
- Drug target research (single gene) → Use skill instead
tooluniverse-target-research - Disease-centric query → Use skill
tooluniverse-disease-research - Chemical compound screening → Different workflow needed
- RNA-seq differential expression → Use differential expression analysis workflows
- Single gene lookup → Call DepMap tools directly
Use this skill when you have a gene list from a CRISPR screen and need comprehensive functional interpretation + target prioritization.
- 药物靶点研究(单个基因)→ 改用技能
tooluniverse-target-research - 疾病导向查询 → 改用技能
tooluniverse-disease-research - 化学化合物筛选 → 需要不同的工作流
- RNA-seq差异表达分析 → 使用差异表达分析工作流
- 单个基因查询 → 直接调用DepMap工具
当你拥有CRISPR筛选得到的基因列表,并需要进行全面的功能解读 + 靶点优先级排序时,使用本技能。
Example Queries That Trigger This Skill
触发本技能的示例查询
✅ Gene List Analysis:
- "Analyze these CRISPR screen hits: EGFR, KRAS, WEE1, PLK1, AURKA, ..."
- "I have 50 dropout genes from a CRISPR screen in lung cancer cells, what should I validate?"
- "Prioritize these genes for drug target development: [gene list]"
✅ Cancer Type Query:
- "What are the top essential genes in pancreatic cancer?"
- "Find druggable dependencies in triple-negative breast cancer"
✅ Single Gene Validation:
- "Is KRAS a good therapeutic target for lung cancer?"
- "Assess the druggability of WEE1 as a cancer target"
❌ Not Appropriate:
- "What is the function of EGFR?" → too broad, use target-research skill
- "Find drugs for lung cancer" → disease-centric, use drug-repurposing skill
- "Analyze this RNA-seq data" → different analytical workflow
✅ 基因列表分析:
- "分析这些CRISPR筛选候选靶点:EGFR, KRAS, WEE1, PLK1, AURKA, ..."
- "我有50个来自肺癌细胞CRISPR dropout筛选的基因,应该验证哪些?"
- "为药物靶点开发对这些基因进行优先级排序:[基因列表]"
✅ 癌症类型查询:
- "胰腺癌中的顶级必需基因是什么?"
- "寻找三阴性乳腺癌中的可靶向依赖性"
✅ 单个基因验证:
- "KRAS是肺癌的良好治疗靶点吗?"
- "评估WEE1作为癌症靶点的成药性"
❌ 不适用场景:
- "EGFR的功能是什么?" → 范围过广,使用target-research技能
- "寻找肺癌的治疗药物" → 疾病导向,使用药物重定位技能
- "分析这份RNA-seq数据" → 需要不同的分析工作流
Key Improvements from Existing Skills
与现有技能相比的关键改进
Based on patterns in and :
tooluniverse-target-researchtooluniverse-drug-research- Multi-dimensional scoring system (novel for CRISPR analysis)
- Validation experiment recommendations with timelines and reagents
- Tier-based prioritization (Tier 1/2/3 based on actionability)
- Tool compound suggestions for each druggable target
- Synthetic lethal candidate identification from PPI network
- Explicit selectivity analysis (pan-cancer vs tissue-selective)
- Success/failure criteria for validation experiments
基于和的使用模式:
tooluniverse-target-researchtooluniverse-drug-research- 多维度评分系统(CRISPR分析的创新点)
- 带时间线和试剂的验证实验建议
- 基于可行动性的层级优先级排序(1/2/3级)
- 每个可靶向靶点的工具化合物建议
- 从PPI网络识别合成致死候选靶点
- 明确的选择性分析(泛癌vs组织选择性)
- 验证实验的成功/失败标准
Quality Control Checklist
质量控制检查表
Before finalizing report:
- All input genes validated against DepMap registry
- Essentiality scores retrieved for all valid genes
- Pathway enrichment performed (minimum: GO BP, Reactome, Hallmark)
- PPI network constructed with interaction counts
- Druggability assessed for all hits (Pharos TDL + DGIdb)
- Top 10 priority targets table completed
- Validation recommendations provided for Tier 1 targets
- Evidence grades assigned (★★★, ★★☆, ★☆☆)
- All data sources cited explicitly
- "No data" explicitly stated when tools return empty results
- Executive summary synthesizes all findings
最终确定报告前:
- 所有输入基因已在DepMap注册表中验证
- 已为所有有效基因检索必需性评分
- 已进行通路富集分析(至少:GO生物过程、Reactome、Hallmark)
- 已构建带相互作用计数的PPI网络
- 已评估所有候选靶点的成药性(Pharos TDL + DGIdb)
- 已完成前10个优先级靶点表格
- 已为1级靶点提供验证建议
- 已分配证据等级(★★★, ★★☆, ★☆☆)
- 所有数据源已明确引用
- 工具返回空结果时已明确标注“无数据”
- 执行摘要已整合所有发现