tooluniverse-crispr-screen-analysis

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

CRISPR Screen Analysis Workflow

CRISPR筛选分析工作流

Systematic analysis of CRISPR knockout/activation/interference screens to identify essential genes, synthetic lethal interactions, and therapeutic targets.

KEY PRINCIPLES:

Report-first approach - Create comprehensive analysis report FIRST, then populate progressively
Evidence grading - Grade all findings by confidence level (H/M/L based on statistical significance and validation data)
Multi-dimensional analysis - Integrate essentiality, pathway context, druggability, and clinical relevance
Citation requirements - Every conclusion must trace to source data (DepMap, literature, pathways)
Mandatory completeness - All analysis sections must exist with data or explicit "No data" notes
Context-aware interpretation - Consider cell line context, screen type, and biological pathway redundancy

对CRISPR敲除/激活/干扰筛选进行系统性分析，以识别必需基因、合成致死相互作用及治疗靶点。

核心原则:

报告优先法 - 先创建完整的分析报告，再逐步填充内容
证据分级 - 所有发现按置信度分级（基于统计显著性和验证数据分为高/中/低，对应★★★/★★☆/★☆☆）
多维度分析 - 整合必需性、通路背景、成药性及临床相关性
引用要求 - 每个结论必须可追溯至源数据（DepMap、文献、通路数据库）
完整性强制要求 - 所有分析章节必须存在，无数据时需明确标注“无数据”
背景感知解读 - 考虑细胞系背景、筛选类型及生物通路冗余性

When to Use This Skill

何时使用本技能

Apply when users:

Have CRISPR screen hit lists (genes with significant phenotypes)
Need to prioritize CRISPR hits for validation
Want to identify essential genes for a specific cancer type
Need synthetic lethal interaction analysis
Ask "what are the top hits from my CRISPR screen?"
Need drug target prioritization from functional genomics data
Want pathway-level interpretation of screen results

当用户有以下需求时适用：

拥有CRISPR筛选的候选基因列表（具有显著表型的基因）
需要对CRISPR候选基因进行验证优先级排序
想要识别特定癌症类型的必需基因
需要进行合成致死相互作用分析
提问“我的CRISPR筛选中排名靠前的候选靶点是什么？”
需要从功能基因组数据中进行药物靶点优先级排序
想要对筛选结果进行通路层面的解读

⚠️ Known Issues & Workarounds

⚠️ 已知问题与解决方案

DepMap API Unavailability (2026-02-09)

DepMap API不可用（2026-02-09）

Issue: DepMap REST APIs (Sanger Cell Model Passports and Broad Institute) are currently non-operational.

Impact:

PATH 0 (Gene Validation): DepMap gene registry unavailable
PATH 1 (Essentiality Analysis): CRISPR dependency scores unavailable

Workaround: This skill now uses Pharos as fallback:

Gene validation via
```
Pharos_get_target()
```
Druggability assessment via TDL (Target Development Level) classification
TDL used as proxy for essentiality (Tclin targets are often essential)
Evidence grading: Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆

Data Quality Trade-off:

✅ Gene validation: 100% success rate (Pharos has comprehensive drug target coverage)
⚠️ Essentiality scores: Druggability-based proxy (TDL classification)
ℹ️ All findings labeled with source (Pharos vs DepMap)

Timeline: Permanent fix (CSV download) estimated 1-2 weeks. See

DEPMAP_ISSUE_ANALYSIS.md

for details.

问题: DepMap REST API（Sanger细胞模型数据库和Broad研究所）当前无法使用。

影响:

路径0（基因验证）：DepMap基因注册表不可用
路径1（必需性分析）：CRISPR依赖性评分不可用

解决方案: 本技能现在使用Pharos作为备选方案：

通过
```
Pharos_get_target()
```
进行基因验证
通过TDL（靶点开发层级）分类评估成药性
将TDL用作必需性的替代指标（Tclin靶点通常为必需基因）
证据分级：Tclin=★★★，Tchem=★★☆，Tbio/Tdark=★☆☆

数据质量权衡:

✅ 基因验证：100%成功率（Pharos覆盖全面的药物靶点）
⚠️ 必需性评分：基于成药性的替代指标（TDL分类）
ℹ️ 所有发现均标注数据源（Pharos或DepMap）

时间线: 永久修复方案（CSV下载）预计1-2周完成。详情见

DEPMAP_ISSUE_ANALYSIS.md

Input Types Supported

支持的输入类型

1. Gene List from User's Screen

1. 用户筛选得到的基因列表

Format: Gene symbols (e.g., EGFR, KRAS, TP53)
Minimum: 5 genes (for meaningful enrichment)
Optimal: 20-100 genes (hits from primary screen)
Context needed: Cancer type, screen type (dropout/enrichment), cell line used

格式: 基因符号（如EGFR、KRAS、TP53）
最低要求: 5个基因（以获得有意义的富集结果）
最优: 20-100个基因（初筛得到的候选靶点）
所需背景信息: 癌症类型、筛选类型（dropout/enrichment）、使用的细胞系

2. Cancer Type Query

2. 癌症类型查询

Format: Cancer type name (e.g., "non-small cell lung cancer", "breast cancer")
Workflow: Retrieve top essential genes for that cancer from DepMap
Output: Ranked target list with essentiality scores

格式: 癌症类型名称（如“非小细胞肺癌”、“乳腺癌”）
工作流: 从DepMap检索该癌症的顶级必需基因
输出: 带必需性评分的靶点排名列表

3. Gene of Interest

3. 目标基因查询

Format: Single gene symbol
Workflow: Analyze essentiality across cancer types, identify selective dependencies
Output: Target validation report with tissue specificity

格式: 单个基因符号
工作流: 分析该基因在不同癌症类型中的必需性，识别选择性依赖性
输出: 带组织特异性的靶点验证报告

Critical Workflow Requirements

关键工作流要求

1. Report-First Approach (MANDATORY)

1. 报告优先法（强制要求）

DO NOT show intermediate tool outputs. Instead:

Create report file FIRST before any analysis:
- File name:
```
CRISPR_screen_analysis_[CONTEXT].md
```
- Initialize with all section headers
- Add placeholder:
```
[Analyzing...]
```
  in each section
Progressively update as data arrives:
- Replace
```
[Analyzing...]
```
  with findings
- Include "No significant enrichment" when appropriate
- Document failed analyses explicitly
Final deliverable: Complete markdown report + optional plots (if user requests)

禁止展示中间工具输出，应遵循以下步骤：

先创建报告文件再进行任何分析：
- 文件名:
```
CRISPR_screen_analysis_[CONTEXT].md
```
- 初始化所有章节标题
- 在每个章节添加占位符:
```
[分析中...]
```
随着数据获取逐步更新：
- 用发现结果替换
```
[分析中...]
```
- 适当时标注“无显著富集”
- 明确记录失败的分析
最终交付物: 完整的markdown报告 + 可选的图表（若用户要求）

2. Evidence Grading System (MANDATORY)

2. 证据分级系统（强制要求）

Grade every finding by confidence level:

Level	Symbol	Criteria	Examples
HIGH	★★★	DepMap score <-1.0, p<0.01, validated in literature	Strong essential gene, clinical drug target
MEDIUM	★★☆	DepMap score -0.5 to -1.0, p<0.05, pathway coherence	Moderate dependency, pathway member
LOW	★☆☆	DepMap score >-0.5, marginal significance, weak validation	Weak hit, potential off-target

为每个发现按置信度分级：

级别	符号	标准	示例
高	★★★	DepMap评分<-1.0，p<0.01，有文献验证	强必需基因、临床药物靶点
中	★★☆	DepMap评分-0.5至-1.0，p<0.05，通路一致性良好	中度依赖性、通路成员
低	★☆☆	DepMap评分>-0.5，显著性较弱，验证证据不足	弱候选靶点、潜在脱靶效应

3. Contextualization Requirements

3. 背景关联要求

Every gene-level finding must include:

Essentiality score (DepMap gene effect)
Pan-cancer vs selective (is it essential in all cancers or specific subset?)
Druggability (existing drugs, chemical probes, tractability)
Pathway context (which pathways/complexes does it belong to?)
Clinical relevance (approved targets, ongoing trials, biomarkers)

每个基因层面的发现必须包含：

必需性评分（DepMap基因效应值）
泛癌vs选择性（是否在所有癌症中必需，还是仅在特定亚群中必需？）
成药性（现有药物、化学探针、可靶向性）
通路背景（属于哪些通路/复合物？）
临床相关性（已获批靶点、正在进行的临床试验、生物标志物）

Core Analysis Strategy: 7 Research Paths

核心分析策略：7个研究路径

User Input (gene list OR cancer type OR single gene)
│
├─ PATH 0: Input Processing & Validation
│   ├─ Validate gene symbols
│   ├─ Determine analysis mode
│   └─ Set context parameters
│
├─ PATH 1: Gene Essentiality Analysis (DepMap)
│   ├─ Query gene dependencies for each hit
│   ├─ Retrieve essentiality scores across cell lines
│   ├─ Calculate pan-cancer vs selective essentiality
│   └─ Rank genes by dependency strength
│
├─ PATH 2: Pathway & Functional Enrichment
│   ├─ GO enrichment (biological process, molecular function)
│   ├─ Pathway enrichment (Reactome, WikiPathways, KEGG)
│   ├─ Hallmark gene set enrichment (MSigDB)
│   └─ Identify pathway-level vulnerabilities
│
├─ PATH 3: Protein-Protein Interaction Networks
│   ├─ Build PPI network for hit genes
│   ├─ Identify protein complexes
│   ├─ Find synthetic lethal candidates
│   └─ Hub gene analysis
│
├─ PATH 4: Druggability & Target Assessment
│   ├─ Check existing drugs (DGIdb, ChEMBL)
│   ├─ Assess chemical tractability (Pharos TDL)
│   ├─ Find chemical probes (Open Targets)
│   └─ Clinical trial status (ClinicalTrials.gov)
│
├─ PATH 5: Disease Association & Clinical Relevance
│   ├─ Gene-disease associations (Open Targets)
│   ├─ Somatic mutations in cancer (COSMIC, cBioPortal)
│   ├─ Expression in patient samples (GTEx, TCGA)
│   └─ Prognostic/predictive biomarker status
│
└─ PATH 6: Hit Prioritization & Validation Guidance
    ├─ Integrate all evidence dimensions
    ├─ Calculate priority score (essentiality + druggability + clinical relevance)
    ├─ Recommend validation experiments
    └─ Identify top 5-10 targets for follow-up

用户输入（基因列表或癌症类型或单个基因）
│
├─ 路径0: 输入处理与验证
│   ├─ 验证基因符号
│   ├─ 确定分析模式
│   └─ 设置背景参数
│
├─ 路径1: 基因必需性分析（DepMap）
│   ├─ 查询每个候选靶点的基因依赖性
│   ├─ 检索跨细胞系的必需性评分
│   ├─ 计算泛癌vs选择性必需性
│   └─ 按依赖性强度对基因排名
│
├─ 路径2: 通路与功能富集
│   ├─ GO富集（生物过程、分子功能）
│   ├─ 通路富集（Reactome、WikiPathways、KEGG）
│   ├─ 特征基因集富集（MSigDB Hallmark）
│   └─ 识别通路层面的脆弱性
│
├─ 路径3: 蛋白质-蛋白质相互作用网络
│   ├─ 为候选基因构建PPI网络
│   ├─ 识别蛋白质复合物
│   ├─ 寻找合成致死候选靶点
│   └─ 枢纽基因分析
│
├─ 路径4: 成药性与靶点评估
│   ├─ 检查现有药物（DGIdb、ChEMBL）
│   ├─ 评估化学可靶向性（Pharos TDL）
│   ├─ 寻找化学探针（Open Targets）
│   └─ 临床试验状态（ClinicalTrials.gov）
│
├─ 路径5: 疾病关联与临床相关性
│   ├─ 基因-疾病关联（Open Targets）
│   ├─ 癌症中的体细胞突变（COSMIC、cBioPortal）
│   ├─ 患者样本中的表达情况（GTEx、TCGA）
│   └─ 预后/预测生物标志物状态
│
└─ 路径6: 候选靶点优先级排序与验证指导
    ├─ 整合所有证据维度
    ├─ 计算优先级评分（必需性+成药性+临床相关性）
    ├─ 推荐验证实验
    └─ 确定前5-10个待跟进的靶点

PATH 0: Input Processing & Validation

路径0: 输入处理与验证

Determine Analysis Mode

确定分析模式

python

def determine_analysis_mode(user_input):
    """
    Figure out what type of analysis to run.

    Returns: 'gene_list', 'cancer_type', or 'single_gene'
    """
    if isinstance(user_input, list) and len(user_input) >= 5:
        return 'gene_list'  # User provided hits from their screen
    elif isinstance(user_input, str) and len(user_input.split()) > 1:
        return 'cancer_type'  # User asks about a cancer type
    else:
        return 'single_gene'  # Single gene target validation

python

def determine_analysis_mode(user_input):
    """
    Figure out what type of analysis to run.

    Returns: 'gene_list', 'cancer_type', or 'single_gene'
    """
    if isinstance(user_input, list) and len(user_input) >= 5:
        return 'gene_list'  # User provided hits from their screen
    elif isinstance(user_input, str) and len(user_input.split()) > 1:
        return 'cancer_type'  # User asks about a cancer type
    else:
        return 'single_gene'  # Single gene target validation

Gene Symbol Validation

基因符号验证

CRITICAL: Validate gene symbols with fallback to Open Targets if DepMap unavailable.

python

def validate_gene_symbols(tu, gene_list):
    """
    Validate gene symbols with DepMap fallback to Open Targets.

    Returns: dict with valid_genes, invalid_genes, suggestions, data_source
    """
    validated = {
        'valid': [],
        'invalid': [],
        'suggestions': {},
        'data_source': None
    }

    # Try DepMap first
    depmap_available = False
    test_result = tu.tools.DepMap_search_genes(query="KRAS")
    if (test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')):
        depmap_available = True
        validated['data_source'] = 'DepMap (primary)'

    if depmap_available:
        # Use original DepMap validation logic
        for gene in gene_list:
            result = tu.tools.DepMap_search_genes(query=gene)
            if result.get('status') == 'success':
                genes = result.get('data', {}).get('genes', [])
                exact_matches = [g for g in genes
                               if g.get('symbol', '').upper() == gene.upper()]

                if exact_matches:
                    validated['valid'].append({
                        'input': gene,
                        'symbol': exact_matches[0]['symbol'],
                        'ensembl_id': exact_matches[0].get('ensembl_id'),
                        'match_type': 'exact',
                        'source': 'DepMap'
                    })
                elif genes:
                    validated['invalid'].append(gene)
                    validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
                else:
                    validated['invalid'].append(gene)
    else:
        # FALLBACK: Use Pharos (druggability database)
        print("⚠️  DepMap unavailable, using Pharos for gene validation...")
        validated['data_source'] = 'Pharos (fallback - ★★☆)'

        for gene in gene_list:
            # Query Pharos to check if gene exists
            result = tu.tools.Pharos_get_target(gene=gene)

            if result.get('status') == 'success' and result.get('data'):
                target_data = result.get('data', {})
                validated['valid'].append({
                    'input': gene,
                    'symbol': target_data.get('name', gene),
                    'tdl': target_data.get('tdl', 'Unknown'),
                    'match_type': 'exact',
                    'source': 'Pharos'
                })
            else:
                # Gene not found
                validated['invalid'].append(gene)

    return validated

Output for Report:

markdown

undefined

关键要求: 验证基因符号，若DepMap不可用则使用Open Targets作为备选。

python

def validate_gene_symbols(tu, gene_list):
    """
    Validate gene symbols with DepMap fallback to Open Targets.

    Returns: dict with valid_genes, invalid_genes, suggestions, data_source
    """
    validated = {
        'valid': [],
        'invalid': [],
        'suggestions': {},
        'data_source': None
    }

    # Try DepMap first
    depmap_available = False
    test_result = tu.tools.DepMap_search_genes(query="KRAS")
    if (test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')):
        depmap_available = True
        validated['data_source'] = 'DepMap (primary)'

    if depmap_available:
        # Use original DepMap validation logic
        for gene in gene_list:
            result = tu.tools.DepMap_search_genes(query=gene)
            if result.get('status') == 'success':
                genes = result.get('data', {}).get('genes', [])
                exact_matches = [g for g in genes
                               if g.get('symbol', '').upper() == gene.upper()]

                if exact_matches:
                    validated['valid'].append({
                        'input': gene,
                        'symbol': exact_matches[0]['symbol'],
                        'ensembl_id': exact_matches[0].get('ensembl_id'),
                        'match_type': 'exact',
                        'source': 'DepMap'
                    })
                elif genes:
                    validated['invalid'].append(gene)
                    validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
                else:
                    validated['invalid'].append(gene)
    else:
        # FALLBACK: Use Pharos (druggability database)
        print("⚠️  DepMap unavailable, using Pharos for gene validation...")
        validated['data_source'] = 'Pharos (fallback - ★★☆)'

        for gene in gene_list:
            # Query Pharos to check if gene exists
            result = tu.tools.Pharos_get_target(gene=gene)

            if result.get('status') == 'success' and result.get('data'):
                target_data = result.get('data', {})
                validated['valid'].append({
                    'input': gene,
                    'symbol': target_data.get('name', gene),
                    'tdl': target_data.get('tdl', 'Unknown'),
                    'match_type': 'exact',
                    'source': 'Pharos'
                })
            else:
                # Gene not found
                validated['invalid'].append(gene)

    return validated

报告输出示例:

markdown

undefined

Input Validation

输入验证

Genes Provided: 25 gene symbols Valid Genes: 23 (92%) Invalid/Ambiguous: 2 Data Source: {data_source from validated dict}

Invalid Genes:

```
EGFRVIII
```
→ Gene symbol not recognized (mutation-specific identifier)
```
P53
```
→ Did you mean
```
TP53
```
? (use official gene symbol)

Proceeding with 23 valid gene symbols for analysis.

Source: {DepMap gene registry OR Pharos (fallback)}

Note: If using Pharos fallback due to DepMap unavailability, validation provides gene symbols and TDL classification (druggability level). Validation is ★★☆ reliable.

---

提供的基因: 25个基因符号 有效基因: 23个（92%） 无效/模糊基因: 2个 数据源: {validated字典中的data_source}

无效基因:

```
EGFRVIII
```
→ 基因符号未被识别（突变特异性标识符）
```
P53
```
→ 是否指
```
TP53
```
？（请使用官方基因符号）

将基于23个有效基因符号进行分析

来源: {DepMap基因注册表或Pharos（备选）}

注意: 若因DepMap不可用而使用Pharos备选方案，验证将提供基因符号和TDL分类（成药层级）。验证置信度为★★☆。

---

PATH 1: Gene Essentiality Analysis (DepMap with Open Targets Fallback)

路径1: 基因必需性分析（DepMap + Open Targets备选）

Retrieve Essentiality Scores

检索必需性评分

python

def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
    """
    Get gene essentiality data with DepMap fallback to Open Targets.

    DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
    Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
    """
    essentiality_data = []

    # Check if DepMap is available
    test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
    depmap_available = (
        test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')
    )

    if depmap_available:
        # Use original DepMap logic (optimal - ★★★)
        for gene in gene_list:
            dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
            if dep_result.get('status') == 'success':
                gene_data = dep_result.get('data', {})
                essentiality_data.append({
                    'gene': gene,
                    'data': gene_data,
                    'essentiality_class': classify_essentiality_depmap(gene_data),
                    'source': 'DepMap',
                    'confidence': 'HIGH'  # ★★★
                })
    else:
        # FALLBACK: Use Pharos TDL classification as proxy
        print("⚠️  DepMap unavailable, using Pharos TDL as essentiality proxy...")

        for gene in gene_list:
            pharos_result = tu.tools.Pharos_get_target(gene=gene)

            if pharos_result.get('status') == 'success' and pharos_result.get('data'):
                target_data = pharos_result.get('data', {})

                # Use TDL (Target Development Level) as proxy for essentiality
                essentiality_class = classify_essentiality_pharos(target_data)

                essentiality_data.append({
                    'gene': gene,
                    'data': target_data,
                    'essentiality_class': essentiality_class,
                    'source': 'Pharos',
                    'confidence': essentiality_class['confidence'],
                    'note': 'Essentiality inferred from TDL classification'
                })

    return essentiality_data


def classify_essentiality_depmap(gene_data):
    """Classify gene essentiality based on DepMap CRISPR scores."""
    # Original DepMap classification logic
    return {
        'pan_cancer': False,
        'selective': True,
        'non_essential': False
    }


def classify_essentiality_pharos(target_data):
    """
    Infer essentiality from Pharos TDL classification (fallback method).

    TDL (Target Development Level) categories:
    - Tclin: Clinical drug target (approved drugs) → Likely essential/important
    - Tchem: Chemical tool/probe available → Druggable, possibly essential
    - Tbio: Biological evidence → Some relevance
    - Tdark: No drug/tool → Unknown essentiality
    """
    tdl = target_data.get('tdl', 'Unknown')

    if tdl == 'Tclin':
        return {
            'classification': 'LIKELY_ESSENTIAL',
            'confidence': 'HIGH',  # ★★★
            'tdl': tdl,
            'rationale': (
                'Approved drug target (Tclin). '
                'Clinically validated targets are often essential. '
                'For cell-line-specific scores, await DepMap restoration.'
            )
        }
    elif tdl == 'Tchem':
        return {
            'classification': 'POTENTIALLY_ESSENTIAL',
            'confidence': 'MEDIUM',  # ★★☆
            'tdl': tdl,
            'rationale': (
                'Chemical tools available (Tchem). '
                'Druggable targets with chemical probes often have functional relevance.'
            )
        }
    elif tdl == 'Tbio':
        return {
            'classification': 'UNCERTAIN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Biological evidence only (Tbio). '
                'Limited druggability data. Essentiality unclear.'
            )
        }
    else:  # Tdark or Unknown
        return {
            'classification': 'UNKNOWN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Dark target or unknown. '
                'No drug/tool data. Essentiality cannot be inferred.'
            )
        }

python

def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
    """
    Get gene essentiality data with DepMap fallback to Open Targets.

    DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
    Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
    """
    essentiality_data = []

    # Check if DepMap is available
    test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
    depmap_available = (
        test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')
    )

    if depmap_available:
        # Use original DepMap logic (optimal - ★★★)
        for gene in gene_list:
            dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
            if dep_result.get('status') == 'success':
                gene_data = dep_result.get('data', {})
                essentiality_data.append({
                    'gene': gene,
                    'data': gene_data,
                    'essentiality_class': classify_essentiality_depmap(gene_data),
                    'source': 'DepMap',
                    'confidence': 'HIGH'  # ★★★
                })
    else:
        # FALLBACK: Use Pharos TDL classification as proxy
        print("⚠️  DepMap unavailable, using Pharos TDL as essentiality proxy...")

        for gene in gene_list:
            pharos_result = tu.tools.Pharos_get_target(gene=gene)

            if pharos_result.get('status') == 'success' and pharos_result.get('data'):
                target_data = pharos_result.get('data', {})

                # Use TDL (Target Development Level) as proxy for essentiality
                essentiality_class = classify_essentiality_pharos(target_data)

                essentiality_data.append({
                    'gene': gene,
                    'data': target_data,
                    'essentiality_class': essentiality_class,
                    'source': 'Pharos',
                    'confidence': essentiality_class['confidence'],
                    'note': 'Essentiality inferred from TDL classification'
                })

    return essentiality_data


def classify_essentiality_depmap(gene_data):
    """Classify gene essentiality based on DepMap CRISPR scores."""
    # Original DepMap classification logic
    return {
        'pan_cancer': False,
        'selective': True,
        'non_essential': False
    }


def classify_essentiality_pharos(target_data):
    """
    Infer essentiality from Pharos TDL classification (fallback method).

    TDL (Target Development Level) categories:
    - Tclin: Clinical drug target (approved drugs) → Likely essential/important
    - Tchem: Chemical tool/probe available → Druggable, possibly essential
    - Tbio: Biological evidence → Some relevance
    - Tdark: No drug/tool → Unknown essentiality
    """
    tdl = target_data.get('tdl', 'Unknown')

    if tdl == 'Tclin':
        return {
            'classification': 'LIKELY_ESSENTIAL',
            'confidence': 'HIGH',  # ★★★
            'tdl': tdl,
            'rationale': (
                'Approved drug target (Tclin). '
                'Clinically validated targets are often essential. '
                'For cell-line-specific scores, await DepMap restoration.'
            )
        }
    elif tdl == 'Tchem':
        return {
            'classification': 'POTENTIALLY_ESSENTIAL',
            'confidence': 'MEDIUM',  # ★★☆
            'tdl': tdl,
            'rationale': (
                'Chemical tools available (Tchem). '
                'Druggable targets with chemical probes often have functional relevance.'
            )
        }
    elif tdl == 'Tbio':
        return {
            'classification': 'UNCERTAIN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Biological evidence only (Tbio). '
                'Limited druggability data. Essentiality unclear.'
            )
        }
    else:  # Tdark or Unknown
        return {
            'classification': 'UNKNOWN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Dark target or unknown. '
                'No drug/tool data. Essentiality cannot be inferred.'
            )
        }

Cancer Type-Specific Analysis

癌症类型特异性分析

For cancer type queries, retrieve top essential genes:

python

def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
    """
    Retrieve top essential genes for a specific cancer type from DepMap.
    """
    # Get cell lines for this cancer type
    cell_lines = tu.tools.DepMap_get_cell_lines(
        cancer_type=cancer_type,
        page_size=100
    )

    if not cell_lines.get('data', {}).get('cell_lines'):
        return {'error': f'No cell lines found for {cancer_type}'}

    # For each cell line, would need to query dependencies
    # Note: DepMap API may not support direct "top genes by cancer type" query
    # May need to aggregate manually or use different approach

    return {
        'cancer_type': cancer_type,
        'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
        'note': 'Full analysis requires per-cell-line dependency data aggregation'
    }

Output for Report (when DepMap available):

markdown

undefined

针对癌症类型查询，检索顶级必需基因：

python

def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
    """
    Retrieve top essential genes for a specific cancer type from DepMap.
    """
    # Get cell lines for this cancer type
    cell_lines = tu.tools.DepMap_get_cell_lines(
        cancer_type=cancer_type,
        page_size=100
    )

    if not cell_lines.get('data', {}).get('cell_lines'):
        return {'error': f'No cell lines found for {cancer_type}'}

    # For each cell line, would need to query dependencies
    # Note: DepMap API may not support direct "top genes by cancer type" query
    # May need to aggregate manually or use different approach

    return {
        'cancer_type': cancer_type,
        'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
        'note': 'Full analysis requires per-cell-line dependency data aggregation'
    }

DepMap可用时的报告输出:

markdown

undefined

1. Gene Essentiality Analysis

1. 基因必需性分析

Data Source: DepMap CRISPR (24Q2) ✅ Confidence: ★★★ HIGH

数据源: DepMap CRISPR (24Q2) ✅ 置信度: ★★★ 高

Strongly Essential Genes (DepMap Score < -1.0)

强必需基因（DepMap评分 < -1.0）

Gene	Mean Effect	Essential Cell Lines (%)	Selectivity	Evidence
RPL5	-1.45	98% (1,042/1,063)	Pan-cancer	★★★
RPS6	-1.32	96% (1,019/1,063)	Pan-cancer	★★★
POLR2A	-1.28	95% (1,010/1,063)	Pan-cancer	★★★

Interpretation: These genes are essential for cell survival across nearly all cancer types. They are core fitness genes (ribosomal proteins, RNA polymerase) and likely not selective therapeutic targets.

Source: DepMap via
DepMap_get_gene_dependencies


**Output for Report (when using Pharos fallback)**:
```markdown

基因	平均效应值	必需细胞系占比	选择性	证据
RPL5	-1.45	98% (1,042/1,063)	泛癌	★★★
RPS6	-1.32	96% (1,019/1,063)	泛癌	★★★
POLR2A	-1.28	95% (1,010/1,063)	泛癌	★★★

解读: 这些基因在几乎所有癌症类型中对细胞存活都是必需的。它们是核心 fitness 基因（核糖体蛋白、RNA聚合酶），可能不适合作为选择性治疗靶点。

来源: DepMap via
DepMap_get_gene_dependencies


**使用Pharos备选方案时的报告输出**:
```markdown

1. Gene Essentiality Analysis

1. 基因必需性分析

⚠️ Data Source: Pharos (DepMap CRISPR temporarily unavailable) Analysis Method: Essentiality inferred from TDL (Target Development Level) classification Confidence: Varies by TDL (Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆)

⚠️ 数据源: Pharos（DepMap CRISPR暂时不可用） 分析方法: 基于TDL（靶点开发层级）分类推断必需性 置信度: 随TDL而异（Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆）

Clinically Validated Targets (Tclin - Likely Essential)

临床验证靶点（Tclin - 可能必需）

Gene	TDL	Clinical Status	Inference	Evidence
KRAS	Tclin	Approved drugs (sotorasib, adagrasib)	Likely essential in KRAS-mutant cancers	★★★
EGFR	Tclin	Multiple approved inhibitors	Likely essential in EGFR-mutant cancers	★★★

Interpretation: Tclin targets have approved drugs, indicating clinical validation. These are likely essential in specific contexts (mutation-dependent).

基因	TDL	临床状态	推断结果	证据
KRAS	Tclin	已获批药物（sotorasib、adagrasib）	在KRAS突变型癌症中可能必需	★★★
EGFR	Tclin	多种获批抑制剂	在EGFR突变型癌症中可能必需	★★★

解读: Tclin靶点拥有获批药物，表明已通过临床验证。这些靶点在特定背景下（依赖突变）可能是必需的。

Chemical Probe Available (Tchem - Potentially Essential)

有化学探针可用的靶点（Tchem - 潜在必需）

Gene	TDL	Tool Status	Inference	Evidence
CDK2	Tchem	Chemical probes available	Potentially essential (cell cycle)	★★☆
WEE1	Tchem	Chemical inhibitors available	Potentially essential (DNA damage)	★★☆

Interpretation: Tchem targets are druggable with chemical tools. Druggability suggests functional importance.

Note: TDL classification is a proxy for essentiality. For definitive CRISPR dependency scores, DepMap data required.

Source: Pharos via
Pharos_get_target
(fallback method)

基因	TDL	工具状态	推断结果	证据
CDK2	Tchem	有化学探针可用	潜在必需（细胞周期相关）	★★☆
WEE1	Tchem	有化学抑制剂可用	潜在必需（DNA损伤相关）	★★☆

解读: Tchem靶点可被化学工具靶向。成药性表明其具有功能相关性。

注意: TDL分类是必需性的替代指标。如需明确的CRISPR依赖性评分，需等待DepMap数据恢复。

来源: Pharos via
Pharos_get_target
（备选方法）

Selectively Essential Genes (Tissue/Context-Specific)

选择性必需基因（组织/背景特异性）

Gene	Mean Effect	Essential in	Non-Essential in	Selectivity Score	Evidence
KRAS	-0.85	Pancreatic (95%), Lung (78%), Colon (82%)	Breast (12%), Glioma (8%)	High	★★★
EGFR	-0.72	Lung (85%), Glioblastoma (76%)	Most others (<20%)	High	★★★
ESR1	-0.68	ER+ Breast (92%)	ER- Breast (5%), Other (<3%)	Very High	★★★

Interpretation: Selectively essential genes show strong context-dependency and represent high-value therapeutic targets with potential for tissue-selective toxicity profiles.

Source: DepMap via
DepMap_get_gene_dependencies

基因	平均效应值	在以下癌症中必需	在以下癌症中非必需	选择性评分	证据
KRAS	-0.85	胰腺癌(95%)、肺癌(78%)、结肠癌(82%)	乳腺癌(12%)、胶质瘤(8%)	高	★★★
EGFR	-0.72	肺癌(85%)、胶质母细胞瘤(76%)	大多数其他癌症(<20%)	高	★★★
ESR1	-0.68	ER+乳腺癌(92%)	ER-乳腺癌(5%)、其他(<3%)	极高	★★★

解读: 选择性必需基因表现出强背景依赖性，是具有组织选择性毒性潜力的高价值治疗靶点。

来源: DepMap via
DepMap_get_gene_dependencies

Non-Essential/Weak Hits (Score > -0.5)

弱必需/非必需候选靶点（评分 > -0.5）

Gene	Mean Effect	% Essential	Interpretation
GENE1	-0.25	15%	Weak dependency, potential off-target or passenger
GENE2	-0.12	8%	Non-essential in most contexts

Note: These genes may still be biologically relevant (e.g., synthetic lethal interactions, drug targets for specific contexts) but show weak essentiality in CRISPR screens.

Essentiality Summary:

Pan-cancer essential: 12 genes (↓ deprioritize for selective targeting)
Selectively essential: 18 genes (★ HIGH PRIORITY for validation)
Weakly essential: 15 genes (context-dependent, requires further investigation)

All essentiality data from DepMap Portal (DepMap Public 24Q2 release)

---

基因	平均效应值	必需占比	解读
GENE1	-0.25	15%	弱依赖性，潜在脱靶或乘客基因
GENE2	-0.12	8%	在大多数背景下非必需

注意: 这些基因仍可能具有生物学相关性（如合成致死相互作用、特定背景下的药物靶点），但在CRISPR筛选中显示出弱必需性。

必需性总结:

泛癌必需: 12个基因（↓ 不优先作为选择性靶点）
选择性必需: 18个基因（★ 验证高优先级）
弱必需: 15个基因（背景依赖性，需进一步研究）

所有必需性数据来自DepMap Portal（DepMap Public 24Q2版本）

---

PATH 2: Pathway & Functional Enrichment

路径2: 通路与功能富集

Gene Set Enrichment Analysis

基因集富集分析

python

def perform_pathway_enrichment(tu, gene_list):
    """
    Run enrichment analysis across multiple libraries.
    """
    # Enrichr libraries to query
    libraries = [
        "WikiPathways_2024_Human",
        "Reactome_Pathways_2024",
        "MSigDB_Hallmark_2020",
        "GO_Biological_Process_2023",
        "GO_Molecular_Function_2023",
        "GO_Cellular_Component_2023",
        "KEGG_2024_Human"
    ]

    result = tu.tools.enrichr_gene_enrichment_analysis(
        gene_list=gene_list,
        libs=libraries
    )

    # Parse results - Enrichr returns pathway rankings with p-values
    return result

Output for Report:

markdown

undefined

python

def perform_pathway_enrichment(tu, gene_list):
    """
    Run enrichment analysis across multiple libraries.
    """
    # Enrichr libraries to query
    libraries = [
        "WikiPathways_2024_Human",
        "Reactome_Pathways_2024",
        "MSigDB_Hallmark_2020",
        "GO_Biological_Process_2023",
        "GO_Molecular_Function_2023",
        "GO_Cellular_Component_2023",
        "KEGG_2024_Human"
    ]

    result = tu.tools.enrichr_gene_enrichment_analysis(
        gene_list=gene_list,
        libs=libraries
    )

    # Parse results - Enrichr returns pathway rankings with p-values
    return result

报告输出:

markdown

undefined

2. Pathway & Functional Enrichment

2. 通路与功能富集

Top Enriched Pathways (p < 0.01, FDR < 0.05)

顶级富集通路（p < 0.01, FDR < 0.05）

Reactome Pathways

Reactome通路

Pathway	Genes	p-value	FDR	Odds Ratio	Evidence
Cell Cycle Checkpoints	12/18	1.2e-8	3.4e-6	15.3	★★★
DNA Replication	8/18	3.5e-6	4.2e-4	12.1	★★★
G1/S Transition	7/18	5.1e-5	2.1e-3	9.8	★★☆

Genes in pathway: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2

Interpretation: Strong enrichment in cell cycle control pathways suggests the screen identified proliferation-essential genes. These represent core cell cycle machinery.

通路	基因数	p值	FDR	优势比	证据
细胞周期检查点	12/18	1.2e-8	3.4e-6	15.3	★★★
DNA复制	8/18	3.5e-6	4.2e-4	12.1	★★★
G1/S转换	7/18	5.1e-5	2.1e-3	9.8	★★☆

通路中的基因: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2

解读: 细胞周期调控通路的强富集表明，筛选识别出了增殖必需基因。这些基因代表核心细胞周期机制。

GO Biological Process

GO生物过程

Term	Genes	p-value	FDR	Evidence
DNA replication initiation	6/18	2.1e-7	1.5e-5	★★★
G1/S transition of mitotic cell cycle	8/18	8.3e-7	3.2e-5	★★★
regulation of cyclin-dependent protein kinase activity	5/18	1.2e-4	8.9e-3	★★☆

术语	基因数	p值	FDR	证据
DNA复制起始	6/18	2.1e-7	1.5e-5	★★★
有丝分裂细胞周期的G1/S转换	8/18	8.3e-7	3.2e-5	★★★
细胞周期蛋白依赖性激酶活性调控	5/18	1.2e-4	8.9e-3	★★☆

MSigDB Hallmark Gene Sets

MSigDB特征基因集

Hallmark	Genes	p-value	FDR	Evidence
E2F Targets	10/18	6.7e-10	1.2e-8	★★★
G2M Checkpoint	9/18	3.4e-8	2.1e-6	★★★
MYC Targets V1	7/18	2.1e-5	9.8e-4	★★☆

Key Finding: Hits converge on E2F/RB pathway, suggesting screen successfully identified proliferation machinery. This is expected for dropout screens in proliferating cancer cells.

Source: Enrichr via
enrichr_gene_enrichment_analysis

特征集	基因数	p值	FDR	证据
E2F靶点	10/18	6.7e-10	1.2e-8	★★★
G2M检查点	9/18	3.4e-8	2.1e-6	★★★
MYC靶点V1	7/18	2.1e-5	9.8e-4	★★☆

关键发现: 候选靶点集中在E2F/RB通路，表明筛选成功识别出增殖机制相关基因。这在增殖性癌细胞的dropout筛选中符合预期。

来源: Enrichr via
enrichr_gene_enrichment_analysis

No Significant Enrichment

无显著富集

GO Molecular Function: No terms pass FDR < 0.05 KEGG Pathways: Marginal enrichment (p < 0.05) but does not survive multiple testing correction

Interpretation: Gene list may be heterogeneous or represent diverse biological processes. Consider sub-clustering analysis.

---

GO分子功能: 无术语通过FDR < 0.05的阈值 KEGG通路: 存在边缘富集（p < 0.05）但未通过多重检验校正

解读: 基因列表可能具有异质性，或代表多种生物学过程。建议考虑亚聚类分析。

---

PATH 3: Protein-Protein Interaction Networks

路径3: 蛋白质-蛋白质相互作用网络

Build PPI Network

构建PPI网络

python

def build_ppi_network(tu, gene_list):
    """
    Construct protein interaction network for hit genes.
    """
    # Use STRING for comprehensive PPI data
    ppi_result = tu.tools.STRING_get_protein_interactions(
        protein_ids=gene_list,
        species=9606  # Human
    )

    # Also check IntAct for curated interactions
    interactions = []
    for gene in gene_list:
        # Get UniProt ID first
        uniprot = resolve_gene_to_uniprot(tu, gene)
        if uniprot:
            intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
            interactions.append(intact_result)

    return {
        'string': ppi_result,
        'intact': interactions
    }

def identify_protein_complexes(ppi_data):
    """
    Identify protein complexes from PPI network.

    Could use complex detection algorithms or query Complex Portal.
    """
    # Implementation for complex detection
    pass

Output for Report:

markdown

undefined

python

def build_ppi_network(tu, gene_list):
    """
    Construct protein interaction network for hit genes.
    """
    # Use STRING for comprehensive PPI data
    ppi_result = tu.tools.STRING_get_protein_interactions(
        protein_ids=gene_list,
        species=9606  # Human
    )

    # Also check IntAct for curated interactions
    interactions = []
    for gene in gene_list:
        # Get UniProt ID first
        uniprot = resolve_gene_to_uniprot(tu, gene)
        if uniprot:
            intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
            interactions.append(intact_result)

    return {
        'string': ppi_result,
        'intact': interactions
    }

def identify_protein_complexes(ppi_data):
    """
    Identify protein complexes from PPI network.

    Could use complex detection algorithms or query Complex Portal.
    """
    # Implementation for complex detection
    pass

报告输出:

markdown

undefined

3. Protein Interaction Network Analysis

3. 蛋白质相互作用网络分析

Network Statistics

网络统计

Nodes: 45 proteins (from 45 input genes)
Edges: 128 interactions (STRING combined score > 0.4)
Network Density: 0.063
Average Clustering Coefficient: 0.45
Hub Genes (>10 interactions): CDK2, RB1, E2F1, CCNE1

Interpretation: High clustering coefficient indicates genes are functionally related and form coherent protein complexes.

Source: STRING via
STRING_get_protein_interactions

节点: 45个蛋白质（来自45个输入基因）
边: 128个相互作用（STRING综合评分 > 0.4）
网络密度: 0.063
平均聚类系数: 0.45
枢纽基因（>10个相互作用）: CDK2, RB1, E2F1, CCNE1

解读: 高聚类系数表明这些基因功能相关，形成连贯的蛋白质复合物。

来源: STRING via
STRING_get_protein_interactions

Protein Complexes Identified

识别出的蛋白质复合物

Complex	Members	Function	Essential?
MCM Complex	MCM2, MCM3, MCM4, MCM5, MCM6, MCM7	DNA replication helicase	Yes (pan-cancer)
Cyclin E-CDK2	CCNE1, CCNE2, CDK2	G1/S transition kinase	Yes (selective)
E2F/DP/RB	E2F1, E2F2, E2F3, RB1, TFDP1	Transcription regulation	Yes (context-dependent)

Key Finding: Screen hit multiple members of the same essential complexes. This provides validation (independent hits in same pathway) and suggests complex-level vulnerability.

Source: Complex Portal annotations + STRING clustering

复合物	成员	功能	是否必需？
MCM复合物	MCM2, MCM3, MCM4, MCM5, MCM6, MCM7	DNA复制解旋酶	是（泛癌）
Cyclin E-CDK2	CCNE1, CCNE2, CDK2	G1/S转换激酶	是（选择性）
E2F/DP/RB	E2F1, E2F2, E2F3, RB1, TFDP1	转录调控	是（背景依赖性）

关键发现: 筛选命中了同一必需复合物的多个成员。这为结果提供了验证（同一通路中的独立命中），表明复合物层面存在脆弱性。

来源: Complex Portal注释 + STRING聚类

Synthetic Lethal Candidates

合成致死候选靶点

Based on PPI network and literature:

Gene A (Hit)	Gene B (Candidate)	Relationship	Evidence	Source
RB1	ARID1A	Synthetic lethal	★★☆	PMID:29534788
KRAS	STK11	Synthetic lethal	★★★	PMID:31010833

Recommendation: Test synthetic lethal candidates (Gene B) for combination therapy with inhibitors of Gene A.

---

基于PPI网络和文献：

候选靶点A（命中基因）	候选靶点B（待验证）	关系	证据	来源
RB1	ARID1A	合成致死	★★☆	PMID:29534788
KRAS	STK11	合成致死	★★★	PMID:31010833

建议: 测试合成致死候选靶点（基因B）与基因A抑制剂的联合治疗效果。

---

PATH 4: Druggability & Target Assessment

路径4: 成药性与靶点评估

Assess Drug Target Potential

评估药物靶点潜力

python

def assess_druggability(tu, gene_list):
    """
    Evaluate druggability of hit genes.
    """
    drug_targets = []

    for gene in gene_list:
        # Check Pharos for target development level
        pharos = tu.tools.Pharos_get_target(gene=gene)

        # Check DGIdb for existing drugs
        dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])

        # Check Open Targets for chemical probes
        ensembl_id = resolve_gene_to_ensembl(tu, gene)
        if ensembl_id:
            probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

        # Check clinical trials
        trials = tu.tools.search_clinical_trials(
            intervention=gene,
            pageSize=20
        )

        drug_targets.append({
            'gene': gene,
            'pharos_tdl': pharos.get('data', {}).get('tdl'),
            'existing_drugs': dgidb,
            'chemical_probes': probes,
            'clinical_trials': trials
        })

    return drug_targets

Output for Report:

markdown

undefined

python

def assess_druggability(tu, gene_list):
    """
    Evaluate druggability of hit genes.
    """
    drug_targets = []

    for gene in gene_list:
        # Check Pharos for target development level
        pharos = tu.tools.Pharos_get_target(gene=gene)

        # Check DGIdb for existing drugs
        dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])

        # Check Open Targets for chemical probes
        ensembl_id = resolve_gene_to_ensembl(tu, gene)
        if ensembl_id:
            probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

        # Check clinical trials
        trials = tu.tools.search_clinical_trials(
            intervention=gene,
            pageSize=20
        )

        drug_targets.append({
            'gene': gene,
            'pharos_tdl': pharos.get('data', {}).get('tdl'),
            'existing_drugs': dgidb,
            'chemical_probes': probes,
            'clinical_trials': trials
        })

    return drug_targets

报告输出:

markdown

undefined

4. Druggability & Clinical Target Assessment

4. 成药性与临床靶点评估

Target Development Level Classification (Pharos)

靶点开发层级分类（Pharos）

TDL	Count	Genes	Interpretation
Tclin	5	EGFR, KRAS, CDK2, HDAC1, AURKA	Approved drug targets
Tchem	8	WEE1, PLK1, CHEK1, ...	Chemical matter available, druggable
Tbio	12	E2F1, RB1, ...	Biologically characterized, may need novel modalities
Tdark	3	GENE_X, GENE_Y, GENE_Z	Understudied, limited tool compounds

Priority Ranking: Tclin > Tchem > Tbio for near-term drug development feasibility.

Source: Pharos/TCRD via
Pharos_get_target

TDL	数量	基因	解读
Tclin	5	EGFR, KRAS, CDK2, HDAC1, AURKA	已获批药物靶点
Tchem	8	WEE1, PLK1, CHEK1, ...	有化学物质可用，可靶向
Tbio	12	E2F1, RB1, ...	已进行生物学表征，可能需要新的靶向方式
Tdark	3	GENE_X, GENE_Y, GENE_Z	研究不足，工具化合物有限

优先级排序: Tclin > Tchem > Tbio（从近期药物开发可行性角度）

来源: Pharos/TCRD via
Pharos_get_target

Approved Drugs & Clinical Tools

已获批药物与临床工具

Gene	Drug(s)	Status	Indication	Source
EGFR	Erlotinib, Gefitinib, Osimertinib	Approved	NSCLC	DGIdb
CDK2	Dinaciclib	Phase 2	Hematologic malignancies	ClinicalTrials.gov
AURKA	Alisertib	Phase 3	Lymphoma	ClinicalTrials.gov
WEE1	Adavosertib	Phase 2	Solid tumors	ClinicalTrials.gov

Clinical Readiness: 5 genes have approved/late-stage drugs. These represent immediate repurposing opportunities.

Sources: DGIdb via
DGIdb_get_drug_gene_interactions
, ClinicalTrials.gov

基因	药物	状态	适应症	来源
EGFR	厄洛替尼、吉非替尼、奥希替尼	已获批	非小细胞肺癌	DGIdb
CDK2	Dinaciclib	2期	血液系统恶性肿瘤	ClinicalTrials.gov
AURKA	Alisertib	3期	淋巴瘤	ClinicalTrials.gov
WEE1	Adavosertib	2期	实体瘤	ClinicalTrials.gov

临床就绪度: 5个基因拥有获批/后期临床试验药物。这些代表了直接的药物重定位机会。

来源: DGIdb via
DGIdb_get_drug_gene_interactions
, ClinicalTrials.gov

Chemical Probes Available

可用化学探针

Gene	Probe	Potency	Selectivity	Use	Source
CDK2	Roscovitine	IC50 ~200nM	Moderate (pan-CDK)	Tool compound	SGC/Open Targets
HDAC1	SAHA (Vorinostat)	IC50 ~10nM	Pan-HDAC	Approved drug, research tool	ChEMBL

Source: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblId

基因	探针	效力	选择性	用途	来源
CDK2	Roscovitine	IC50 ~200nM	中等（泛CDK）	工具化合物	SGC/Open Targets
HDAC1	SAHA（伏立诺他）	IC50 ~10nM	泛HDAC	获批药物、研究工具	ChEMBL

来源: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblId

Non-Druggable Hits Requiring Alternative Strategies

非可靶向候选靶点的替代策略

Gene	Challenge	Recommended Approach
E2F1	Transcription factor (no catalytic domain)	PROTACs, molecular glue degraders
RB1	Tumor suppressor (loss-of-function)	Synthetic lethal approach (e.g., CDK4/6i)
MCM2	Part of large complex, no pockets	Indirect targeting via cell cycle inhibitors

Validation Priority: Focus on Tclin/Tchem hits with existing tool compounds for faster validation.

---

基因	挑战	推荐方案
E2F1	转录因子（无催化结构域）	PROTAC、分子胶降解剂
RB1	肿瘤抑制因子（功能缺失）	合成致死策略（如CDK4/6抑制剂）
MCM2	大型复合物的一部分，无结合口袋	通过细胞周期抑制剂间接靶向

验证优先级: 优先关注有现有工具化合物的Tclin/Tchem靶点，以加快验证速度。

---

PATH 5: Disease Association & Clinical Relevance

路径5: 疾病关联与临床相关性

Cancer Genomics Integration

癌症基因组整合

python

def assess_clinical_relevance(tu, gene_list, cancer_type):
    """
    Evaluate clinical relevance of hits in target cancer type.
    """
    clinical_data = []

    for gene in gene_list:
        ensembl_id = resolve_gene_to_ensembl(tu, gene)

        if ensembl_id:
            # Disease associations
            diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

            # Mouse models
            mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
                ensemblId=ensembl_id
            )

        # COSMIC mutations (somatic alterations in cancer)
        cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)

        # GTEx expression (is it expressed in relevant tissue?)
        gtex = tu.tools.GTEx_get_median_gene_expression(
            gencode_id=ensembl_id,
            operation="median"
        )

        clinical_data.append({
            'gene': gene,
            'diseases': diseases,
            'mutations': cosmic,
            'expression': gtex,
            'mouse_models': mouse
        })

    return clinical_data

Output for Report:

markdown

undefined

python

def assess_clinical_relevance(tu, gene_list, cancer_type):
    """
    Evaluate clinical relevance of hits in target cancer type.
    """
    clinical_data = []

    for gene in gene_list:
        ensembl_id = resolve_gene_to_ensembl(tu, gene)

        if ensembl_id:
            # Disease associations
            diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

            # Mouse models
            mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
                ensemblId=ensembl_id
            )

        # COSMIC mutations (somatic alterations in cancer)
        cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)

        # GTEx expression (is it expressed in relevant tissue?)
        gtex = tu.tools.GTEx_get_median_gene_expression(
            gencode_id=ensembl_id,
            operation="median"
        )

        clinical_data.append({
            'gene': gene,
            'diseases': diseases,
            'mutations': cosmic,
            'expression': gtex,
            'mouse_models': mouse
        })

    return clinical_data

报告输出:

markdown

undefined

5. Clinical Relevance & Disease Association

5. 临床相关性与疾病关联

Cancer Genomic Alterations (COSMIC)

癌症基因组改变（COSMIC）

Gene	Mutation Frequency	Cancer Types (Top 3)	Alteration Type	Evidence
KRAS	22% across all cancers	Pancreatic (90%), Colon (45%), Lung (32%)	Activating mutations	★★★
EGFR	8% across all cancers	Lung (15%), Glioma (30%), Breast (2%)	Amplification, mutations	★★★
TP53	42% across all cancers	Universal	Loss-of-function	★★★

Interpretation: High mutation frequency indicates gene is driver in those cancer types. CRISPR essentiality + genomic alteration = strong therapeutic rationale.

Source: COSMIC via
COSMIC_get_gene_mutations

基因	突变频率	主要癌症类型（前3）	改变类型	证据
KRAS	所有癌症中占22%	胰腺癌(90%)、结肠癌(45%)、肺癌(32%)	激活突变	★★★
EGFR	所有癌症中占8%	肺癌(15%)、胶质瘤(30%)、乳腺癌(2%)	扩增、突变	★★★
TP53	所有癌症中占42%	所有癌症	功能缺失	★★★

解读: 高突变频率表明该基因是这些癌症类型中的驱动基因。CRISPR必需性 + 基因组改变 = 强有力的治疗依据。

来源: COSMIC via
COSMIC_get_gene_mutations

Expression in Normal vs Tumor Tissue (GTEx/TCGA)

正常组织vs肿瘤组织中的表达（GTEx/TCGA）

Gene	Normal Lung (median TPM)	Lung Tumor (TCGA)	Tumor/Normal Ratio	Therapeutic Window
EGFR	8.5	45.3	5.3x	Moderate
AURKA	2.1	18.7	8.9x	Good
RPS6	125.3	132.1	1.05x	Poor (housekeeping)

Interpretation: Genes with >3x tumor/normal expression offer better therapeutic window. Housekeeping genes (e.g., ribosomal) show poor selectivity.

Sources: GTEx via
GTEx_get_median_gene_expression
, TCGA data

基因	正常肺组织（中位TPM）	肺癌组织（TCGA）	肿瘤/正常比值	治疗窗口
EGFR	8.5	45.3	5.3倍	中等
AURKA	2.1	18.7	8.9倍	良好
RPS6	125.3	132.1	1.05倍	差（管家基因）

解读: 肿瘤/正常表达比值>3倍的基因具有更好的治疗窗口。管家基因（如核糖体基因）选择性差。

来源: GTEx via
GTEx_get_median_gene_expression
, TCGA数据

Prognostic/Predictive Biomarker Status

预后/预测生物标志物状态

Gene	Biomarker Type	Cancer	Association	Evidence	Source
KRAS	Predictive (negative)	Colorectal	KRAS mut → anti-EGFR resistance	★★★	FDA label
EGFR	Predictive (positive)	NSCLC	EGFR mut → TKI response	★★★	FDA companion dx
ESR1	Predictive (positive)	Breast	ESR1 expression → endocrine therapy	★★★	Clinical guidelines

Clinical Impact: 3 genes are established biomarkers with FDA-approved tests. Targeting these genes has strong clinical precedent.

---

基因	生物标志物类型	癌症	关联关系	证据	来源
KRAS	预测性（阴性）	结直肠癌	KRAS突变→抗EGFR治疗耐药	★★★	FDA标签
EGFR	预测性（阳性）	非小细胞肺癌	EGFR突变→TKI应答	★★★	FDA伴随诊断
ESR1	预测性（阳性）	乳腺癌	ESR1表达→内分泌治疗应答	★★★	临床指南

临床影响: 3个基因是已确立的生物标志物，拥有FDA批准的检测方法。靶向这些基因具有坚实的临床先例。

---

PATH 6: Hit Prioritization & Validation Strategy

路径6: 候选靶点优先级排序与验证策略

Integrate All Evidence Dimensions

整合所有证据维度

python

def calculate_priority_score(gene_data):
    """
    Calculate multi-dimensional priority score.

    Components:
    - Essentiality strength (DepMap score)
    - Selectivity (tissue-specific vs pan-cancer)
    - Druggability (Pharos TDL, existing compounds)
    - Clinical relevance (mutations, expression, biomarkers)
    - Validation feasibility (tool compounds available)

    Returns score 0-100
    """
    score = 0

    # Essentiality (0-30 points)
    if gene_data['depmap_score'] < -1.0:
        score += 30
    elif gene_data['depmap_score'] < -0.5:
        score += 20
    else:
        score += 10

    # Selectivity (0-25 points)
    if gene_data['selective']:  # Tissue-specific
        score += 25
    elif gene_data['pan_cancer']:  # Pan-cancer (deprioritize)
        score += 5

    # Druggability (0-25 points)
    if gene_data['pharos_tdl'] == 'Tclin':
        score += 25
    elif gene_data['pharos_tdl'] == 'Tchem':
        score += 20
    elif gene_data['pharos_tdl'] == 'Tbio':
        score += 10
    else:
        score += 5

    # Clinical relevance (0-20 points)
    if gene_data['mutation_frequency'] > 20:
        score += 10
    if gene_data['biomarker_status']:
        score += 10

    return score

Output for Report:

markdown

undefined

python

def calculate_priority_score(gene_data):
    """
    Calculate multi-dimensional priority score.

    Components:
    - Essentiality strength (DepMap score)
    - Selectivity (tissue-specific vs pan-cancer)
    - Druggability (Pharos TDL, existing compounds)
    - Clinical relevance (mutations, expression, biomarkers)
    - Validation feasibility (tool compounds available)

    Returns score 0-100
    """
    score = 0

    # Essentiality (0-30 points)
    if gene_data['depmap_score'] < -1.0:
        score += 30
    elif gene_data['depmap_score'] < -0.5:
        score += 20
    else:
        score += 10

    # Selectivity (0-25 points)
    if gene_data['selective']:  # Tissue-specific
        score += 25
    elif gene_data['pan_cancer']:  # Pan-cancer (deprioritize)
        score += 5

    # Druggability (0-25 points)
    if gene_data['pharos_tdl'] == 'Tclin':
        score += 25
    elif gene_data['pharos_tdl'] == 'Tchem':
        score += 20
    elif gene_data['pharos_tdl'] == 'Tbio':
        score += 10
    else:
        score += 5

    # Clinical relevance (0-20 points)
    if gene_data['mutation_frequency'] > 20:
        score += 10
    if gene_data['biomarker_status']:
        score += 10

    return score

报告输出:

markdown

undefined

6. Hit Prioritization & Validation Recommendations

6. 候选靶点优先级排序与验证建议

Top 10 Priority Targets (Multi-Dimensional Scoring)

前10个优先级靶点（多维度评分）

Rank	Gene	Essentiality	Selectivity	Druggability	Clinical	Total Score	Recommendation
1	KRAS	30/30	25/25	20/25	20/20	95/100	High priority, validated drugs available
2	EGFR	28/30	24/25	25/25	18/20	95/100	High priority, approved drugs
3	WEE1	26/30	23/25	20/25	12/20	81/100	Medium-high, Phase 2 drug available
4	AURKA	24/30	22/25	20/25	14/20	80/100	Medium-high, tool compounds exist
5	CDK2	25/30	20/25	20/25	10/20	75/100	Medium, multiple tool compounds
6	CHEK1	23/30	21/25	18/25	10/20	72/100	Medium, chemical probes available
7	PLK1	22/30	20/25	18/25	11/20	71/100	Medium, clinical tool compounds
8	E2F1	24/30	22/25	10/25	12/20	68/100	Medium-low, requires degrader strategy
9	HDAC1	20/30	18/25	25/25	8/20	71/100	Medium, approved HDAC inhibitors
10	MCM2	28/30	10/25	5/25	8/20	51/100	Low, pan-cancer essential, not druggable

Scoring Rubric:

Essentiality (30 pts): DepMap gene effect score magnitude
Selectivity (25 pts): Tissue-specific vs pan-cancer dependency
Druggability (25 pts): Pharos TDL, existing compounds, tractability
Clinical (20 pts): Mutation frequency, biomarker status, expression

Priority Tiers:

Tier 1 (Score >80): Immediate validation, existing tools/drugs available
Tier 2 (Score 60-80): Medium priority, validation feasible with chemical probes
Tier 3 (Score <60): Lower priority or requires novel approaches (PROTACs, etc.)

排名	基因	必需性	选择性	成药性	临床相关性	总分	建议
1	KRAS	30/30	25/25	20/25	20/20	95/100	高优先级，有已验证药物可用
2	EGFR	28/30	24/25	25/25	18/20	95/100	高优先级，有获批药物
3	WEE1	26/30	23/25	20/25	12/20	81/100	中高优先级，有2期药物可用
4	AURKA	24/30	22/25	20/25	14/20	80/100	中高优先级，有工具化合物可用
5	CDK2	25/30	20/25	20/25	10/20	75/100	中优先级，有多种工具化合物
6	CHEK1	23/30	21/25	18/25	10/20	72/100	中优先级，有化学探针可用
7	PLK1	22/30	20/25	18/25	11/20	71/100	中优先级，有临床工具化合物
8	E2F1	24/30	22/25	10/25	12/20	68/100	中低优先级，需要降解剂策略
9	HDAC1	20/30	18/25	25/25	8/20	71/100	中优先级，有获批HDAC抑制剂
10	MCM2	28/30	10/25	5/25	8/20	51/100	低优先级，泛癌必需，不可靶向

评分规则:

必需性（30分）: DepMap基因效应值的绝对值
选择性（25分）: 组织特异性vs泛癌依赖性
成药性（25分）: Pharos TDL、现有化合物、可靶向性
临床相关性（20分）: 突变频率、生物标志物状态、表达情况

优先级层级:

1级（得分>80）: 立即验证，有现有工具/药物可用
2级（得分60-80）: 中优先级，可用化学探针进行验证
3级（得分<60）: 低优先级或需要新方法（如PROTAC等）

Validation Experiment Recommendations

验证实验建议

Tier 1 Targets (KRAS, EGFR, WEE1)

1级靶点（KRAS、EGFR、WEE1）

1. KRAS

Essentiality: Strong selective dependency in KRAS-mutant cancers
Validation Approach:
- Test KRAS G12C inhibitor (sotorasib/adagrasib) in KRAS G12C-mutant cell lines from screen
- Orthogonal validation: siRNA/shRNA knockdown
- Rescue experiment: Re-express WT KRAS in KO cells
Expected Outcome: Growth inhibition/cell death in KRAS-mutant lines only
Tool Compounds: Sotorasib (AMG 510), Adagrasib (MRTX849), MRTX1133 (pan-KRAS)
Timeline: 2-3 weeks for cell line validation

2. EGFR

Essentiality: Selective in EGFR-mutant/amplified NSCLC, glioblastoma
Validation Approach:
- Test EGFR TKI panel (erlotinib, osimertinib) in screen cell lines
- Dose-response curves to establish IC50
- Combination with standard chemotherapy
Expected Outcome: Potent inhibition in EGFR-altered lines
Tool Compounds: Erlotinib, Gefitinib, Osimertinib (all FDA-approved)
Timeline: 1-2 weeks

3. WEE1

Essentiality: Synthetic lethal with TP53 loss, selective in TP53-mutant cancers
Validation Approach:
- Test adavosertib (WEE1 inhibitor) ± DNA damaging agents
- Stratify by TP53 status (mutant vs WT)
- Cell cycle analysis (premature mitotic entry)
Expected Outcome: Selective killing of TP53-mutant cells + synergy with chemo
Tool Compounds: Adavosertib (AZD1775), PD-166285
Timeline: 2-3 weeks

1. KRAS

必需性: 在KRAS突变型癌症中具有强选择性依赖性
验证方法:
- 在筛选得到的KRAS G12C突变型细胞系中测试KRAS G12C抑制剂（sotorasib/adagrasib）
- 正交验证：siRNA/shRNA敲低
- 拯救实验：在敲除细胞中重新表达野生型KRAS
预期结果: 仅在KRAS突变型细胞系中出现生长抑制/细胞死亡
工具化合物: Sotorasib（AMG 510）、Adagrasib（MRTX849）、MRTX1133（泛KRAS）
时间线: 2-3周的细胞系验证

2. EGFR

必需性: 在EGFR突变/扩增的非小细胞肺癌、胶质母细胞瘤中具有选择性
验证方法:
- 在筛选细胞系中测试EGFR TKI面板（厄洛替尼、奥希替尼）
- 绘制剂量反应曲线以确定IC50
- 与标准化疗联合测试
预期结果: 在EGFR改变的细胞系中出现强效抑制
工具化合物: 厄洛替尼、吉非替尼、奥希替尼（均为FDA获批）
时间线: 1-2周

3. WEE1

必需性: 与TP53缺失具有合成致死性，在TP53突变型癌症中具有选择性
验证方法:
- 测试adavosertib（WEE1抑制剂）± DNA损伤剂
- 按TP53状态分层（突变型vs野生型）
- 细胞周期分析（提前进入有丝分裂）
预期结果: 选择性杀伤TP53突变型细胞 + 与化疗协同
工具化合物: Adavosertib（AZD1775）、PD-166285
时间线: 2-3周

Tier 2 Targets (AURKA, CDK2, CHEK1, PLK1)

2级靶点（AURKA、CDK2、CHEK1、PLK1）

General Strategy:

Pharmacological validation with 2-3 selective inhibitors per target
Orthogonal genetic validation (CRISPRi/shRNA)
Pathway analysis (Western blots for downstream effectors)
Combination screens with standard-of-care agents

Recommended Tool Compounds:

AURKA: Alisertib (MLN8237), Aurora A Inhibitor I
CDK2: Roscovitine (seliciclib), Dinaciclib
CHEK1: Prexasertib (LY2606368), AZD7762
PLK1: Volasertib (BI 6727), BI 2536

通用策略:

每个靶点使用2-3种选择性抑制剂进行药理学验证
正交遗传验证（CRISPRi/shRNA）
通路分析（下游效应因子的Western blot）
与标准治疗药物的联合筛选

推荐工具化合物:

AURKA: Alisertib（MLN8237）、Aurora A抑制剂I
CDK2: Roscovitine（seliciclib）、Dinaciclib
CHEK1: Prexasertib（LY2606368）、AZD7762
PLK1: Volasertib（BI 6727）、BI 2536

Tier 3 Targets - Alternative Validation Strategies

3级靶点 - 替代验证策略

E2F1, RB1 (Transcription factors):

Challenge: No direct small molecule inhibitors
Strategy:
- Test PROTACs if available
- Indirect validation via upstream targets (CDK4/6 inhibitors for RB pathway)
- Genetic validation only (CRISPRko, CRISPRi)

MCM2-7 Complex (Helicase, pan-essential):

Challenge: Pan-cancer essential, poor therapeutic window
Strategy: Deprioritize for drug development
Note: Interesting for understanding replication biology, but not ideal therapeutic target

E2F1、RB1（转录因子）:

挑战: 无直接小分子抑制剂
策略:
- 若有PROTAC则进行测试
- 通过上游靶点间接验证（如针对RB通路的CDK4/6抑制剂）
- 仅进行遗传验证（CRISPRko、CRISPRi）

MCM2-7复合物（解旋酶，泛必需）:

挑战: 泛癌必需，治疗窗口差
策略: 优先排除在药物开发之外
注意: 对理解复制生物学有意义，但不是理想的治疗靶点

Validation Timeline

验证时间线

Phase	Duration	Experiments	Deliverable
Phase 1	Weeks 1-3	Tier 1 target validation (KRAS, EGFR, WEE1)	Dose-response curves, potency data
Phase 2	Weeks 4-6	Tier 2 target validation (AURKA, CDK2, etc.)	Hit confirmation, selectivity data
Phase 3	Weeks 7-10	Mechanism studies, pathway analysis	Western blots, cell cycle, apoptosis
Phase 4	Weeks 11-14	Combination studies, in vivo pilot (top 2-3)	Synergy matrices, xenograft data

阶段	时长	实验	交付物
阶段1	第1-3周	1级靶点验证（KRAS、EGFR、WEE1）	剂量反应曲线、效力数据
阶段2	第4-6周	2级靶点验证（AURKA、CDK2等）	命中确认、选择性数据
阶段3	第7-10周	机制研究、通路分析	Western blot、细胞周期、凋亡数据
阶段4	第11-14周	联合研究、体内试点（前2-3个靶点）	协同矩阵、异种移植数据

Success Criteria for Validation

验证成功标准

✅ Hit Confirmed if:

Pharmacological inhibition phenocopies CRISPR knockout (≥50% growth inhibition at ≤1 µM)
Dose-response curve shows IC50 consistent with essentiality score
Effect is selective (active in screen cell line, inactive in control lines)
Orthogonal genetic methods (siRNA, CRISPRi) reproduce phenotype

❌ Hit Rejected/Deprioritized if:

Tool compounds show no effect despite strong CRISPR score (off-target CRISPR effect)
Pan-cancer essential with no selectivity (poor therapeutic window)
No druggable domain/strategy (TFs, scaffolds without chemical matter)
Cannot be validated with available reagents

✅ 命中确认 若：

药理学抑制模拟CRISPR敲除效果（≤1 µM时生长抑制≥50%）
剂量反应曲线的IC50与必需性评分一致
效应具有选择性（在筛选细胞系中有效，在对照细胞系中无效）
正交遗传方法（siRNA、CRISPRi）可重现表型

❌ 命中被拒绝/降优先级 若：

工具化合物无效果，尽管CRISPR评分很高（CRISPR脱靶效应）
泛癌必需且无选择性（治疗窗口差）
无可靶向结构域/策略（转录因子、无化学结合位点的支架蛋白）
无法用现有试剂验证

Resource Requirements

资源需求

Reagents:

Chemical compounds: $5-10K (tool compounds, commercial inhibitors)
CRISPRi/shRNA validation: $3-5K (vectors, reagents)
Cell culture & assays: $8-12K (plates, reagents, media)

Equipment:

Cell culture facility (BSL2)
Plate readers (viability assays, luminescence)
Flow cytometer (cell cycle, apoptosis)
Western blot equipment

Personnel: 1 postdoc + 1 technician for 3-4 months

Validation Strategy Summary: Focus validation efforts on Tier 1 targets (KRAS, EGFR, WEE1) with approved/late-stage drugs. These offer fastest path to clinical translation and have highest probability of success based on multi-dimensional scoring.

---

试剂:

化学化合物: 5-10千美元（工具化合物、商用抑制剂）
CRISPRi/shRNA验证: 3-5千美元（载体、试剂）
细胞培养与实验: 8-12千美元（培养板、试剂、培养基）

设备:

细胞培养设施（BSL2）
酶标仪（活力实验、发光检测）
流式细胞仪（细胞周期、凋亡）
Western blot设备

人员: 1名博士后 + 1名技术员，耗时3-4个月

验证策略总结: 集中验证资源在1级靶点（KRAS、EGFR、WEE1），这些靶点拥有获批/后期临床试验药物。基于多维度评分，这些靶点转化为临床应用的速度最快，成功概率最高。

---

Report Template (Initial File)

报告模板（初始文件）

File:

CRISPR_screen_analysis_[CONTEXT].md

markdown

undefined

文件:

CRISPR_screen_analysis_[CONTEXT].md

markdown

undefined

CRISPR Screen Analysis Report: [CONTEXT]

CRISPR筛选分析报告: [背景信息]

Generated: [Date] Input: [Gene list / Cancer type / Single gene] Context: [Screen type, cell line, experimental details] Status: In Progress

生成日期: [日期] 输入: [基因列表 / 癌症类型 / 单个基因] 背景: [筛选类型、细胞系、实验细节] 状态: 进行中

Executive Summary

执行摘要

[Analyzing...]

[分析中...]

Input Validation

输入验证

[Analyzing...]

[分析中...]

1. Gene Essentiality Analysis

1. 基因必需性分析

1.1 Strongly Essential Genes

1.1 强必需基因

[Analyzing...]

[分析中...]

1.2 Selectively Essential Genes

1.2 选择性必需基因

[Analyzing...]

[分析中...]

1.3 Weakly Essential / Non-Essential

1.3 弱必需 / 非必需基因

[Analyzing...]

[分析中...]

2. Pathway & Functional Enrichment

2. 通路与功能富集

2.1 Pathway Enrichment (Reactome, KEGG)

2.1 通路富集（Reactome、KEGG）

[Analyzing...]

[分析中...]

2.2 GO Enrichment (BP, MF, CC)

2.2 GO富集（生物过程、分子功能、细胞组分）

[Analyzing...]

[分析中...]

2.3 Hallmark Gene Sets (MSigDB)

2.3 特征基因集（MSigDB）

[Analyzing...]

[分析中...]

2.4 Pathway-Level Interpretation

2.4 通路层面解读

[Analyzing...]

[分析中...]

3. Protein Interaction Network

3. 蛋白质相互作用网络

3.1 Network Statistics

3.1 网络统计

[Analyzing...]

[分析中...]

3.2 Protein Complexes

3.2 蛋白质复合物

[Analyzing...]

[分析中...]

3.3 Hub Genes

3.3 枢纽基因

[Analyzing...]

[分析中...]

3.4 Synthetic Lethal Candidates

3.4 合成致死候选靶点

[Analyzing...]

[分析中...]

4. Druggability Assessment

4. 成药性评估

4.1 Target Development Level (Pharos)

4.1 靶点开发层级（Pharos）

[Analyzing...]

[分析中...]

4.2 Approved Drugs & Clinical Candidates

4.2 已获批药物与临床候选药物

[Analyzing...]

[分析中...]

4.3 Chemical Probes

4.3 化学探针

[Analyzing...]

[分析中...]

4.4 Non-Druggable Hits (Alternative Strategies)

4.4 非可靶向候选靶点（替代策略）

[Analyzing...]

[分析中...]

5. Clinical Relevance

5. 临床相关性

5.1 Cancer Genomic Alterations (COSMIC)

5.1 癌症基因组改变（COSMIC）

[Analyzing...]

[分析中...]

5.2 Expression in Tumor vs Normal

5.2 肿瘤vs正常组织中的表达

[Analyzing...]

[分析中...]

5.3 Prognostic/Predictive Biomarkers

5.3 预后/预测生物标志物

[Analyzing...]

[分析中...]

5.4 Mouse Models & Genetic Evidence

5.4 小鼠模型与遗传学证据

[Analyzing...]

[分析中...]

6. Hit Prioritization & Validation

6. 候选靶点优先级排序与验证

6.1 Multi-Dimensional Scoring

6.1 多维度评分

[Analyzing...]

[分析中...]

6.2 Top 10 Priority Targets

6.2 前10个优先级靶点

[Analyzing...]

[分析中...]

6.3 Validation Experiment Recommendations

6.3 验证实验建议

[Analyzing...]

[分析中...]

6.4 Validation Timeline & Resources

6.4 验证时间线与资源

[Analyzing...]

[分析中...]

7. Data Sources & Quality Control

7. 数据源与质量控制

7.1 Primary Data Sources

7.1 主要数据源

[Will be populated...]

[将填充...]

7.2 Tool Call Summary

7.2 工具调用汇总

[Will be populated...]

[将填充...]

7.3 Limitations & Caveats

7.3 局限性与注意事项

[Will be populated...]

[将填充...]

Appendix: Full Hit List

附录: 完整候选靶点列表

[Complete gene list with all scores...]

---

[带所有评分的完整基因列表...]

---

When NOT to Use This Skill

何时不使用本技能

Drug target research (single gene) → Use
```
tooluniverse-target-research
```
skill instead
Disease-centric query → Use
```
tooluniverse-disease-research
```
skill
Chemical compound screening → Different workflow needed
RNA-seq differential expression → Use differential expression analysis workflows
Single gene lookup → Call DepMap tools directly

Use this skill when you have a gene list from a CRISPR screen and need comprehensive functional interpretation + target prioritization.

药物靶点研究（单个基因）→ 改用
```
tooluniverse-target-research
```
技能
疾病导向查询 → 改用
```
tooluniverse-disease-research
```
技能
化学化合物筛选 → 需要不同的工作流
RNA-seq差异表达分析 → 使用差异表达分析工作流
单个基因查询 → 直接调用DepMap工具

当你拥有CRISPR筛选得到的基因列表，并需要进行全面的功能解读 + 靶点优先级排序时，使用本技能。

Example Queries That Trigger This Skill

触发本技能的示例查询

✅ Gene List Analysis:

"Analyze these CRISPR screen hits: EGFR, KRAS, WEE1, PLK1, AURKA, ..."
"I have 50 dropout genes from a CRISPR screen in lung cancer cells, what should I validate?"
"Prioritize these genes for drug target development: [gene list]"

✅ Cancer Type Query:

"What are the top essential genes in pancreatic cancer?"
"Find druggable dependencies in triple-negative breast cancer"

✅ Single Gene Validation:

"Is KRAS a good therapeutic target for lung cancer?"
"Assess the druggability of WEE1 as a cancer target"

❌ Not Appropriate:

"What is the function of EGFR?" → too broad, use target-research skill
"Find drugs for lung cancer" → disease-centric, use drug-repurposing skill
"Analyze this RNA-seq data" → different analytical workflow

✅ 基因列表分析:

"分析这些CRISPR筛选候选靶点：EGFR, KRAS, WEE1, PLK1, AURKA, ..."
"我有50个来自肺癌细胞CRISPR dropout筛选的基因，应该验证哪些？"
"为药物靶点开发对这些基因进行优先级排序：[基因列表]"

✅ 癌症类型查询:

"胰腺癌中的顶级必需基因是什么？"
"寻找三阴性乳腺癌中的可靶向依赖性"

✅ 单个基因验证:

"KRAS是肺癌的良好治疗靶点吗？"
"评估WEE1作为癌症靶点的成药性"

❌ 不适用场景:

"EGFR的功能是什么？" → 范围过广，使用target-research技能
"寻找肺癌的治疗药物" → 疾病导向，使用药物重定位技能
"分析这份RNA-seq数据" → 需要不同的分析工作流

Key Improvements from Existing Skills

与现有技能相比的关键改进

Based on patterns in

tooluniverse-target-research

and

tooluniverse-drug-research

Multi-dimensional scoring system (novel for CRISPR analysis)
Validation experiment recommendations with timelines and reagents
Tier-based prioritization (Tier 1/2/3 based on actionability)
Tool compound suggestions for each druggable target
Synthetic lethal candidate identification from PPI network
Explicit selectivity analysis (pan-cancer vs tissue-selective)
Success/failure criteria for validation experiments

基于

tooluniverse-target-research

和

tooluniverse-drug-research

的使用模式：

多维度评分系统（CRISPR分析的创新点）
带时间线和试剂的验证实验建议
基于可行动性的层级优先级排序（1/2/3级）
每个可靶向靶点的工具化合物建议
从PPI网络识别合成致死候选靶点
明确的选择性分析（泛癌vs组织选择性）
验证实验的成功/失败标准