tooluniverse-crispr-screen-analysis

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

CRISPR Screen Analysis Workflow

CRISPR筛选分析工作流

Systematic analysis of CRISPR knockout/activation/interference screens to identify essential genes, synthetic lethal interactions, and therapeutic targets.
KEY PRINCIPLES:
  1. Report-first approach - Create comprehensive analysis report FIRST, then populate progressively
  2. Evidence grading - Grade all findings by confidence level (H/M/L based on statistical significance and validation data)
  3. Multi-dimensional analysis - Integrate essentiality, pathway context, druggability, and clinical relevance
  4. Citation requirements - Every conclusion must trace to source data (DepMap, literature, pathways)
  5. Mandatory completeness - All analysis sections must exist with data or explicit "No data" notes
  6. Context-aware interpretation - Consider cell line context, screen type, and biological pathway redundancy

对CRISPR敲除/激活/干扰筛选进行系统性分析,以识别必需基因、合成致死相互作用及治疗靶点。
核心原则:
  1. 报告优先法 - 先创建完整的分析报告,再逐步填充内容
  2. 证据分级 - 所有发现按置信度分级(基于统计显著性和验证数据分为高/中/低,对应★★★/★★☆/★☆☆)
  3. 多维度分析 - 整合必需性、通路背景、成药性及临床相关性
  4. 引用要求 - 每个结论必须可追溯至源数据(DepMap、文献、通路数据库)
  5. 完整性强制要求 - 所有分析章节必须存在,无数据时需明确标注“无数据”
  6. 背景感知解读 - 考虑细胞系背景、筛选类型及生物通路冗余性

When to Use This Skill

何时使用本技能

Apply when users:
  • Have CRISPR screen hit lists (genes with significant phenotypes)
  • Need to prioritize CRISPR hits for validation
  • Want to identify essential genes for a specific cancer type
  • Need synthetic lethal interaction analysis
  • Ask "what are the top hits from my CRISPR screen?"
  • Need drug target prioritization from functional genomics data
  • Want pathway-level interpretation of screen results

当用户有以下需求时适用:
  • 拥有CRISPR筛选的候选基因列表(具有显著表型的基因)
  • 需要对CRISPR候选基因进行验证优先级排序
  • 想要识别特定癌症类型的必需基因
  • 需要进行合成致死相互作用分析
  • 提问“我的CRISPR筛选中排名靠前的候选靶点是什么?”
  • 需要从功能基因组数据中进行药物靶点优先级排序
  • 想要对筛选结果进行通路层面的解读

⚠️ Known Issues & Workarounds

⚠️ 已知问题与解决方案

DepMap API Unavailability (2026-02-09)

DepMap API不可用(2026-02-09)

Issue: DepMap REST APIs (Sanger Cell Model Passports and Broad Institute) are currently non-operational.
Impact:
  • PATH 0 (Gene Validation): DepMap gene registry unavailable
  • PATH 1 (Essentiality Analysis): CRISPR dependency scores unavailable
Workaround: This skill now uses Pharos as fallback:
  • Gene validation via
    Pharos_get_target()
  • Druggability assessment via TDL (Target Development Level) classification
  • TDL used as proxy for essentiality (Tclin targets are often essential)
  • Evidence grading: Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆
Data Quality Trade-off:
  • ✅ Gene validation: 100% success rate (Pharos has comprehensive drug target coverage)
  • ⚠️ Essentiality scores: Druggability-based proxy (TDL classification)
  • ℹ️ All findings labeled with source (Pharos vs DepMap)
Timeline: Permanent fix (CSV download) estimated 1-2 weeks. See
DEPMAP_ISSUE_ANALYSIS.md
for details.

问题: DepMap REST API(Sanger细胞模型数据库和Broad研究所)当前无法使用。
影响:
  • 路径0(基因验证):DepMap基因注册表不可用
  • 路径1(必需性分析):CRISPR依赖性评分不可用
解决方案: 本技能现在使用Pharos作为备选方案:
  • 通过
    Pharos_get_target()
    进行基因验证
  • 通过TDL(靶点开发层级)分类评估成药性
  • 将TDL用作必需性的替代指标(Tclin靶点通常为必需基因)
  • 证据分级:Tclin=★★★,Tchem=★★☆,Tbio/Tdark=★☆☆
数据质量权衡:
  • ✅ 基因验证:100%成功率(Pharos覆盖全面的药物靶点)
  • ⚠️ 必需性评分:基于成药性的替代指标(TDL分类)
  • ℹ️ 所有发现均标注数据源(Pharos或DepMap)
时间线: 永久修复方案(CSV下载)预计1-2周完成。详情见
DEPMAP_ISSUE_ANALYSIS.md

Input Types Supported

支持的输入类型

1. Gene List from User's Screen

1. 用户筛选得到的基因列表

  • Format: Gene symbols (e.g., EGFR, KRAS, TP53)
  • Minimum: 5 genes (for meaningful enrichment)
  • Optimal: 20-100 genes (hits from primary screen)
  • Context needed: Cancer type, screen type (dropout/enrichment), cell line used
  • 格式: 基因符号(如EGFR、KRAS、TP53)
  • 最低要求: 5个基因(以获得有意义的富集结果)
  • 最优: 20-100个基因(初筛得到的候选靶点)
  • 所需背景信息: 癌症类型、筛选类型(dropout/enrichment)、使用的细胞系

2. Cancer Type Query

2. 癌症类型查询

  • Format: Cancer type name (e.g., "non-small cell lung cancer", "breast cancer")
  • Workflow: Retrieve top essential genes for that cancer from DepMap
  • Output: Ranked target list with essentiality scores
  • 格式: 癌症类型名称(如“非小细胞肺癌”、“乳腺癌”)
  • 工作流: 从DepMap检索该癌症的顶级必需基因
  • 输出: 带必需性评分的靶点排名列表

3. Gene of Interest

3. 目标基因查询

  • Format: Single gene symbol
  • Workflow: Analyze essentiality across cancer types, identify selective dependencies
  • Output: Target validation report with tissue specificity

  • 格式: 单个基因符号
  • 工作流: 分析该基因在不同癌症类型中的必需性,识别选择性依赖性
  • 输出: 带组织特异性的靶点验证报告

Critical Workflow Requirements

关键工作流要求

1. Report-First Approach (MANDATORY)

1. 报告优先法(强制要求)

DO NOT show intermediate tool outputs. Instead:
  1. Create report file FIRST before any analysis:
    • File name:
      CRISPR_screen_analysis_[CONTEXT].md
    • Initialize with all section headers
    • Add placeholder:
      [Analyzing...]
      in each section
  2. Progressively update as data arrives:
    • Replace
      [Analyzing...]
      with findings
    • Include "No significant enrichment" when appropriate
    • Document failed analyses explicitly
  3. Final deliverable: Complete markdown report + optional plots (if user requests)
禁止展示中间工具输出,应遵循以下步骤:
  1. 先创建报告文件再进行任何分析:
    • 文件名:
      CRISPR_screen_analysis_[CONTEXT].md
    • 初始化所有章节标题
    • 在每个章节添加占位符:
      [分析中...]
  2. 随着数据获取逐步更新
    • 用发现结果替换
      [分析中...]
    • 适当时标注“无显著富集”
    • 明确记录失败的分析
  3. 最终交付物: 完整的markdown报告 + 可选的图表(若用户要求)

2. Evidence Grading System (MANDATORY)

2. 证据分级系统(强制要求)

Grade every finding by confidence level:
LevelSymbolCriteriaExamples
HIGH★★★DepMap score <-1.0, p<0.01, validated in literatureStrong essential gene, clinical drug target
MEDIUM★★☆DepMap score -0.5 to -1.0, p<0.05, pathway coherenceModerate dependency, pathway member
LOW★☆☆DepMap score >-0.5, marginal significance, weak validationWeak hit, potential off-target
为每个发现按置信度分级:
级别符号标准示例
★★★DepMap评分<-1.0,p<0.01,有文献验证强必需基因、临床药物靶点
★★☆DepMap评分-0.5至-1.0,p<0.05,通路一致性良好中度依赖性、通路成员
★☆☆DepMap评分>-0.5,显著性较弱,验证证据不足弱候选靶点、潜在脱靶效应

3. Contextualization Requirements

3. 背景关联要求

Every gene-level finding must include:
  • Essentiality score (DepMap gene effect)
  • Pan-cancer vs selective (is it essential in all cancers or specific subset?)
  • Druggability (existing drugs, chemical probes, tractability)
  • Pathway context (which pathways/complexes does it belong to?)
  • Clinical relevance (approved targets, ongoing trials, biomarkers)

每个基因层面的发现必须包含:
  • 必需性评分(DepMap基因效应值)
  • 泛癌vs选择性(是否在所有癌症中必需,还是仅在特定亚群中必需?)
  • 成药性(现有药物、化学探针、可靶向性)
  • 通路背景(属于哪些通路/复合物?)
  • 临床相关性(已获批靶点、正在进行的临床试验、生物标志物)

Core Analysis Strategy: 7 Research Paths

核心分析策略:7个研究路径

User Input (gene list OR cancer type OR single gene)
├─ PATH 0: Input Processing & Validation
│   ├─ Validate gene symbols
│   ├─ Determine analysis mode
│   └─ Set context parameters
├─ PATH 1: Gene Essentiality Analysis (DepMap)
│   ├─ Query gene dependencies for each hit
│   ├─ Retrieve essentiality scores across cell lines
│   ├─ Calculate pan-cancer vs selective essentiality
│   └─ Rank genes by dependency strength
├─ PATH 2: Pathway & Functional Enrichment
│   ├─ GO enrichment (biological process, molecular function)
│   ├─ Pathway enrichment (Reactome, WikiPathways, KEGG)
│   ├─ Hallmark gene set enrichment (MSigDB)
│   └─ Identify pathway-level vulnerabilities
├─ PATH 3: Protein-Protein Interaction Networks
│   ├─ Build PPI network for hit genes
│   ├─ Identify protein complexes
│   ├─ Find synthetic lethal candidates
│   └─ Hub gene analysis
├─ PATH 4: Druggability & Target Assessment
│   ├─ Check existing drugs (DGIdb, ChEMBL)
│   ├─ Assess chemical tractability (Pharos TDL)
│   ├─ Find chemical probes (Open Targets)
│   └─ Clinical trial status (ClinicalTrials.gov)
├─ PATH 5: Disease Association & Clinical Relevance
│   ├─ Gene-disease associations (Open Targets)
│   ├─ Somatic mutations in cancer (COSMIC, cBioPortal)
│   ├─ Expression in patient samples (GTEx, TCGA)
│   └─ Prognostic/predictive biomarker status
└─ PATH 6: Hit Prioritization & Validation Guidance
    ├─ Integrate all evidence dimensions
    ├─ Calculate priority score (essentiality + druggability + clinical relevance)
    ├─ Recommend validation experiments
    └─ Identify top 5-10 targets for follow-up

用户输入(基因列表或癌症类型或单个基因)
├─ 路径0: 输入处理与验证
│   ├─ 验证基因符号
│   ├─ 确定分析模式
│   └─ 设置背景参数
├─ 路径1: 基因必需性分析(DepMap)
│   ├─ 查询每个候选靶点的基因依赖性
│   ├─ 检索跨细胞系的必需性评分
│   ├─ 计算泛癌vs选择性必需性
│   └─ 按依赖性强度对基因排名
├─ 路径2: 通路与功能富集
│   ├─ GO富集(生物过程、分子功能)
│   ├─ 通路富集(Reactome、WikiPathways、KEGG)
│   ├─ 特征基因集富集(MSigDB Hallmark)
│   └─ 识别通路层面的脆弱性
├─ 路径3: 蛋白质-蛋白质相互作用网络
│   ├─ 为候选基因构建PPI网络
│   ├─ 识别蛋白质复合物
│   ├─ 寻找合成致死候选靶点
│   └─ 枢纽基因分析
├─ 路径4: 成药性与靶点评估
│   ├─ 检查现有药物(DGIdb、ChEMBL)
│   ├─ 评估化学可靶向性(Pharos TDL)
│   ├─ 寻找化学探针(Open Targets)
│   └─ 临床试验状态(ClinicalTrials.gov)
├─ 路径5: 疾病关联与临床相关性
│   ├─ 基因-疾病关联(Open Targets)
│   ├─ 癌症中的体细胞突变(COSMIC、cBioPortal)
│   ├─ 患者样本中的表达情况(GTEx、TCGA)
│   └─ 预后/预测生物标志物状态
└─ 路径6: 候选靶点优先级排序与验证指导
    ├─ 整合所有证据维度
    ├─ 计算优先级评分(必需性+成药性+临床相关性)
    ├─ 推荐验证实验
    └─ 确定前5-10个待跟进的靶点

PATH 0: Input Processing & Validation

路径0: 输入处理与验证

Determine Analysis Mode

确定分析模式

python
def determine_analysis_mode(user_input):
    """
    Figure out what type of analysis to run.

    Returns: 'gene_list', 'cancer_type', or 'single_gene'
    """
    if isinstance(user_input, list) and len(user_input) >= 5:
        return 'gene_list'  # User provided hits from their screen
    elif isinstance(user_input, str) and len(user_input.split()) > 1:
        return 'cancer_type'  # User asks about a cancer type
    else:
        return 'single_gene'  # Single gene target validation
python
def determine_analysis_mode(user_input):
    """
    Figure out what type of analysis to run.

    Returns: 'gene_list', 'cancer_type', or 'single_gene'
    """
    if isinstance(user_input, list) and len(user_input) >= 5:
        return 'gene_list'  # User provided hits from their screen
    elif isinstance(user_input, str) and len(user_input.split()) > 1:
        return 'cancer_type'  # User asks about a cancer type
    else:
        return 'single_gene'  # Single gene target validation

Gene Symbol Validation

基因符号验证

CRITICAL: Validate gene symbols with fallback to Open Targets if DepMap unavailable.
python
def validate_gene_symbols(tu, gene_list):
    """
    Validate gene symbols with DepMap fallback to Open Targets.

    Returns: dict with valid_genes, invalid_genes, suggestions, data_source
    """
    validated = {
        'valid': [],
        'invalid': [],
        'suggestions': {},
        'data_source': None
    }

    # Try DepMap first
    depmap_available = False
    test_result = tu.tools.DepMap_search_genes(query="KRAS")
    if (test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')):
        depmap_available = True
        validated['data_source'] = 'DepMap (primary)'

    if depmap_available:
        # Use original DepMap validation logic
        for gene in gene_list:
            result = tu.tools.DepMap_search_genes(query=gene)
            if result.get('status') == 'success':
                genes = result.get('data', {}).get('genes', [])
                exact_matches = [g for g in genes
                               if g.get('symbol', '').upper() == gene.upper()]

                if exact_matches:
                    validated['valid'].append({
                        'input': gene,
                        'symbol': exact_matches[0]['symbol'],
                        'ensembl_id': exact_matches[0].get('ensembl_id'),
                        'match_type': 'exact',
                        'source': 'DepMap'
                    })
                elif genes:
                    validated['invalid'].append(gene)
                    validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
                else:
                    validated['invalid'].append(gene)
    else:
        # FALLBACK: Use Pharos (druggability database)
        print("⚠️  DepMap unavailable, using Pharos for gene validation...")
        validated['data_source'] = 'Pharos (fallback - ★★☆)'

        for gene in gene_list:
            # Query Pharos to check if gene exists
            result = tu.tools.Pharos_get_target(gene=gene)

            if result.get('status') == 'success' and result.get('data'):
                target_data = result.get('data', {})
                validated['valid'].append({
                    'input': gene,
                    'symbol': target_data.get('name', gene),
                    'tdl': target_data.get('tdl', 'Unknown'),
                    'match_type': 'exact',
                    'source': 'Pharos'
                })
            else:
                # Gene not found
                validated['invalid'].append(gene)

    return validated
Output for Report:
markdown
undefined
关键要求: 验证基因符号,若DepMap不可用则使用Open Targets作为备选。
python
def validate_gene_symbols(tu, gene_list):
    """
    Validate gene symbols with DepMap fallback to Open Targets.

    Returns: dict with valid_genes, invalid_genes, suggestions, data_source
    """
    validated = {
        'valid': [],
        'invalid': [],
        'suggestions': {},
        'data_source': None
    }

    # Try DepMap first
    depmap_available = False
    test_result = tu.tools.DepMap_search_genes(query="KRAS")
    if (test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')):
        depmap_available = True
        validated['data_source'] = 'DepMap (primary)'

    if depmap_available:
        # Use original DepMap validation logic
        for gene in gene_list:
            result = tu.tools.DepMap_search_genes(query=gene)
            if result.get('status') == 'success':
                genes = result.get('data', {}).get('genes', [])
                exact_matches = [g for g in genes
                               if g.get('symbol', '').upper() == gene.upper()]

                if exact_matches:
                    validated['valid'].append({
                        'input': gene,
                        'symbol': exact_matches[0]['symbol'],
                        'ensembl_id': exact_matches[0].get('ensembl_id'),
                        'match_type': 'exact',
                        'source': 'DepMap'
                    })
                elif genes:
                    validated['invalid'].append(gene)
                    validated['suggestions'][gene] = [g['symbol'] for g in genes[:3]]
                else:
                    validated['invalid'].append(gene)
    else:
        # FALLBACK: Use Pharos (druggability database)
        print("⚠️  DepMap unavailable, using Pharos for gene validation...")
        validated['data_source'] = 'Pharos (fallback - ★★☆)'

        for gene in gene_list:
            # Query Pharos to check if gene exists
            result = tu.tools.Pharos_get_target(gene=gene)

            if result.get('status') == 'success' and result.get('data'):
                target_data = result.get('data', {})
                validated['valid'].append({
                    'input': gene,
                    'symbol': target_data.get('name', gene),
                    'tdl': target_data.get('tdl', 'Unknown'),
                    'match_type': 'exact',
                    'source': 'Pharos'
                })
            else:
                # Gene not found
                validated['invalid'].append(gene)

    return validated
报告输出示例:
markdown
undefined

Input Validation

输入验证

Genes Provided: 25 gene symbols Valid Genes: 23 (92%) Invalid/Ambiguous: 2 Data Source: {data_source from validated dict}
Invalid Genes:
  • EGFRVIII
    → Gene symbol not recognized (mutation-specific identifier)
  • P53
    → Did you mean
    TP53
    ? (use official gene symbol)
Proceeding with 23 valid gene symbols for analysis.
Source: {DepMap gene registry OR Pharos (fallback)}

Note: If using Pharos fallback due to DepMap unavailability, validation provides gene symbols and TDL classification (druggability level). Validation is ★★☆ reliable.

---
提供的基因: 25个基因符号 有效基因: 23个(92%) 无效/模糊基因: 2个 数据源: {validated字典中的data_source}
无效基因:
  • EGFRVIII
    → 基因符号未被识别(突变特异性标识符)
  • P53
    → 是否指
    TP53
    ?(请使用官方基因符号)
将基于23个有效基因符号进行分析
来源: {DepMap基因注册表或Pharos(备选)}

注意: 若因DepMap不可用而使用Pharos备选方案,验证将提供基因符号和TDL分类(成药层级)。验证置信度为★★☆。

---

PATH 1: Gene Essentiality Analysis (DepMap with Open Targets Fallback)

路径1: 基因必需性分析(DepMap + Open Targets备选)

Retrieve Essentiality Scores

检索必需性评分

python
def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
    """
    Get gene essentiality data with DepMap fallback to Open Targets.

    DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
    Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
    """
    essentiality_data = []

    # Check if DepMap is available
    test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
    depmap_available = (
        test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')
    )

    if depmap_available:
        # Use original DepMap logic (optimal - ★★★)
        for gene in gene_list:
            dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
            if dep_result.get('status') == 'success':
                gene_data = dep_result.get('data', {})
                essentiality_data.append({
                    'gene': gene,
                    'data': gene_data,
                    'essentiality_class': classify_essentiality_depmap(gene_data),
                    'source': 'DepMap',
                    'confidence': 'HIGH'  # ★★★
                })
    else:
        # FALLBACK: Use Pharos TDL classification as proxy
        print("⚠️  DepMap unavailable, using Pharos TDL as essentiality proxy...")

        for gene in gene_list:
            pharos_result = tu.tools.Pharos_get_target(gene=gene)

            if pharos_result.get('status') == 'success' and pharos_result.get('data'):
                target_data = pharos_result.get('data', {})

                # Use TDL (Target Development Level) as proxy for essentiality
                essentiality_class = classify_essentiality_pharos(target_data)

                essentiality_data.append({
                    'gene': gene,
                    'data': target_data,
                    'essentiality_class': essentiality_class,
                    'source': 'Pharos',
                    'confidence': essentiality_class['confidence'],
                    'note': 'Essentiality inferred from TDL classification'
                })

    return essentiality_data


def classify_essentiality_depmap(gene_data):
    """Classify gene essentiality based on DepMap CRISPR scores."""
    # Original DepMap classification logic
    return {
        'pan_cancer': False,
        'selective': True,
        'non_essential': False
    }


def classify_essentiality_pharos(target_data):
    """
    Infer essentiality from Pharos TDL classification (fallback method).

    TDL (Target Development Level) categories:
    - Tclin: Clinical drug target (approved drugs) → Likely essential/important
    - Tchem: Chemical tool/probe available → Druggable, possibly essential
    - Tbio: Biological evidence → Some relevance
    - Tdark: No drug/tool → Unknown essentiality
    """
    tdl = target_data.get('tdl', 'Unknown')

    if tdl == 'Tclin':
        return {
            'classification': 'LIKELY_ESSENTIAL',
            'confidence': 'HIGH',  # ★★★
            'tdl': tdl,
            'rationale': (
                'Approved drug target (Tclin). '
                'Clinically validated targets are often essential. '
                'For cell-line-specific scores, await DepMap restoration.'
            )
        }
    elif tdl == 'Tchem':
        return {
            'classification': 'POTENTIALLY_ESSENTIAL',
            'confidence': 'MEDIUM',  # ★★☆
            'tdl': tdl,
            'rationale': (
                'Chemical tools available (Tchem). '
                'Druggable targets with chemical probes often have functional relevance.'
            )
        }
    elif tdl == 'Tbio':
        return {
            'classification': 'UNCERTAIN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Biological evidence only (Tbio). '
                'Limited druggability data. Essentiality unclear.'
            )
        }
    else:  # Tdark or Unknown
        return {
            'classification': 'UNKNOWN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Dark target or unknown. '
                'No drug/tool data. Essentiality cannot be inferred.'
            )
        }
python
def analyze_gene_essentiality(tu, gene_list, cancer_type=None):
    """
    Get gene essentiality data with DepMap fallback to Open Targets.

    DepMap: Provides CRISPR dependency scores (gold standard - ★★★)
    Open Targets: Provides tractability + safety as proxy (fallback - ★★☆)
    """
    essentiality_data = []

    # Check if DepMap is available
    test_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol="KRAS")
    depmap_available = (
        test_result.get('status') == 'success' and
        not test_result.get('error', '').startswith('DepMap API')
    )

    if depmap_available:
        # Use original DepMap logic (optimal - ★★★)
        for gene in gene_list:
            dep_result = tu.tools.DepMap_get_gene_dependencies(gene_symbol=gene)
            if dep_result.get('status') == 'success':
                gene_data = dep_result.get('data', {})
                essentiality_data.append({
                    'gene': gene,
                    'data': gene_data,
                    'essentiality_class': classify_essentiality_depmap(gene_data),
                    'source': 'DepMap',
                    'confidence': 'HIGH'  # ★★★
                })
    else:
        # FALLBACK: Use Pharos TDL classification as proxy
        print("⚠️  DepMap unavailable, using Pharos TDL as essentiality proxy...")

        for gene in gene_list:
            pharos_result = tu.tools.Pharos_get_target(gene=gene)

            if pharos_result.get('status') == 'success' and pharos_result.get('data'):
                target_data = pharos_result.get('data', {})

                # Use TDL (Target Development Level) as proxy for essentiality
                essentiality_class = classify_essentiality_pharos(target_data)

                essentiality_data.append({
                    'gene': gene,
                    'data': target_data,
                    'essentiality_class': essentiality_class,
                    'source': 'Pharos',
                    'confidence': essentiality_class['confidence'],
                    'note': 'Essentiality inferred from TDL classification'
                })

    return essentiality_data


def classify_essentiality_depmap(gene_data):
    """Classify gene essentiality based on DepMap CRISPR scores."""
    # Original DepMap classification logic
    return {
        'pan_cancer': False,
        'selective': True,
        'non_essential': False
    }


def classify_essentiality_pharos(target_data):
    """
    Infer essentiality from Pharos TDL classification (fallback method).

    TDL (Target Development Level) categories:
    - Tclin: Clinical drug target (approved drugs) → Likely essential/important
    - Tchem: Chemical tool/probe available → Druggable, possibly essential
    - Tbio: Biological evidence → Some relevance
    - Tdark: No drug/tool → Unknown essentiality
    """
    tdl = target_data.get('tdl', 'Unknown')

    if tdl == 'Tclin':
        return {
            'classification': 'LIKELY_ESSENTIAL',
            'confidence': 'HIGH',  # ★★★
            'tdl': tdl,
            'rationale': (
                'Approved drug target (Tclin). '
                'Clinically validated targets are often essential. '
                'For cell-line-specific scores, await DepMap restoration.'
            )
        }
    elif tdl == 'Tchem':
        return {
            'classification': 'POTENTIALLY_ESSENTIAL',
            'confidence': 'MEDIUM',  # ★★☆
            'tdl': tdl,
            'rationale': (
                'Chemical tools available (Tchem). '
                'Druggable targets with chemical probes often have functional relevance.'
            )
        }
    elif tdl == 'Tbio':
        return {
            'classification': 'UNCERTAIN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Biological evidence only (Tbio). '
                'Limited druggability data. Essentiality unclear.'
            )
        }
    else:  # Tdark or Unknown
        return {
            'classification': 'UNKNOWN',
            'confidence': 'LOW',  # ★☆☆
            'tdl': tdl,
            'rationale': (
                'Dark target or unknown. '
                'No drug/tool data. Essentiality cannot be inferred.'
            )
        }

Cancer Type-Specific Analysis

癌症类型特异性分析

For cancer type queries, retrieve top essential genes:
python
def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
    """
    Retrieve top essential genes for a specific cancer type from DepMap.
    """
    # Get cell lines for this cancer type
    cell_lines = tu.tools.DepMap_get_cell_lines(
        cancer_type=cancer_type,
        page_size=100
    )

    if not cell_lines.get('data', {}).get('cell_lines'):
        return {'error': f'No cell lines found for {cancer_type}'}

    # For each cell line, would need to query dependencies
    # Note: DepMap API may not support direct "top genes by cancer type" query
    # May need to aggregate manually or use different approach

    return {
        'cancer_type': cancer_type,
        'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
        'note': 'Full analysis requires per-cell-line dependency data aggregation'
    }
Output for Report (when DepMap available):
markdown
undefined
针对癌症类型查询,检索顶级必需基因:
python
def get_top_essential_genes_for_cancer(tu, cancer_type, top_n=50):
    """
    Retrieve top essential genes for a specific cancer type from DepMap.
    """
    # Get cell lines for this cancer type
    cell_lines = tu.tools.DepMap_get_cell_lines(
        cancer_type=cancer_type,
        page_size=100
    )

    if not cell_lines.get('data', {}).get('cell_lines'):
        return {'error': f'No cell lines found for {cancer_type}'}

    # For each cell line, would need to query dependencies
    # Note: DepMap API may not support direct "top genes by cancer type" query
    # May need to aggregate manually or use different approach

    return {
        'cancer_type': cancer_type,
        'cell_lines': cell_lines.get('data', {}).get('cell_lines', []),
        'note': 'Full analysis requires per-cell-line dependency data aggregation'
    }
DepMap可用时的报告输出:
markdown
undefined

1. Gene Essentiality Analysis

1. 基因必需性分析

Data Source: DepMap CRISPR (24Q2) ✅ Confidence: ★★★ HIGH
数据源: DepMap CRISPR (24Q2) ✅ 置信度: ★★★ 高

Strongly Essential Genes (DepMap Score < -1.0)

强必需基因(DepMap评分 < -1.0)

GeneMean EffectEssential Cell Lines (%)SelectivityEvidence
RPL5-1.4598% (1,042/1,063)Pan-cancer★★★
RPS6-1.3296% (1,019/1,063)Pan-cancer★★★
POLR2A-1.2895% (1,010/1,063)Pan-cancer★★★
Interpretation: These genes are essential for cell survival across nearly all cancer types. They are core fitness genes (ribosomal proteins, RNA polymerase) and likely not selective therapeutic targets.
Source: DepMap via
DepMap_get_gene_dependencies

**Output for Report (when using Pharos fallback)**:
```markdown
基因平均效应值必需细胞系占比选择性证据
RPL5-1.4598% (1,042/1,063)泛癌★★★
RPS6-1.3296% (1,019/1,063)泛癌★★★
POLR2A-1.2895% (1,010/1,063)泛癌★★★
解读: 这些基因在几乎所有癌症类型中对细胞存活都是必需的。它们是核心 fitness 基因(核糖体蛋白、RNA聚合酶),可能不适合作为选择性治疗靶点。
来源: DepMap via
DepMap_get_gene_dependencies

**使用Pharos备选方案时的报告输出**:
```markdown

1. Gene Essentiality Analysis

1. 基因必需性分析

⚠️ Data Source: Pharos (DepMap CRISPR temporarily unavailable) Analysis Method: Essentiality inferred from TDL (Target Development Level) classification Confidence: Varies by TDL (Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆)
⚠️ 数据源: Pharos(DepMap CRISPR暂时不可用) 分析方法: 基于TDL(靶点开发层级)分类推断必需性 置信度: 随TDL而异(Tclin=★★★, Tchem=★★☆, Tbio/Tdark=★☆☆)

Clinically Validated Targets (Tclin - Likely Essential)

临床验证靶点(Tclin - 可能必需)

GeneTDLClinical StatusInferenceEvidence
KRASTclinApproved drugs (sotorasib, adagrasib)Likely essential in KRAS-mutant cancers★★★
EGFRTclinMultiple approved inhibitorsLikely essential in EGFR-mutant cancers★★★
Interpretation: Tclin targets have approved drugs, indicating clinical validation. These are likely essential in specific contexts (mutation-dependent).
基因TDL临床状态推断结果证据
KRASTclin已获批药物(sotorasib、adagrasib)在KRAS突变型癌症中可能必需★★★
EGFRTclin多种获批抑制剂在EGFR突变型癌症中可能必需★★★
解读: Tclin靶点拥有获批药物,表明已通过临床验证。这些靶点在特定背景下(依赖突变)可能是必需的。

Chemical Probe Available (Tchem - Potentially Essential)

有化学探针可用的靶点(Tchem - 潜在必需)

GeneTDLTool StatusInferenceEvidence
CDK2TchemChemical probes availablePotentially essential (cell cycle)★★☆
WEE1TchemChemical inhibitors availablePotentially essential (DNA damage)★★☆
Interpretation: Tchem targets are druggable with chemical tools. Druggability suggests functional importance.
Note: TDL classification is a proxy for essentiality. For definitive CRISPR dependency scores, DepMap data required.
Source: Pharos via
Pharos_get_target
(fallback method)
基因TDL工具状态推断结果证据
CDK2Tchem有化学探针可用潜在必需(细胞周期相关)★★☆
WEE1Tchem有化学抑制剂可用潜在必需(DNA损伤相关)★★☆
解读: Tchem靶点可被化学工具靶向。成药性表明其具有功能相关性。
注意: TDL分类是必需性的替代指标。如需明确的CRISPR依赖性评分,需等待DepMap数据恢复。
来源: Pharos via
Pharos_get_target
(备选方法)

Selectively Essential Genes (Tissue/Context-Specific)

选择性必需基因(组织/背景特异性)

GeneMean EffectEssential inNon-Essential inSelectivity ScoreEvidence
KRAS-0.85Pancreatic (95%), Lung (78%), Colon (82%)Breast (12%), Glioma (8%)High★★★
EGFR-0.72Lung (85%), Glioblastoma (76%)Most others (<20%)High★★★
ESR1-0.68ER+ Breast (92%)ER- Breast (5%), Other (<3%)Very High★★★
Interpretation: Selectively essential genes show strong context-dependency and represent high-value therapeutic targets with potential for tissue-selective toxicity profiles.
Source: DepMap via
DepMap_get_gene_dependencies
基因平均效应值在以下癌症中必需在以下癌症中非必需选择性评分证据
KRAS-0.85胰腺癌(95%)、肺癌(78%)、结肠癌(82%)乳腺癌(12%)、胶质瘤(8%)★★★
EGFR-0.72肺癌(85%)、胶质母细胞瘤(76%)大多数其他癌症(<20%)★★★
ESR1-0.68ER+乳腺癌(92%)ER-乳腺癌(5%)、其他(<3%)极高★★★
解读: 选择性必需基因表现出强背景依赖性,是具有组织选择性毒性潜力的高价值治疗靶点。
来源: DepMap via
DepMap_get_gene_dependencies

Non-Essential/Weak Hits (Score > -0.5)

弱必需/非必需候选靶点(评分 > -0.5)

GeneMean Effect% EssentialInterpretation
GENE1-0.2515%Weak dependency, potential off-target or passenger
GENE2-0.128%Non-essential in most contexts
Note: These genes may still be biologically relevant (e.g., synthetic lethal interactions, drug targets for specific contexts) but show weak essentiality in CRISPR screens.

Essentiality Summary:
  • Pan-cancer essential: 12 genes (↓ deprioritize for selective targeting)
  • Selectively essential: 18 genes (★ HIGH PRIORITY for validation)
  • Weakly essential: 15 genes (context-dependent, requires further investigation)
All essentiality data from DepMap Portal (DepMap Public 24Q2 release)

---
基因平均效应值必需占比解读
GENE1-0.2515%弱依赖性,潜在脱靶或乘客基因
GENE2-0.128%在大多数背景下非必需
注意: 这些基因仍可能具有生物学相关性(如合成致死相互作用、特定背景下的药物靶点),但在CRISPR筛选中显示出弱必需性。

必需性总结:
  • 泛癌必需: 12个基因(↓ 不优先作为选择性靶点)
  • 选择性必需: 18个基因(★ 验证高优先级)
  • 弱必需: 15个基因(背景依赖性,需进一步研究)
所有必需性数据来自DepMap Portal(DepMap Public 24Q2版本)

---

PATH 2: Pathway & Functional Enrichment

路径2: 通路与功能富集

Gene Set Enrichment Analysis

基因集富集分析

python
def perform_pathway_enrichment(tu, gene_list):
    """
    Run enrichment analysis across multiple libraries.
    """
    # Enrichr libraries to query
    libraries = [
        "WikiPathways_2024_Human",
        "Reactome_Pathways_2024",
        "MSigDB_Hallmark_2020",
        "GO_Biological_Process_2023",
        "GO_Molecular_Function_2023",
        "GO_Cellular_Component_2023",
        "KEGG_2024_Human"
    ]

    result = tu.tools.enrichr_gene_enrichment_analysis(
        gene_list=gene_list,
        libs=libraries
    )

    # Parse results - Enrichr returns pathway rankings with p-values
    return result
Output for Report:
markdown
undefined
python
def perform_pathway_enrichment(tu, gene_list):
    """
    Run enrichment analysis across multiple libraries.
    """
    # Enrichr libraries to query
    libraries = [
        "WikiPathways_2024_Human",
        "Reactome_Pathways_2024",
        "MSigDB_Hallmark_2020",
        "GO_Biological_Process_2023",
        "GO_Molecular_Function_2023",
        "GO_Cellular_Component_2023",
        "KEGG_2024_Human"
    ]

    result = tu.tools.enrichr_gene_enrichment_analysis(
        gene_list=gene_list,
        libs=libraries
    )

    # Parse results - Enrichr returns pathway rankings with p-values
    return result
报告输出:
markdown
undefined

2. Pathway & Functional Enrichment

2. 通路与功能富集

Top Enriched Pathways (p < 0.01, FDR < 0.05)

顶级富集通路(p < 0.01, FDR < 0.05)

Reactome Pathways
Reactome通路
PathwayGenesp-valueFDROdds RatioEvidence
Cell Cycle Checkpoints12/181.2e-83.4e-615.3★★★
DNA Replication8/183.5e-64.2e-412.1★★★
G1/S Transition7/185.1e-52.1e-39.8★★☆
Genes in pathway: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2
Interpretation: Strong enrichment in cell cycle control pathways suggests the screen identified proliferation-essential genes. These represent core cell cycle machinery.
通路基因数p值FDR优势比证据
细胞周期检查点12/181.2e-83.4e-615.3★★★
DNA复制8/183.5e-64.2e-412.1★★★
G1/S转换7/185.1e-52.1e-39.8★★☆
通路中的基因: CCNE1, CDK2, RB1, E2F1, CDC25A, CDC6, ORC1, MCM2
解读: 细胞周期调控通路的强富集表明,筛选识别出了增殖必需基因。这些基因代表核心细胞周期机制。
GO Biological Process
GO生物过程
TermGenesp-valueFDREvidence
DNA replication initiation6/182.1e-71.5e-5★★★
G1/S transition of mitotic cell cycle8/188.3e-73.2e-5★★★
regulation of cyclin-dependent protein kinase activity5/181.2e-48.9e-3★★☆
术语基因数p值FDR证据
DNA复制起始6/182.1e-71.5e-5★★★
有丝分裂细胞周期的G1/S转换8/188.3e-73.2e-5★★★
细胞周期蛋白依赖性激酶活性调控5/181.2e-48.9e-3★★☆
MSigDB Hallmark Gene Sets
MSigDB特征基因集
HallmarkGenesp-valueFDREvidence
E2F Targets10/186.7e-101.2e-8★★★
G2M Checkpoint9/183.4e-82.1e-6★★★
MYC Targets V17/182.1e-59.8e-4★★☆
Key Finding: Hits converge on E2F/RB pathway, suggesting screen successfully identified proliferation machinery. This is expected for dropout screens in proliferating cancer cells.
Source: Enrichr via
enrichr_gene_enrichment_analysis
特征集基因数p值FDR证据
E2F靶点10/186.7e-101.2e-8★★★
G2M检查点9/183.4e-82.1e-6★★★
MYC靶点V17/182.1e-59.8e-4★★☆
关键发现: 候选靶点集中在E2F/RB通路,表明筛选成功识别出增殖机制相关基因。这在增殖性癌细胞的dropout筛选中符合预期。
来源: Enrichr via
enrichr_gene_enrichment_analysis

No Significant Enrichment

无显著富集

GO Molecular Function: No terms pass FDR < 0.05 KEGG Pathways: Marginal enrichment (p < 0.05) but does not survive multiple testing correction
Interpretation: Gene list may be heterogeneous or represent diverse biological processes. Consider sub-clustering analysis.

---
GO分子功能: 无术语通过FDR < 0.05的阈值 KEGG通路: 存在边缘富集(p < 0.05)但未通过多重检验校正
解读: 基因列表可能具有异质性,或代表多种生物学过程。建议考虑亚聚类分析。

---

PATH 3: Protein-Protein Interaction Networks

路径3: 蛋白质-蛋白质相互作用网络

Build PPI Network

构建PPI网络

python
def build_ppi_network(tu, gene_list):
    """
    Construct protein interaction network for hit genes.
    """
    # Use STRING for comprehensive PPI data
    ppi_result = tu.tools.STRING_get_protein_interactions(
        protein_ids=gene_list,
        species=9606  # Human
    )

    # Also check IntAct for curated interactions
    interactions = []
    for gene in gene_list:
        # Get UniProt ID first
        uniprot = resolve_gene_to_uniprot(tu, gene)
        if uniprot:
            intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
            interactions.append(intact_result)

    return {
        'string': ppi_result,
        'intact': interactions
    }

def identify_protein_complexes(ppi_data):
    """
    Identify protein complexes from PPI network.

    Could use complex detection algorithms or query Complex Portal.
    """
    # Implementation for complex detection
    pass
Output for Report:
markdown
undefined
python
def build_ppi_network(tu, gene_list):
    """
    Construct protein interaction network for hit genes.
    """
    # Use STRING for comprehensive PPI data
    ppi_result = tu.tools.STRING_get_protein_interactions(
        protein_ids=gene_list,
        species=9606  # Human
    )

    # Also check IntAct for curated interactions
    interactions = []
    for gene in gene_list:
        # Get UniProt ID first
        uniprot = resolve_gene_to_uniprot(tu, gene)
        if uniprot:
            intact_result = tu.tools.intact_get_interactions(identifier=uniprot)
            interactions.append(intact_result)

    return {
        'string': ppi_result,
        'intact': interactions
    }

def identify_protein_complexes(ppi_data):
    """
    Identify protein complexes from PPI network.

    Could use complex detection algorithms or query Complex Portal.
    """
    # Implementation for complex detection
    pass
报告输出:
markdown
undefined

3. Protein Interaction Network Analysis

3. 蛋白质相互作用网络分析

Network Statistics

网络统计

  • Nodes: 45 proteins (from 45 input genes)
  • Edges: 128 interactions (STRING combined score > 0.4)
  • Network Density: 0.063
  • Average Clustering Coefficient: 0.45
  • Hub Genes (>10 interactions): CDK2, RB1, E2F1, CCNE1
Interpretation: High clustering coefficient indicates genes are functionally related and form coherent protein complexes.
Source: STRING via
STRING_get_protein_interactions
  • 节点: 45个蛋白质(来自45个输入基因)
  • : 128个相互作用(STRING综合评分 > 0.4)
  • 网络密度: 0.063
  • 平均聚类系数: 0.45
  • 枢纽基因(>10个相互作用): CDK2, RB1, E2F1, CCNE1
解读: 高聚类系数表明这些基因功能相关,形成连贯的蛋白质复合物。
来源: STRING via
STRING_get_protein_interactions

Protein Complexes Identified

识别出的蛋白质复合物

ComplexMembersFunctionEssential?
MCM ComplexMCM2, MCM3, MCM4, MCM5, MCM6, MCM7DNA replication helicaseYes (pan-cancer)
Cyclin E-CDK2CCNE1, CCNE2, CDK2G1/S transition kinaseYes (selective)
E2F/DP/RBE2F1, E2F2, E2F3, RB1, TFDP1Transcription regulationYes (context-dependent)
Key Finding: Screen hit multiple members of the same essential complexes. This provides validation (independent hits in same pathway) and suggests complex-level vulnerability.
Source: Complex Portal annotations + STRING clustering
复合物成员功能是否必需?
MCM复合物MCM2, MCM3, MCM4, MCM5, MCM6, MCM7DNA复制解旋酶是(泛癌)
Cyclin E-CDK2CCNE1, CCNE2, CDK2G1/S转换激酶是(选择性)
E2F/DP/RBE2F1, E2F2, E2F3, RB1, TFDP1转录调控是(背景依赖性)
关键发现: 筛选命中了同一必需复合物的多个成员。这为结果提供了验证(同一通路中的独立命中),表明复合物层面存在脆弱性。
来源: Complex Portal注释 + STRING聚类

Synthetic Lethal Candidates

合成致死候选靶点

Based on PPI network and literature:
Gene A (Hit)Gene B (Candidate)RelationshipEvidenceSource
RB1ARID1ASynthetic lethal★★☆PMID:29534788
KRASSTK11Synthetic lethal★★★PMID:31010833
Recommendation: Test synthetic lethal candidates (Gene B) for combination therapy with inhibitors of Gene A.

---
基于PPI网络和文献:
候选靶点A(命中基因)候选靶点B(待验证)关系证据来源
RB1ARID1A合成致死★★☆PMID:29534788
KRASSTK11合成致死★★★PMID:31010833
建议: 测试合成致死候选靶点(基因B)与基因A抑制剂的联合治疗效果。

---

PATH 4: Druggability & Target Assessment

路径4: 成药性与靶点评估

Assess Drug Target Potential

评估药物靶点潜力

python
def assess_druggability(tu, gene_list):
    """
    Evaluate druggability of hit genes.
    """
    drug_targets = []

    for gene in gene_list:
        # Check Pharos for target development level
        pharos = tu.tools.Pharos_get_target(gene=gene)

        # Check DGIdb for existing drugs
        dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])

        # Check Open Targets for chemical probes
        ensembl_id = resolve_gene_to_ensembl(tu, gene)
        if ensembl_id:
            probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

        # Check clinical trials
        trials = tu.tools.search_clinical_trials(
            intervention=gene,
            pageSize=20
        )

        drug_targets.append({
            'gene': gene,
            'pharos_tdl': pharos.get('data', {}).get('tdl'),
            'existing_drugs': dgidb,
            'chemical_probes': probes,
            'clinical_trials': trials
        })

    return drug_targets
Output for Report:
markdown
undefined
python
def assess_druggability(tu, gene_list):
    """
    Evaluate druggability of hit genes.
    """
    drug_targets = []

    for gene in gene_list:
        # Check Pharos for target development level
        pharos = tu.tools.Pharos_get_target(gene=gene)

        # Check DGIdb for existing drugs
        dgidb = tu.tools.DGIdb_get_drug_gene_interactions(genes=[gene])

        # Check Open Targets for chemical probes
        ensembl_id = resolve_gene_to_ensembl(tu, gene)
        if ensembl_id:
            probes = tu.tools.OpenTargets_get_chemical_probes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

        # Check clinical trials
        trials = tu.tools.search_clinical_trials(
            intervention=gene,
            pageSize=20
        )

        drug_targets.append({
            'gene': gene,
            'pharos_tdl': pharos.get('data', {}).get('tdl'),
            'existing_drugs': dgidb,
            'chemical_probes': probes,
            'clinical_trials': trials
        })

    return drug_targets
报告输出:
markdown
undefined

4. Druggability & Clinical Target Assessment

4. 成药性与临床靶点评估

Target Development Level Classification (Pharos)

靶点开发层级分类(Pharos)

TDLCountGenesInterpretation
Tclin5EGFR, KRAS, CDK2, HDAC1, AURKAApproved drug targets
Tchem8WEE1, PLK1, CHEK1, ...Chemical matter available, druggable
Tbio12E2F1, RB1, ...Biologically characterized, may need novel modalities
Tdark3GENE_X, GENE_Y, GENE_ZUnderstudied, limited tool compounds
Priority Ranking: Tclin > Tchem > Tbio for near-term drug development feasibility.
Source: Pharos/TCRD via
Pharos_get_target
TDL数量基因解读
Tclin5EGFR, KRAS, CDK2, HDAC1, AURKA已获批药物靶点
Tchem8WEE1, PLK1, CHEK1, ...有化学物质可用,可靶向
Tbio12E2F1, RB1, ...已进行生物学表征,可能需要新的靶向方式
Tdark3GENE_X, GENE_Y, GENE_Z研究不足,工具化合物有限
优先级排序: Tclin > Tchem > Tbio(从近期药物开发可行性角度)
来源: Pharos/TCRD via
Pharos_get_target

Approved Drugs & Clinical Tools

已获批药物与临床工具

GeneDrug(s)StatusIndicationSource
EGFRErlotinib, Gefitinib, OsimertinibApprovedNSCLCDGIdb
CDK2DinaciclibPhase 2Hematologic malignanciesClinicalTrials.gov
AURKAAlisertibPhase 3LymphomaClinicalTrials.gov
WEE1AdavosertibPhase 2Solid tumorsClinicalTrials.gov
Clinical Readiness: 5 genes have approved/late-stage drugs. These represent immediate repurposing opportunities.
Sources: DGIdb via
DGIdb_get_drug_gene_interactions
, ClinicalTrials.gov
基因药物状态适应症来源
EGFR厄洛替尼、吉非替尼、奥希替尼已获批非小细胞肺癌DGIdb
CDK2Dinaciclib2期血液系统恶性肿瘤ClinicalTrials.gov
AURKAAlisertib3期淋巴瘤ClinicalTrials.gov
WEE1Adavosertib2期实体瘤ClinicalTrials.gov
临床就绪度: 5个基因拥有获批/后期临床试验药物。这些代表了直接的药物重定位机会。
来源: DGIdb via
DGIdb_get_drug_gene_interactions
, ClinicalTrials.gov

Chemical Probes Available

可用化学探针

GeneProbePotencySelectivityUseSource
CDK2RoscovitineIC50 ~200nMModerate (pan-CDK)Tool compoundSGC/Open Targets
HDAC1SAHA (Vorinostat)IC50 ~10nMPan-HDACApproved drug, research toolChEMBL
Source: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblId
基因探针效力选择性用途来源
CDK2RoscovitineIC50 ~200nM中等(泛CDK)工具化合物SGC/Open Targets
HDAC1SAHA(伏立诺他)IC50 ~10nM泛HDAC获批药物、研究工具ChEMBL
来源: Open Targets via
OpenTargets_get_chemical_probes_by_target_ensemblId

Non-Druggable Hits Requiring Alternative Strategies

非可靶向候选靶点的替代策略

GeneChallengeRecommended Approach
E2F1Transcription factor (no catalytic domain)PROTACs, molecular glue degraders
RB1Tumor suppressor (loss-of-function)Synthetic lethal approach (e.g., CDK4/6i)
MCM2Part of large complex, no pocketsIndirect targeting via cell cycle inhibitors
Validation Priority: Focus on Tclin/Tchem hits with existing tool compounds for faster validation.

---
基因挑战推荐方案
E2F1转录因子(无催化结构域)PROTAC、分子胶降解剂
RB1肿瘤抑制因子(功能缺失)合成致死策略(如CDK4/6抑制剂)
MCM2大型复合物的一部分,无结合口袋通过细胞周期抑制剂间接靶向
验证优先级: 优先关注有现有工具化合物的Tclin/Tchem靶点,以加快验证速度。

---

PATH 5: Disease Association & Clinical Relevance

路径5: 疾病关联与临床相关性

Cancer Genomics Integration

癌症基因组整合

python
def assess_clinical_relevance(tu, gene_list, cancer_type):
    """
    Evaluate clinical relevance of hits in target cancer type.
    """
    clinical_data = []

    for gene in gene_list:
        ensembl_id = resolve_gene_to_ensembl(tu, gene)

        if ensembl_id:
            # Disease associations
            diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

            # Mouse models
            mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
                ensemblId=ensembl_id
            )

        # COSMIC mutations (somatic alterations in cancer)
        cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)

        # GTEx expression (is it expressed in relevant tissue?)
        gtex = tu.tools.GTEx_get_median_gene_expression(
            gencode_id=ensembl_id,
            operation="median"
        )

        clinical_data.append({
            'gene': gene,
            'diseases': diseases,
            'mutations': cosmic,
            'expression': gtex,
            'mouse_models': mouse
        })

    return clinical_data
Output for Report:
markdown
undefined
python
def assess_clinical_relevance(tu, gene_list, cancer_type):
    """
    Evaluate clinical relevance of hits in target cancer type.
    """
    clinical_data = []

    for gene in gene_list:
        ensembl_id = resolve_gene_to_ensembl(tu, gene)

        if ensembl_id:
            # Disease associations
            diseases = tu.tools.OpenTargets_get_diseases_phenotypes_by_target_ensemblId(
                ensemblId=ensembl_id
            )

            # Mouse models
            mouse = tu.tools.OpenTargets_get_biological_mouse_models_by_ensemblId(
                ensemblId=ensembl_id
            )

        # COSMIC mutations (somatic alterations in cancer)
        cosmic = tu.tools.COSMIC_get_gene_mutations(gene=gene)

        # GTEx expression (is it expressed in relevant tissue?)
        gtex = tu.tools.GTEx_get_median_gene_expression(
            gencode_id=ensembl_id,
            operation="median"
        )

        clinical_data.append({
            'gene': gene,
            'diseases': diseases,
            'mutations': cosmic,
            'expression': gtex,
            'mouse_models': mouse
        })

    return clinical_data
报告输出:
markdown
undefined

5. Clinical Relevance & Disease Association

5. 临床相关性与疾病关联

Cancer Genomic Alterations (COSMIC)

癌症基因组改变(COSMIC)

GeneMutation FrequencyCancer Types (Top 3)Alteration TypeEvidence
KRAS22% across all cancersPancreatic (90%), Colon (45%), Lung (32%)Activating mutations★★★
EGFR8% across all cancersLung (15%), Glioma (30%), Breast (2%)Amplification, mutations★★★
TP5342% across all cancersUniversalLoss-of-function★★★
Interpretation: High mutation frequency indicates gene is driver in those cancer types. CRISPR essentiality + genomic alteration = strong therapeutic rationale.
Source: COSMIC via
COSMIC_get_gene_mutations
基因突变频率主要癌症类型(前3)改变类型证据
KRAS所有癌症中占22%胰腺癌(90%)、结肠癌(45%)、肺癌(32%)激活突变★★★
EGFR所有癌症中占8%肺癌(15%)、胶质瘤(30%)、乳腺癌(2%)扩增、突变★★★
TP53所有癌症中占42%所有癌症功能缺失★★★
解读: 高突变频率表明该基因是这些癌症类型中的驱动基因。CRISPR必需性 + 基因组改变 = 强有力的治疗依据。
来源: COSMIC via
COSMIC_get_gene_mutations

Expression in Normal vs Tumor Tissue (GTEx/TCGA)

正常组织vs肿瘤组织中的表达(GTEx/TCGA)

GeneNormal Lung (median TPM)Lung Tumor (TCGA)Tumor/Normal RatioTherapeutic Window
EGFR8.545.35.3xModerate
AURKA2.118.78.9xGood
RPS6125.3132.11.05xPoor (housekeeping)
Interpretation: Genes with >3x tumor/normal expression offer better therapeutic window. Housekeeping genes (e.g., ribosomal) show poor selectivity.
Sources: GTEx via
GTEx_get_median_gene_expression
, TCGA data
基因正常肺组织(中位TPM)肺癌组织(TCGA)肿瘤/正常比值治疗窗口
EGFR8.545.35.3倍中等
AURKA2.118.78.9倍良好
RPS6125.3132.11.05倍差(管家基因)
解读: 肿瘤/正常表达比值>3倍的基因具有更好的治疗窗口。管家基因(如核糖体基因)选择性差。
来源: GTEx via
GTEx_get_median_gene_expression
, TCGA数据

Prognostic/Predictive Biomarker Status

预后/预测生物标志物状态

GeneBiomarker TypeCancerAssociationEvidenceSource
KRASPredictive (negative)ColorectalKRAS mut → anti-EGFR resistance★★★FDA label
EGFRPredictive (positive)NSCLCEGFR mut → TKI response★★★FDA companion dx
ESR1Predictive (positive)BreastESR1 expression → endocrine therapy★★★Clinical guidelines
Clinical Impact: 3 genes are established biomarkers with FDA-approved tests. Targeting these genes has strong clinical precedent.

---
基因生物标志物类型癌症关联关系证据来源
KRAS预测性(阴性)结直肠癌KRAS突变→抗EGFR治疗耐药★★★FDA标签
EGFR预测性(阳性)非小细胞肺癌EGFR突变→TKI应答★★★FDA伴随诊断
ESR1预测性(阳性)乳腺癌ESR1表达→内分泌治疗应答★★★临床指南
临床影响: 3个基因是已确立的生物标志物,拥有FDA批准的检测方法。靶向这些基因具有坚实的临床先例。

---

PATH 6: Hit Prioritization & Validation Strategy

路径6: 候选靶点优先级排序与验证策略

Integrate All Evidence Dimensions

整合所有证据维度

python
def calculate_priority_score(gene_data):
    """
    Calculate multi-dimensional priority score.

    Components:
    - Essentiality strength (DepMap score)
    - Selectivity (tissue-specific vs pan-cancer)
    - Druggability (Pharos TDL, existing compounds)
    - Clinical relevance (mutations, expression, biomarkers)
    - Validation feasibility (tool compounds available)

    Returns score 0-100
    """
    score = 0

    # Essentiality (0-30 points)
    if gene_data['depmap_score'] < -1.0:
        score += 30
    elif gene_data['depmap_score'] < -0.5:
        score += 20
    else:
        score += 10

    # Selectivity (0-25 points)
    if gene_data['selective']:  # Tissue-specific
        score += 25
    elif gene_data['pan_cancer']:  # Pan-cancer (deprioritize)
        score += 5

    # Druggability (0-25 points)
    if gene_data['pharos_tdl'] == 'Tclin':
        score += 25
    elif gene_data['pharos_tdl'] == 'Tchem':
        score += 20
    elif gene_data['pharos_tdl'] == 'Tbio':
        score += 10
    else:
        score += 5

    # Clinical relevance (0-20 points)
    if gene_data['mutation_frequency'] > 20:
        score += 10
    if gene_data['biomarker_status']:
        score += 10

    return score
Output for Report:
markdown
undefined
python
def calculate_priority_score(gene_data):
    """
    Calculate multi-dimensional priority score.

    Components:
    - Essentiality strength (DepMap score)
    - Selectivity (tissue-specific vs pan-cancer)
    - Druggability (Pharos TDL, existing compounds)
    - Clinical relevance (mutations, expression, biomarkers)
    - Validation feasibility (tool compounds available)

    Returns score 0-100
    """
    score = 0

    # Essentiality (0-30 points)
    if gene_data['depmap_score'] < -1.0:
        score += 30
    elif gene_data['depmap_score'] < -0.5:
        score += 20
    else:
        score += 10

    # Selectivity (0-25 points)
    if gene_data['selective']:  # Tissue-specific
        score += 25
    elif gene_data['pan_cancer']:  # Pan-cancer (deprioritize)
        score += 5

    # Druggability (0-25 points)
    if gene_data['pharos_tdl'] == 'Tclin':
        score += 25
    elif gene_data['pharos_tdl'] == 'Tchem':
        score += 20
    elif gene_data['pharos_tdl'] == 'Tbio':
        score += 10
    else:
        score += 5

    # Clinical relevance (0-20 points)
    if gene_data['mutation_frequency'] > 20:
        score += 10
    if gene_data['biomarker_status']:
        score += 10

    return score
报告输出:
markdown
undefined

6. Hit Prioritization & Validation Recommendations

6. 候选靶点优先级排序与验证建议

Top 10 Priority Targets (Multi-Dimensional Scoring)

前10个优先级靶点(多维度评分)

RankGeneEssentialitySelectivityDruggabilityClinicalTotal ScoreRecommendation
1KRAS30/3025/2520/2520/2095/100High priority, validated drugs available
2EGFR28/3024/2525/2518/2095/100High priority, approved drugs
3WEE126/3023/2520/2512/2081/100Medium-high, Phase 2 drug available
4AURKA24/3022/2520/2514/2080/100Medium-high, tool compounds exist
5CDK225/3020/2520/2510/2075/100Medium, multiple tool compounds
6CHEK123/3021/2518/2510/2072/100Medium, chemical probes available
7PLK122/3020/2518/2511/2071/100Medium, clinical tool compounds
8E2F124/3022/2510/2512/2068/100Medium-low, requires degrader strategy
9HDAC120/3018/2525/258/2071/100Medium, approved HDAC inhibitors
10MCM228/3010/255/258/2051/100Low, pan-cancer essential, not druggable
Scoring Rubric:
  • Essentiality (30 pts): DepMap gene effect score magnitude
  • Selectivity (25 pts): Tissue-specific vs pan-cancer dependency
  • Druggability (25 pts): Pharos TDL, existing compounds, tractability
  • Clinical (20 pts): Mutation frequency, biomarker status, expression
Priority Tiers:
  • Tier 1 (Score >80): Immediate validation, existing tools/drugs available
  • Tier 2 (Score 60-80): Medium priority, validation feasible with chemical probes
  • Tier 3 (Score <60): Lower priority or requires novel approaches (PROTACs, etc.)
排名基因必需性选择性成药性临床相关性总分建议
1KRAS30/3025/2520/2520/2095/100高优先级,有已验证药物可用
2EGFR28/3024/2525/2518/2095/100高优先级,有获批药物
3WEE126/3023/2520/2512/2081/100中高优先级,有2期药物可用
4AURKA24/3022/2520/2514/2080/100中高优先级,有工具化合物可用
5CDK225/3020/2520/2510/2075/100中优先级,有多种工具化合物
6CHEK123/3021/2518/2510/2072/100中优先级,有化学探针可用
7PLK122/3020/2518/2511/2071/100中优先级,有临床工具化合物
8E2F124/3022/2510/2512/2068/100中低优先级,需要降解剂策略
9HDAC120/3018/2525/258/2071/100中优先级,有获批HDAC抑制剂
10MCM228/3010/255/258/2051/100低优先级,泛癌必需,不可靶向
评分规则:
  • 必需性(30分): DepMap基因效应值的绝对值
  • 选择性(25分): 组织特异性vs泛癌依赖性
  • 成药性(25分): Pharos TDL、现有化合物、可靶向性
  • 临床相关性(20分): 突变频率、生物标志物状态、表达情况
优先级层级:
  • 1级(得分>80): 立即验证,有现有工具/药物可用
  • 2级(得分60-80): 中优先级,可用化学探针进行验证
  • 3级(得分<60): 低优先级或需要新方法(如PROTAC等)

Validation Experiment Recommendations

验证实验建议

Tier 1 Targets (KRAS, EGFR, WEE1)
1级靶点(KRAS、EGFR、WEE1)
1. KRAS
  • Essentiality: Strong selective dependency in KRAS-mutant cancers
  • Validation Approach:
    • Test KRAS G12C inhibitor (sotorasib/adagrasib) in KRAS G12C-mutant cell lines from screen
    • Orthogonal validation: siRNA/shRNA knockdown
    • Rescue experiment: Re-express WT KRAS in KO cells
  • Expected Outcome: Growth inhibition/cell death in KRAS-mutant lines only
  • Tool Compounds: Sotorasib (AMG 510), Adagrasib (MRTX849), MRTX1133 (pan-KRAS)
  • Timeline: 2-3 weeks for cell line validation
2. EGFR
  • Essentiality: Selective in EGFR-mutant/amplified NSCLC, glioblastoma
  • Validation Approach:
    • Test EGFR TKI panel (erlotinib, osimertinib) in screen cell lines
    • Dose-response curves to establish IC50
    • Combination with standard chemotherapy
  • Expected Outcome: Potent inhibition in EGFR-altered lines
  • Tool Compounds: Erlotinib, Gefitinib, Osimertinib (all FDA-approved)
  • Timeline: 1-2 weeks
3. WEE1
  • Essentiality: Synthetic lethal with TP53 loss, selective in TP53-mutant cancers
  • Validation Approach:
    • Test adavosertib (WEE1 inhibitor) ± DNA damaging agents
    • Stratify by TP53 status (mutant vs WT)
    • Cell cycle analysis (premature mitotic entry)
  • Expected Outcome: Selective killing of TP53-mutant cells + synergy with chemo
  • Tool Compounds: Adavosertib (AZD1775), PD-166285
  • Timeline: 2-3 weeks
1. KRAS
  • 必需性: 在KRAS突变型癌症中具有强选择性依赖性
  • 验证方法:
    • 在筛选得到的KRAS G12C突变型细胞系中测试KRAS G12C抑制剂(sotorasib/adagrasib)
    • 正交验证:siRNA/shRNA敲低
    • 拯救实验:在敲除细胞中重新表达野生型KRAS
  • 预期结果: 仅在KRAS突变型细胞系中出现生长抑制/细胞死亡
  • 工具化合物: Sotorasib(AMG 510)、Adagrasib(MRTX849)、MRTX1133(泛KRAS)
  • 时间线: 2-3周的细胞系验证
2. EGFR
  • 必需性: 在EGFR突变/扩增的非小细胞肺癌、胶质母细胞瘤中具有选择性
  • 验证方法:
    • 在筛选细胞系中测试EGFR TKI面板(厄洛替尼、奥希替尼)
    • 绘制剂量反应曲线以确定IC50
    • 与标准化疗联合测试
  • 预期结果: 在EGFR改变的细胞系中出现强效抑制
  • 工具化合物: 厄洛替尼、吉非替尼、奥希替尼(均为FDA获批)
  • 时间线: 1-2周
3. WEE1
  • 必需性: 与TP53缺失具有合成致死性,在TP53突变型癌症中具有选择性
  • 验证方法:
    • 测试adavosertib(WEE1抑制剂)± DNA损伤剂
    • 按TP53状态分层(突变型vs野生型)
    • 细胞周期分析(提前进入有丝分裂)
  • 预期结果: 选择性杀伤TP53突变型细胞 + 与化疗协同
  • 工具化合物: Adavosertib(AZD1775)、PD-166285
  • 时间线: 2-3周
Tier 2 Targets (AURKA, CDK2, CHEK1, PLK1)
2级靶点(AURKA、CDK2、CHEK1、PLK1)
General Strategy:
  • Pharmacological validation with 2-3 selective inhibitors per target
  • Orthogonal genetic validation (CRISPRi/shRNA)
  • Pathway analysis (Western blots for downstream effectors)
  • Combination screens with standard-of-care agents
Recommended Tool Compounds:
  • AURKA: Alisertib (MLN8237), Aurora A Inhibitor I
  • CDK2: Roscovitine (seliciclib), Dinaciclib
  • CHEK1: Prexasertib (LY2606368), AZD7762
  • PLK1: Volasertib (BI 6727), BI 2536
通用策略:
  • 每个靶点使用2-3种选择性抑制剂进行药理学验证
  • 正交遗传验证(CRISPRi/shRNA)
  • 通路分析(下游效应因子的Western blot)
  • 与标准治疗药物的联合筛选
推荐工具化合物:
  • AURKA: Alisertib(MLN8237)、Aurora A抑制剂I
  • CDK2: Roscovitine(seliciclib)、Dinaciclib
  • CHEK1: Prexasertib(LY2606368)、AZD7762
  • PLK1: Volasertib(BI 6727)、BI 2536
Tier 3 Targets - Alternative Validation Strategies
3级靶点 - 替代验证策略
E2F1, RB1 (Transcription factors):
  • Challenge: No direct small molecule inhibitors
  • Strategy:
    • Test PROTACs if available
    • Indirect validation via upstream targets (CDK4/6 inhibitors for RB pathway)
    • Genetic validation only (CRISPRko, CRISPRi)
MCM2-7 Complex (Helicase, pan-essential):
  • Challenge: Pan-cancer essential, poor therapeutic window
  • Strategy: Deprioritize for drug development
  • Note: Interesting for understanding replication biology, but not ideal therapeutic target
E2F1、RB1(转录因子):
  • 挑战: 无直接小分子抑制剂
  • 策略:
    • 若有PROTAC则进行测试
    • 通过上游靶点间接验证(如针对RB通路的CDK4/6抑制剂)
    • 仅进行遗传验证(CRISPRko、CRISPRi)
MCM2-7复合物(解旋酶,泛必需):
  • 挑战: 泛癌必需,治疗窗口差
  • 策略: 优先排除在药物开发之外
  • 注意: 对理解复制生物学有意义,但不是理想的治疗靶点

Validation Timeline

验证时间线

PhaseDurationExperimentsDeliverable
Phase 1Weeks 1-3Tier 1 target validation (KRAS, EGFR, WEE1)Dose-response curves, potency data
Phase 2Weeks 4-6Tier 2 target validation (AURKA, CDK2, etc.)Hit confirmation, selectivity data
Phase 3Weeks 7-10Mechanism studies, pathway analysisWestern blots, cell cycle, apoptosis
Phase 4Weeks 11-14Combination studies, in vivo pilot (top 2-3)Synergy matrices, xenograft data
阶段时长实验交付物
阶段1第1-3周1级靶点验证(KRAS、EGFR、WEE1)剂量反应曲线、效力数据
阶段2第4-6周2级靶点验证(AURKA、CDK2等)命中确认、选择性数据
阶段3第7-10周机制研究、通路分析Western blot、细胞周期、凋亡数据
阶段4第11-14周联合研究、体内试点(前2-3个靶点)协同矩阵、异种移植数据

Success Criteria for Validation

验证成功标准

Hit Confirmed if:
  • Pharmacological inhibition phenocopies CRISPR knockout (≥50% growth inhibition at ≤1 µM)
  • Dose-response curve shows IC50 consistent with essentiality score
  • Effect is selective (active in screen cell line, inactive in control lines)
  • Orthogonal genetic methods (siRNA, CRISPRi) reproduce phenotype
Hit Rejected/Deprioritized if:
  • Tool compounds show no effect despite strong CRISPR score (off-target CRISPR effect)
  • Pan-cancer essential with no selectivity (poor therapeutic window)
  • No druggable domain/strategy (TFs, scaffolds without chemical matter)
  • Cannot be validated with available reagents
命中确认 若:
  • 药理学抑制模拟CRISPR敲除效果(≤1 µM时生长抑制≥50%)
  • 剂量反应曲线的IC50与必需性评分一致
  • 效应具有选择性(在筛选细胞系中有效,在对照细胞系中无效)
  • 正交遗传方法(siRNA、CRISPRi)可重现表型
命中被拒绝/降优先级 若:
  • 工具化合物无效果,尽管CRISPR评分很高(CRISPR脱靶效应)
  • 泛癌必需且无选择性(治疗窗口差)
  • 无可靶向结构域/策略(转录因子、无化学结合位点的支架蛋白)
  • 无法用现有试剂验证

Resource Requirements

资源需求

Reagents:
  • Chemical compounds: $5-10K (tool compounds, commercial inhibitors)
  • CRISPRi/shRNA validation: $3-5K (vectors, reagents)
  • Cell culture & assays: $8-12K (plates, reagents, media)
Equipment:
  • Cell culture facility (BSL2)
  • Plate readers (viability assays, luminescence)
  • Flow cytometer (cell cycle, apoptosis)
  • Western blot equipment
Personnel: 1 postdoc + 1 technician for 3-4 months

Validation Strategy Summary: Focus validation efforts on Tier 1 targets (KRAS, EGFR, WEE1) with approved/late-stage drugs. These offer fastest path to clinical translation and have highest probability of success based on multi-dimensional scoring.

---
试剂:
  • 化学化合物: 5-10千美元(工具化合物、商用抑制剂)
  • CRISPRi/shRNA验证: 3-5千美元(载体、试剂)
  • 细胞培养与实验: 8-12千美元(培养板、试剂、培养基)
设备:
  • 细胞培养设施(BSL2)
  • 酶标仪(活力实验、发光检测)
  • 流式细胞仪(细胞周期、凋亡)
  • Western blot设备
人员: 1名博士后 + 1名技术员,耗时3-4个月

验证策略总结: 集中验证资源在1级靶点(KRAS、EGFR、WEE1),这些靶点拥有获批/后期临床试验药物。基于多维度评分,这些靶点转化为临床应用的速度最快,成功概率最高。

---

Report Template (Initial File)

报告模板(初始文件)

File:
CRISPR_screen_analysis_[CONTEXT].md
markdown
undefined
文件:
CRISPR_screen_analysis_[CONTEXT].md
markdown
undefined

CRISPR Screen Analysis Report: [CONTEXT]

CRISPR筛选分析报告: [背景信息]

Generated: [Date] Input: [Gene list / Cancer type / Single gene] Context: [Screen type, cell line, experimental details] Status: In Progress

生成日期: [日期] 输入: [基因列表 / 癌症类型 / 单个基因] 背景: [筛选类型、细胞系、实验细节] 状态: 进行中

Executive Summary

执行摘要

[Analyzing...]
<!-- Will contain: key findings, top hits, recommended priorities -->
[分析中...]
<!-- 将包含:关键发现、顶级候选靶点、推荐优先级 -->

Input Validation

输入验证

[Analyzing...]
<!-- Gene symbol validation, invalid genes, suggestions -->
[分析中...]
<!-- 基因符号验证、无效基因、建议 -->

1. Gene Essentiality Analysis

1. 基因必需性分析

1.1 Strongly Essential Genes

1.1 强必需基因

[Analyzing...]
[分析中...]

1.2 Selectively Essential Genes

1.2 选择性必需基因

[Analyzing...]
[分析中...]

1.3 Weakly Essential / Non-Essential

1.3 弱必需 / 非必需基因

[Analyzing...]

[分析中...]

2. Pathway & Functional Enrichment

2. 通路与功能富集

2.1 Pathway Enrichment (Reactome, KEGG)

2.1 通路富集(Reactome、KEGG)

[Analyzing...]
[分析中...]

2.2 GO Enrichment (BP, MF, CC)

2.2 GO富集(生物过程、分子功能、细胞组分)

[Analyzing...]
[分析中...]

2.3 Hallmark Gene Sets (MSigDB)

2.3 特征基因集(MSigDB)

[Analyzing...]
[分析中...]

2.4 Pathway-Level Interpretation

2.4 通路层面解读

[Analyzing...]

[分析中...]

3. Protein Interaction Network

3. 蛋白质相互作用网络

3.1 Network Statistics

3.1 网络统计

[Analyzing...]
[分析中...]

3.2 Protein Complexes

3.2 蛋白质复合物

[Analyzing...]
[分析中...]

3.3 Hub Genes

3.3 枢纽基因

[Analyzing...]
[分析中...]

3.4 Synthetic Lethal Candidates

3.4 合成致死候选靶点

[Analyzing...]

[分析中...]

4. Druggability Assessment

4. 成药性评估

4.1 Target Development Level (Pharos)

4.1 靶点开发层级(Pharos)

[Analyzing...]
[分析中...]

4.2 Approved Drugs & Clinical Candidates

4.2 已获批药物与临床候选药物

[Analyzing...]
[分析中...]

4.3 Chemical Probes

4.3 化学探针

[Analyzing...]
[分析中...]

4.4 Non-Druggable Hits (Alternative Strategies)

4.4 非可靶向候选靶点(替代策略)

[Analyzing...]

[分析中...]

5. Clinical Relevance

5. 临床相关性

5.1 Cancer Genomic Alterations (COSMIC)

5.1 癌症基因组改变(COSMIC)

[Analyzing...]
[分析中...]

5.2 Expression in Tumor vs Normal

5.2 肿瘤vs正常组织中的表达

[Analyzing...]
[分析中...]

5.3 Prognostic/Predictive Biomarkers

5.3 预后/预测生物标志物

[Analyzing...]
[分析中...]

5.4 Mouse Models & Genetic Evidence

5.4 小鼠模型与遗传学证据

[Analyzing...]

[分析中...]

6. Hit Prioritization & Validation

6. 候选靶点优先级排序与验证

6.1 Multi-Dimensional Scoring

6.1 多维度评分

[Analyzing...]
[分析中...]

6.2 Top 10 Priority Targets

6.2 前10个优先级靶点

[Analyzing...]
[分析中...]

6.3 Validation Experiment Recommendations

6.3 验证实验建议

[Analyzing...]
[分析中...]

6.4 Validation Timeline & Resources

6.4 验证时间线与资源

[Analyzing...]

[分析中...]

7. Data Sources & Quality Control

7. 数据源与质量控制

7.1 Primary Data Sources

7.1 主要数据源

[Will be populated...]
[将填充...]

7.2 Tool Call Summary

7.2 工具调用汇总

[Will be populated...]
[将填充...]

7.3 Limitations & Caveats

7.3 局限性与注意事项

[Will be populated...]

[将填充...]

Appendix: Full Hit List

附录: 完整候选靶点列表

[Complete gene list with all scores...]

---
[带所有评分的完整基因列表...]

---

When NOT to Use This Skill

何时不使用本技能

  • Drug target research (single gene) → Use
    tooluniverse-target-research
    skill instead
  • Disease-centric query → Use
    tooluniverse-disease-research
    skill
  • Chemical compound screening → Different workflow needed
  • RNA-seq differential expression → Use differential expression analysis workflows
  • Single gene lookup → Call DepMap tools directly
Use this skill when you have a gene list from a CRISPR screen and need comprehensive functional interpretation + target prioritization.

  • 药物靶点研究(单个基因)→ 改用
    tooluniverse-target-research
    技能
  • 疾病导向查询 → 改用
    tooluniverse-disease-research
    技能
  • 化学化合物筛选 → 需要不同的工作流
  • RNA-seq差异表达分析 → 使用差异表达分析工作流
  • 单个基因查询 → 直接调用DepMap工具
当你拥有CRISPR筛选得到的基因列表,并需要进行全面的功能解读 + 靶点优先级排序时,使用本技能。

Example Queries That Trigger This Skill

触发本技能的示例查询

Gene List Analysis:
  • "Analyze these CRISPR screen hits: EGFR, KRAS, WEE1, PLK1, AURKA, ..."
  • "I have 50 dropout genes from a CRISPR screen in lung cancer cells, what should I validate?"
  • "Prioritize these genes for drug target development: [gene list]"
Cancer Type Query:
  • "What are the top essential genes in pancreatic cancer?"
  • "Find druggable dependencies in triple-negative breast cancer"
Single Gene Validation:
  • "Is KRAS a good therapeutic target for lung cancer?"
  • "Assess the druggability of WEE1 as a cancer target"
Not Appropriate:
  • "What is the function of EGFR?" → too broad, use target-research skill
  • "Find drugs for lung cancer" → disease-centric, use drug-repurposing skill
  • "Analyze this RNA-seq data" → different analytical workflow

基因列表分析:
  • "分析这些CRISPR筛选候选靶点:EGFR, KRAS, WEE1, PLK1, AURKA, ..."
  • "我有50个来自肺癌细胞CRISPR dropout筛选的基因,应该验证哪些?"
  • "为药物靶点开发对这些基因进行优先级排序:[基因列表]"
癌症类型查询:
  • "胰腺癌中的顶级必需基因是什么?"
  • "寻找三阴性乳腺癌中的可靶向依赖性"
单个基因验证:
  • "KRAS是肺癌的良好治疗靶点吗?"
  • "评估WEE1作为癌症靶点的成药性"
不适用场景:
  • "EGFR的功能是什么?" → 范围过广,使用target-research技能
  • "寻找肺癌的治疗药物" → 疾病导向,使用药物重定位技能
  • "分析这份RNA-seq数据" → 需要不同的分析工作流

Key Improvements from Existing Skills

与现有技能相比的关键改进

Based on patterns in
tooluniverse-target-research
and
tooluniverse-drug-research
:
  1. Multi-dimensional scoring system (novel for CRISPR analysis)
  2. Validation experiment recommendations with timelines and reagents
  3. Tier-based prioritization (Tier 1/2/3 based on actionability)
  4. Tool compound suggestions for each druggable target
  5. Synthetic lethal candidate identification from PPI network
  6. Explicit selectivity analysis (pan-cancer vs tissue-selective)
  7. Success/failure criteria for validation experiments

基于
tooluniverse-target-research
tooluniverse-drug-research
的使用模式:
  1. 多维度评分系统(CRISPR分析的创新点)
  2. 带时间线和试剂的验证实验建议
  3. 基于可行动性的层级优先级排序(1/2/3级)
  4. 每个可靶向靶点的工具化合物建议
  5. 从PPI网络识别合成致死候选靶点
  6. 明确的选择性分析(泛癌vs组织选择性)
  7. 验证实验的成功/失败标准

Quality Control Checklist

质量控制检查表

Before finalizing report:
  • All input genes validated against DepMap registry
  • Essentiality scores retrieved for all valid genes
  • Pathway enrichment performed (minimum: GO BP, Reactome, Hallmark)
  • PPI network constructed with interaction counts
  • Druggability assessed for all hits (Pharos TDL + DGIdb)
  • Top 10 priority targets table completed
  • Validation recommendations provided for Tier 1 targets
  • Evidence grades assigned (★★★, ★★☆, ★☆☆)
  • All data sources cited explicitly
  • "No data" explicitly stated when tools return empty results
  • Executive summary synthesizes all findings
最终确定报告前:
  • 所有输入基因已在DepMap注册表中验证
  • 已为所有有效基因检索必需性评分
  • 已进行通路富集分析(至少:GO生物过程、Reactome、Hallmark)
  • 已构建带相互作用计数的PPI网络
  • 已评估所有候选靶点的成药性(Pharos TDL + DGIdb)
  • 已完成前10个优先级靶点表格
  • 已为1级靶点提供验证建议
  • 已分配证据等级(★★★, ★★☆, ★☆☆)
  • 所有数据源已明确引用
  • 工具返回空结果时已明确标注“无数据”
  • 执行摘要已整合所有发现