Total: 30,668 skills. Data Processing has 1,471 skills.
Showing 12 of 1,471 skills
Runs local BLAT searches for DNA sequence alignment against hg38 or CHM13 using local .2bit references. Use when a user wants to align a DNA sequence without relying on UCSC API access.
This skill should be used when the user needs to query the COSMIC Cancer Gene Census to check whether genes are known cancer genes. Triggers include requests to annotate genes with cancer information, check whether variants fall in cancer genes, or retrieve cancer gene properties from the COSMIC database.
Ingest, QC, and map reads with reproducible outputs. Use for raw read processing and coverage stats.
Track and reconcile taxonomy updates across NCBI, GTDB, ICTV, and community eukaryote frameworks with versioned provenance.
Unity Catalog governance patterns, permissions models, security best practices, and policy enforcement for enterprise data governance.
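GSheet-CRUD API usage guide. Treats Google Sheets as a RESTful API database. Use this skill when the user needs to operate on Google Sheets data through the API, including querying data (GET), inserting data (POST), updating data (PUT), and deleting data (DELETE).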
Time Travel CRDT Skill
Invariant manifold tangent to center eigenspace
Parse, search, analyze, and ingest LinkedIn GDPR data exports. This skill should be used when working with LinkedIn data — searching messages, analyzing connections, exporting to Markdown, or ingesting into RLAMA for semantic search. Requires a LinkedIn GDPR data export ZIP file.
Computational geometry with Shapely: create geometries, perform boolean operations, take measurements, and evaluate predicates.
Researches SEC filings, earnings calls, analyst reports, and market data. Use when the album subject involves financial crimes, corporate stories, or market events.
Dimensional modeling and schema design for data products. Star schema patterns, slowly changing dimensions, denormalization decisions, and architecture decision records. Use when designing data models, reviewing schema designs, choosing between normalization strategies, or when someone asks "how should I model this data?" or "should I denormalize?" For OMOP CDM patterns specifically, see healthcare-data-domain.