All Skills

Total 50,503 skills, Data Processing has 2560 skills

Showing 12 of 2560 skills

Per page

Downloads

Sort

Data Processingtursodatabase/turso

storage-format

SQLite file format, B-trees, pages, cells, overflow, freelist that is used in tursodb

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

nemo-curator

GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora.

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

scanpy

Single-cell RNA-seq analysis. Load .h5ad/10X data, QC, normalization, PCA/UMAP/t-SNE, Leiden clustering, marker genes, cell type annotation, trajectory, for scRNA-seq analysis.

🇺🇸|EnglishTranslated

2 scripts/Checked

Data Processingdavila7/claude-code-templ...

scvi-tools

This skill should be used when working with single-cell omics data analysis using scvi-tools, including scRNA-seq, scATAC-seq, CITE-seq, spatial transcriptomics, and other single-cell modalities. Use this skill for probabilistic modeling, batch correction, dimensionality reduction, differential expression, cell type annotation, multimodal integration, and spatial analysis tasks.

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

pysam

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

uniprot-database

Direct REST API access to UniProt. Protein searches, FASTA retrieval, ID mapping, Swiss-Prot/TrEMBL. For Python workflows with multiple databases, prefer bioservices (unified interface to 40+ services). Use this for direct HTTP/REST work or UniProt-specific control.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingdavila7/claude-code-templ...

omero-integration

Microscopy data management platform. Access images via Python, retrieve datasets, analyze pixels, manage ROIs/annotations, batch processing, for high-content screening and microscopy workflows.

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

gtars

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

🇺🇸|EnglishTranslated

Data Processingastronomer/agents

init

Initialize warehouse schema discovery. Generates .astro/warehouse.md with all table metadata for instant lookups. Run once per project, refresh when schema changes. Use when user says "/data:init" or asks to set up data discovery.

🇺🇸|EnglishTranslated

Data Processinglangchain-ai/deepagents

schema-exploration

For discovering and understanding database structure, tables, columns, and relationships

🇺🇸|EnglishTranslated

Data Processingvasilyu1983/ai-agents-pub...

document-xlsx

Create, edit, audit, and extract Excel spreadsheets (.xlsx): generate reports/exports, apply formulas/formatting/charts/data validation, parse existing workbooks, and avoid spreadsheet risks (formula injection, broken links, hidden rows). Supports ExcelJS, openpyxl, pandas, XlsxWriter, and SheetJS.

🇺🇸|EnglishTranslated

Data Processingdkyazzentwatwa/chatgpt-sk...

geo-visualizer

Create interactive maps with markers, heatmaps, routes, and choropleth layers. Use when visualizing geographic data, plotting locations, or creating map-based reports.

🇺🇸|EnglishTranslated

1 scripts/Checked