Search Results: data-preprocessing

Found 17 Skills

tcga-bulk-data-preprocessing-with-omicverse

Guide Claude through ingesting TCGA sample sheets, expression archives, and clinical carts into omicverse, initialising survival metadata, and exporting annotated AnnData files.

🇺🇸 | English | Translated

AI & Machine Learning | k-dense-ai/claude-scienti...

scikit-learn

Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.

🇺🇸 | English | Translated

110

2 scripts/Checked

Data Processing | davila7/claude-code-templ...

flowio

Parse FCS (Flow Cytometry Standard) files v2.0-3.1. Extract events as NumPy arrays, read metadata/channels, convert to CSV/DataFrame, for flow cytometry data preprocessing.

🇺🇸 | English | Translated

AI & Machine Learning | ruvnet/ruflo

agent-data-ml-model

Agent skill for data-ml-model - invoke with $agent-data-ml-model

🇺🇸 | English | Translated

Data Processing | asgard-ai-platform/skills

stat-eda

Conduct Exploratory Data Analysis (EDA) using descriptive statistics, visualizations, and data quality checks. Use this skill when the user has a dataset and needs to understand its structure, find patterns, detect anomalies, or prepare data for further analysis — even if they say 'what does this data look like', 'find interesting patterns', 'clean this data', or 'summarize this dataset'.

🇺🇸 | English | Translated

Data Processing | nvidia/skills

dicom-series-preflight

Used for header-only preflight of one DICOM series folder before conversion or inference. Not for de-identification or clinical clearance.

🇺🇸 | English | Translated

3 scripts/Checked

Data Processing | dkyazzentwatwa/chatgpt-sk...

feature-engineering-kit

Auto-generate features with encodings, scaling, polynomial features, and interaction terms for ML pipelines.

🇺🇸 | English | Translated

1 scripts/Checked

Data Processing | malue-ai/dazee-small

excel-fixer

Auto-detect and fix common Excel formatting issues like merged cells, inconsistent types, duplicate headers, and encoding problems.

🇺🇸 | English | Translated

Document Processing | chujianyun/skills

opendataloader-pdf

PDF data extraction tool. Use it when users mention "PDF extraction", "PDF to Markdown", "PDF parsing", "extract PDF content", "PDF to JSON", "RAG PDF". OpenDataLoader PDF is currently the top-ranked PDF parser in benchmark tests, supporting local mode (fast, deterministic) and hybrid AI mode (for complex tables, scanned documents, formulas), with output formats including Markdown, JSON (with bounding boxes), and HTML. It is suitable for scenarios where structured data needs to be extracted from PDFs for RAG/LLM pipelines, or where batch processing of PDF documents is required.

🇨🇳 | Chinese | Translated

Data Processing | davila7/claude-code-templ...

scanpy

Single-cell RNA-seq analysis. Load .h5ad/10X data, QC, normalization, PCA/UMAP/t-SNE, Leiden clustering, marker genes, cell type annotation, trajectory, for scRNA-seq analysis.

🇺🇸 | English | Translated

2 scripts/Checked

Data Processing | jamditis/claude-skills-jo...

data-journalism

Data journalism workflows for analysis, visualization, and storytelling. Use when analyzing datasets, creating charts and maps, cleaning messy data, calculating statistics or building data-driven stories. Essential for reporters, newsrooms and researchers working with quantitative information.

🇺🇸 | English | Translated

Data Processing | meleantonio/awesome-econ-...

stata-data-cleaning

Clean and transform messy data in Stata with reproducible workflows.

🇺🇸 | English | Translated