Search Results: feature-engineering

Found 22 Skills

Data Processingdkyazzentwatwa/chatgpt-sk...

feature-engineering-kit

Auto-generate features with encodings, scaling, polynomial features, and interaction terms for ML pipelines.

AI & Machine Learningpproenca/dot-skills

marketplace-recsys-feature-engineering

Use this skill whenever deciding what features to extract from raw marketplace assets — listing photos, owner-entered listing metadata, sitter wizard responses — to power item-to-item (similar listings), user-to-item (homefeed ranking), or user-to-user (mutual-fit matching) recommenders in a two-sided trust marketplace. Covers asset auditing, first-principles feature decomposition from the decision the user is making, vision-feature extraction (CLIP, room-type classification, amenity detection, aesthetic and quality scoring), listing text and metadata encoding (categoricals, multi-hot amenities, H3 geo-hashing, sentence-transformer description embeddings, structured pet triples), sitter wizard design (information-gain ordering, multiple-choice over free text, genuine skippability, hard constraint versus soft preference), derived-composition patterns for i2i / u2i / u2u (precomputed ANN shelves, multi-modal fusion, two-tower affinity, symmetric mutual-fit scoring, interpretable subscores), feature quality governance (single registry, training-serving parity, coverage and drift alarms, PII scrubbing, schema versioning), and incremental value proof (one feature at a time, ablation A/B, kill reviews, exploration slice, permanent feature-free baseline). Trigger even when the user does not explicitly say "feature engineering" but is asking how to get more signal out of listing photos, listing metadata, or the sitter onboarding wizard, or how to improve i2i / u2i / u2u quality without blindly ingesting a new model.

🇺🇸|EnglishTranslated

AI & Machine Learningjeremylongshore/claude-co...

feature-engineering-helper

Feature Engineering Helper - Auto-activating skill for ML Training. Triggers on: feature engineering helper, feature engineering helper Part of the ML Training skill category.

🇺🇸|EnglishTranslated

Data Processingdavila7/claude-code-templ...

senior-data-scientist

World-class data science skill for statistical modeling, experimentation, causal inference, and advanced analytics. Expertise in Python (NumPy, Pandas, Scikit-learn), R, SQL, statistical methods, A/B testing, time series, and business intelligence. Includes experiment design, feature engineering, model evaluation, and stakeholder communication. Use when designing experiments, building predictive models, performing causal analysis, or driving data-driven decisions.

🇺🇸|EnglishTranslated

3 scripts/Checked

AI & Machine Learningborghei/claude-skills

data-scientist

Expert data science covering machine learning, statistical modeling, experimentation, predictive analytics, and advanced analytics.

🇺🇸|EnglishTranslated

AI & Machine Learningvasilyu1983/ai-agents-pub...

ai-ml-timeseries

Operational patterns, templates, and decision rules for time series forecasting (modern best practices): tree-based methods (LightGBM), deep learning (Transformers, RNNs), future-guided learning, temporal validation, feature engineering, generative TS (Chronos), and production deployment. Emphasizes explainability, long-term dependency handling, and adaptive forecasting.

🇺🇸|EnglishTranslated

Data Processingbrojonat/llmsrules

tabular-eda

Profile a new tabular dataset before modeling. Find target leakage, missing data patterns, high-cardinality categoricals, near-constant features, redundant pairs, and non-linear relationships that Pearson correlation misses. Use whenever the user hands you a CSV or parquet and asks "what should I do with this?" Always run this skill before training any model on data you haven't seen before.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingtondevrel/scientific-agen...

duckdb

An analytical in-process SQL database management system. Designed for fast analytical queries (OLAP). Highly interoperable with Python's data ecosystem (Pandas, NumPy, Arrow, Polars). Supports querying files (CSV, Parquet, JSON) directly without an ingestion step. Use for complex SQL queries on Pandas/Polars data, querying large Parquet/CSV files directly, joining data from different sources, analytical pipelines, local datasets too big for Excel, intermediate data storage and feature engineering for ML.

🇺🇸|EnglishTranslated

AI & Machine Learningtondevrel/scientific-agen...

scikit-learn

The industry standard library for machine learning in Python. Provides simple and efficient tools for predictive data analysis, covering classification, regression, clustering, dimensionality reduction, model selection, and preprocessing.

🇺🇸|EnglishTranslated

Data Processingpersonamanagmentlayer/pcl

data-science-expert

Expert-level data science, analytics, visualization, and statistical modeling

🇺🇸|EnglishTranslated

AI & Machine Learningasgard-ai-platform/skills

algo-ad-ctr

Build CTR prediction models for estimating ad click-through rates from features. Use this skill when the user needs to predict click probability, build an ad ranking model, or evaluate ad creative performance — even if they say 'predict click rate', 'ad relevance scoring', or 'which ad will get more clicks'.

🇺🇸|EnglishTranslated

AI & Machine Learningvasilyu1983/ai-agents-pub...

ai-ml-data-science

End-to-end data science and ML engineering workflows: problem framing, data/EDA, feature engineering (feature stores), modelling, evaluation/reporting, plus SQL transformations with SQLMesh. Use for dataset exploration, feature design, model selection, metrics and slice analysis, model cards/eval reports, experiment reproducibility, and production handoff (monitoring and retraining).

🇺🇸|EnglishTranslated