Search Results: data-quality

Found 46 Skills

Data Processingsickn33/antigravity-aweso...

database

Database development and operations workflow covering SQL, NoSQL, database design, migrations, optimization, and data engineering.

🇺🇸|EnglishTranslated

Tools & Utilitiesdatadrivenconstruction/dd...

bim-consistency-checker

Check BIM model consistency: naming conventions, parameter completeness, spatial relationships, and data integrity across model elements.

🇺🇸|EnglishTranslated

Data Processingmembranedev/application-s...

melissa-data

Melissa Data integration. Manage data, records, and automate workflows. Use when the user wants to interact with Melissa Data data.

🇺🇸|EnglishTranslated

Data Processingmajesticlabs-dev/majestic...

great-expectations

Data validation using Great Expectations. Expectation suites, checkpoints, and data docs for pipeline monitoring.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingposthog/skills

auditing-warehouse-data-health

Audit the health of a PostHog project's data warehouse — find every broken or degraded pipeline item across sources, sync schemas, materialized views, batch exports, and transformations. Use when the user asks "what's broken in my warehouse?", "give me a health check", "audit my data pipeline", "why are some dashboards stale?", or wants a one-shot triage summary before deciding where to spend time. Produces a prioritized report of issues grouped by severity and type, with recommended next steps.

🇺🇸|EnglishTranslated

AI & Machine Learningprakharmnnit/skills-and-p...

backend-principle-eng-python-ml-pro-max

Principal backend engineering intelligence for Python AI/ML systems. Actions: plan, design, build, implement, review, fix, optimize, refactor, debug, secure, scale ML services and pipelines. Focus: data quality, reproducibility, reliability, performance, security, observability, model evaluation, MLOps.

🇺🇸|EnglishTranslated

AI & Machine Learningsundial-org/skills

training-data-curation

Guidelines for creating high-quality datasets for LLM post-training (SFT/DPO/RLHF). Use when preparing data for fine-tuning, evaluating data quality, or designing data collection strategies.

🇺🇸|EnglishTranslated

Data Processingborghei/claude-skills

senior-data-engineer

Expert data engineering covering data pipelines, ETL/ELT, data warehousing, streaming, and data quality.

🇺🇸|EnglishTranslated

Data Processingerichowens/some_claude_sk...

data-pipeline-engineer

Expert data engineer for ETL/ELT pipelines, streaming, data warehousing. Activate on: data pipeline, ETL, ELT, data warehouse, Spark, Kafka, Airflow, dbt, data modeling, star schema, streaming data, batch processing, data quality. NOT for: API design (use api-architect), ML training (use ML skills), dashboards (use design skills).

🇺🇸|EnglishTranslated

3 scripts/Attention

Data Processingcasper-studios/casper-mar...

csv-analyzer

Comprehensive CSV data analysis and visualization tool. Use this skill when analyzing CSV files, generating data summaries, creating visualizations from data, detecting outliers, finding correlations, assessing data quality, or creating data reports. Triggers on CSV analysis, data exploration, data visualization, data profiling, statistical analysis, or data quality assessment requests.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingthe-perfect-developer/the...

pandera

This skill should be used when the user asks to "validate a DataFrame with pandera", "write a pandera schema", "use pandera DataFrameModel", "add data validation to a pipeline", or needs guidance on pandera best practices for data quality.

🇺🇸|EnglishTranslated

Data Processinggithub/awesome-copilot

geofeed-tuner

Use this skill whenever the user mentions IP geolocation feeds, RFC 8805, geofeeds, or wants help creating, tuning, validating, or publishing a self-published IP geolocation feed in CSV format. Intended user audience is a network operator, ISP, mobile carrier, cloud provider, hosting company, IXP, or satellite provider asking about IP geolocation accuracy, or geofeed authoring best practices. Helps create, refine, and improve CSV-format IP geolocation feeds with opinionated recommendations beyond RFC 8805 compliance. Do NOT use for private or internal IP address management — applies only to publicly routable IP addresses.

🇺🇸|EnglishTranslated