Found 40 Skills
Data Quality Checker - Auto-activating skill for Data Pipelines. Triggers on: data quality checker. Part of the Data Pipelines skill category.
Database development and operations workflow covering SQL, NoSQL, database design, migrations, optimization, and data engineering.
Interactive tutorial that teaches Snowflake Dynamic Tables hands-on. The agent guides users step-by-step through building data pipelines with automatic refresh, incremental processing, and CDC patterns. Use when the user wants to learn dynamic tables, build a DT pipeline, or understand DT vs streams/tasks/materialized views.
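A minimal sketch of the kind of step this tutorial walks through, assuming the snowflake-connector-python package; the connection details, table, and warehouse names are placeholders, not part of the skill itself:

```python
import snowflake.connector

# Hypothetical connection; real values come from your Snowflake account.
conn = snowflake.connector.connect(
    account="my_account",
    user="my_user",
    password="...",
    warehouse="COMPUTE_WH",
    database="DEMO_DB",
    schema="PUBLIC",
)

# TARGET_LAG tells Snowflake how stale the table may get before an
# automatic refresh; the engine chooses incremental vs. full refresh
# based on the query shape.
conn.cursor().execute("""
    CREATE OR REPLACE DYNAMIC TABLE order_totals
      TARGET_LAG = '1 minute'
      WAREHOUSE = COMPUTE_WH
      AS
      SELECT customer_id, SUM(amount) AS total_amount
      FROM raw_orders
      GROUP BY customer_id
""")
```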
Data processing expert covering parsing, transformation, and validation.
Guides understanding and working with Apache Beam runners (Direct, Dataflow, Flink, Spark, etc.). Use when configuring pipelines for different execution environments or debugging runner-specific issues.
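A minimal sketch of runner selection in the Beam Python SDK; the pipeline body is illustrative, and a production runner such as Dataflow would need extra options (project, region, temp location) not shown here:

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

# Swap "DirectRunner" for "DataflowRunner", "FlinkRunner", or
# "SparkRunner" to retarget the same pipeline.
options = PipelineOptions(runner="DirectRunner")  # local in-process execution

with beam.Pipeline(options=options) as p:
    (
        p
        | "Create" >> beam.Create([1, 2, 3])
        | "Double" >> beam.Map(lambda x: x * 2)
        | "Print" >> beam.Map(print)
    )
```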
Schema Validator - Auto-activating skill for Data Pipelines. Triggers on: schema validator. Part of the Data Pipelines skill category.
Pipeline state management for Goldsky Turbo — pause, resume, restart, and delete commands with their rules and safety behavior. Use this skill when the user asks: will deleting my pipeline lose the data already in my postgres/clickhouse table; how do I pause a pipeline while doing database maintenance; how do I restart from block zero to reprocess all historical data; can I update a running streaming pipeline in place, or do I have to delete and redeploy; will resuming a paused pipeline pick up from where it left off (checkpoint); how do I re-run a completed job pipeline from the beginning; can I pause or restart a job-mode pipeline. Also covers what happens to checkpoint state on delete, and job auto-deletion 1 hour after termination. For actively diagnosing why a pipeline is broken or erroring, use /turbo-doctor instead.
This skill should be used when the user asks to "validate a DataFrame with pandera", "write a pandera schema", "use pandera DataFrameModel", "add data validation to a pipeline", or needs guidance on pandera best practices for data quality.
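A minimal sketch of the DataFrameModel pattern the skill covers; the column names and constraints are invented for illustration:

```python
import pandas as pd
import pandera as pa
from pandera.typing import Series

class OrderSchema(pa.DataFrameModel):
    # Declarative column checks: dtype plus value constraints.
    order_id: Series[int] = pa.Field(ge=1, unique=True)
    amount: Series[float] = pa.Field(gt=0)
    status: Series[str] = pa.Field(isin=["open", "shipped", "cancelled"])

    class Config:
        coerce = True  # cast columns to declared dtypes before checking

df = pd.DataFrame({
    "order_id": [1, 2],
    "amount": [9.99, 24.50],
    "status": ["open", "shipped"],
})

validated = OrderSchema.validate(df)  # raises SchemaError on violations
```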
Generates importable n8n workflow JSON files that sync data between Personize and 400+ apps. Produces ready-to-import workflows for batch sync, webhook ingestion, per-record AI enrichment, and data export — no code required. Use this skill whenever the user wants no-code integrations, visual workflows, n8n automation, or to connect Personize to HubSpot, Salesforce, Google Sheets, Slack, Postgres, or any app without writing code. Also trigger when they mention 'workflow automation', 'scheduled sync without code', 'visual pipeline', or 'connect Personize to [app]' and don't want to write TypeScript.
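A minimal sketch of the importable workflow JSON shape such a generator would emit, assuming a Webhook trigger feeding an HTTP Request node; the node parameters and target URL are placeholders, and a real Personize sync would carry many more parameters plus stored-credential references:

```python
import json

workflow = {
    "name": "Demo webhook sync",
    "nodes": [
        {
            "name": "Webhook",
            "type": "n8n-nodes-base.webhook",
            "typeVersion": 1,
            "position": [250, 300],
            "parameters": {"path": "incoming-records", "httpMethod": "POST"},
        },
        {
            "name": "HTTP Request",
            "type": "n8n-nodes-base.httpRequest",
            "typeVersion": 1,
            "position": [500, 300],
            "parameters": {"url": "https://example.com/api/records"},
        },
    ],
    # connections map a source node to the node inputs it feeds
    "connections": {
        "Webhook": {"main": [[{"node": "HTTP Request", "type": "main", "index": 0}]]}
    },
}

with open("demo-workflow.json", "w") as f:
    json.dump(workflow, f, indent=2)  # file is importable via the n8n UI
```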
Refactor Pandas code to improve maintainability, readability, and performance. Identifies and fixes loops/.iterrows() that should be vectorized, overuse of .apply() where vectorized alternatives exist, chained indexing patterns, inplace=True usage, inefficient dtypes, missing method chaining opportunities, complex filters, merge operations without validation, and SettingWithCopyWarning patterns. Applies Pandas 2.0+ features including PyArrow backend, Copy-on-Write, vectorized operations, method chaining, .query()/.eval(), optimized dtypes, and pipeline patterns.
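A minimal before/after sketch of the refactors described, using invented columns:

```python
import pandas as pd

df = pd.DataFrame({"qty": [2, 5, 1], "unit_price": [9.99, 3.50, 120.0]})

# Before: row-wise loop, slow and prone to SettingWithCopyWarning.
totals = []
for _, row in df.iterrows():
    totals.append(row["qty"] * row["unit_price"])
df["total"] = totals

# After: vectorized arithmetic in a single method chain.
result = (
    df.assign(total=lambda d: d["qty"] * d["unit_price"])
      .query("total > 10")  # expression filter instead of chained boolean masks
      .convert_dtypes(dtype_backend="pyarrow")  # PyArrow dtypes (pandas 2.0+)
)
```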
Centered on the global nickel supply structure, this skill quantifies each country's degree of market dominance (e.g., Indonesia), supply volumes from major mining regions, and the impact of policy quota and production-cut scenarios on the global supply-demand balance and price asymmetry.
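A minimal sketch of the quantification step, assuming illustrative (not real) production figures and using the standard Herfindahl-Hirschman index as the concentration measure:

```python
# Hypothetical mined-nickel supply by country, in kilotonnes.
supply_kt = {"Indonesia": 1800, "Philippines": 330, "Russia": 220, "Other": 650}

total = sum(supply_kt.values())
shares = {c: v / total for c, v in supply_kt.items()}

# HHI: sum of squared shares; values nearer 1.0 mean more concentrated supply.
hhi = sum(s ** 2 for s in shares.values())

# Scenario: Indonesia cuts output by 20% via a policy quota.
cut_fraction = 0.20
shortfall_kt = supply_kt["Indonesia"] * cut_fraction

print(f"Indonesia share: {shares['Indonesia']:.1%}, HHI: {hhi:.3f}")
print(f"Quota scenario removes {shortfall_kt:.0f} kt, "
      f"{shortfall_kt / total:.1%} of global supply")
```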
Develops data processing pipelines, integrations, and machine learning scenarios in SAP Data Intelligence Cloud. Use when building graphs/pipelines with operators, integrating ABAP/S4HANA systems, creating replication flows, developing ML scenarios with JupyterLab, or using Data Transformation Language functions. Covers Gen1/Gen2 operators, subengines (Python, Node.js, C++), structured data operators, and repository objects.
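A minimal sketch of a Gen1 Python-subengine operator script, assuming the `api` object that the SAP Data Intelligence runtime injects into operator scripts (it is not importable standalone); the port names are placeholders matching whatever the graph defines:

```python
# Runs inside a Python operator in a graph; `api` is provided by the
# subengine at runtime, so this file is not executable on its own.

def on_input(msg):
    # Transform the incoming message body and forward it downstream.
    record = msg.body if hasattr(msg, "body") else msg
    api.send("output", api.Message(record))

# Register the callback on the operator's "input" port.
api.set_port_callback("input", on_input)
```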