Search Results: data-orchestration

Found 7 Skills

Data Processingdtsong/data-engineering-s...

streaming-data-skill

Use this skill when building real-time or near-real-time data pipelines. Covers Kafka, Flink, Spark Streaming, Snowpipe, BigQuery streaming, materialized views, and batch-vs-streaming decisions. Common phrases: "real-time pipeline", "Kafka consumer", "streaming vs batch", "low latency ingestion". Do NOT use for batch integration patterns (use integration-patterns-skill) or pipeline orchestration (use data-orchestration-skill).

🇺🇸|EnglishTranslated

Data Processingdagster-io/skills

dagster-integrations

Skill that helps users discover and understand Dagster integration libraries. Used when users have requests related to integrating with other tools / technologies, or when have users have questions related to specific integration libraries (dagster-*).

🇺🇸|EnglishTranslated

Data Processingasgard-ai-platform/skills

tech-data-pipeline

Design data pipelines covering ETL vs ELT architectures, data source integration, scheduling, quality checks, and warehouse design. Use this skill when the user needs to move data between systems, build a data warehouse, automate data processing, or improve data reliability — even if they say 'move data from X to Y', 'build an ETL pipeline', 'our data is a mess', or 'set up a data warehouse'.

🇺🇸|EnglishTranslated

Data Processinggemini-cli-extensions/dat...

gcp-data-pipelines

Primary entry point for building, managing, and orchestrating data pipelines on Google Cloud. Guides users to the appropriate skill for dbt, Dataflow (Apache Beam), Dataform, Spark (Dataproc Serverless), BigQuery Data Transfer Service (DTS) or orchestration pipeline using Cloud Composer. Clarify requirements and resolve ambiguity for creating, updating and running data pipelines.

🇺🇸|EnglishTranslated

Data Processingmasthead-data/for-agents

optimize-data-model-compute

Optimize BigQuery compute costs by assigning data models (Dataform, dbt, Airflow) to slot reservations or on-demand compute based on Masthead recommendations.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-observability

Observability and monitoring for data pipelines using OpenTelemetry (traces) and Prometheus (metrics). Covers instrumentation, dashboards, and alerting.

🇺🇸|EnglishTranslated

Data Processingmajesticlabs-dev/majestic...

etl-patterns

Production ETL patterns orchestrator. Routes to core reliability patterns and incremental load strategies.

🇺🇸|EnglishTranslated