Search Results: data-engineering

Found 41 Skills

data-engineering

Use when "data pipelines", "ETL", "data warehousing", "data lakes", or asking about "Airflow", "Spark", "dbt", "Snowflake", "BigQuery", "data modeling"

🇺🇸|EnglishTranslated

Data Processingpluginagentmarketplace/cu...

data-engineering

Master data engineering, ETL/ELT, data warehousing, SQL optimization, and analytics. Use when building data pipelines, designing data systems, or working with large datasets.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningpluginagentmarketplace/cu...

data-engineering

Data engineering, machine learning, AI, and MLOps. From data pipelines to production ML systems and LLM applications.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingrohitg00/awesome-claude-c...

data-engineering

Data engineering patterns for ETL pipelines, data warehousing, Apache Spark, and data quality validation

🇺🇸|EnglishTranslated

Data Processingsickn33/antigravity-aweso...

data-engineering-data-pipeline

You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.

🇺🇸|EnglishTranslated

AI & Machine Learningancoleman/ai-design-compo...

ai-data-engineering

Data pipelines, feature stores, and embedding generation for AI/ML systems. Use when building RAG pipelines, ML feature serving, or data transformations. Covers feature stores (Feast, Tecton), embedding pipelines, chunking strategies, orchestration (Dagster, Prefect, Airflow), dbt transformations, data versioning (LakeFS), and experiment tracking (MLflow, W&B).

🇺🇸|EnglishTranslated

11 scripts/Attention

Product & Designsickn33/antigravity-aweso...

data-engineering-data-driven-feature

Build features guided by data insights, A/B testing, and continuous measurement using specialized agents for analysis, implementation, and experimentation.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-observability

Observability and monitoring for data pipelines using OpenTelemetry (traces) and Prometheus (metrics). Covers instrumentation, dashboards, and alerting.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-storage-remote-access-integrations-duckdb

Using DuckDB with remote cloud storage via HTTPFS extension, fsspec, and Delta Lake integration. Covers S3, GCS, Azure, and S3-compatible endpoints.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-storage-remote-access-libraries-pyarrow-fs

Native Arrow filesystem integration with PyArrow. Optimized for Parquet workflows, zero-copy data transfer, predicate pushdown, and column pruning. Covers S3, GCS, HDFS with PyArrow datasets.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-storage-remote-access-integrations-pandas

Reading and writing data with Pandas from/to cloud storage (S3, GCS, Azure) using fsspec and PyArrow filesystems.

🇺🇸|EnglishTranslated

Data Processinglegout/data-platform-agen...

data-engineering-storage-remote-access-integrations-delta-lake

Delta Lake integration with cloud storage (S3, GCS, Azure). Covers storage_options, PyArrow filesystem, time travel, and partitioned writes.

🇺🇸|EnglishTranslated