Loading...
Loading...
Data lake and lakehouse platform patterns: ingestion/CDC, transformations, open table formats (Iceberg/Delta/Hudi), query and serving engines (Trino/ClickHouse/DuckDB), orchestration, governance/lineage, cost and operations. Self-hosted and cloud options.
npx skill4agent add vasilyu1983/ai-agents-public data-lake-platformreferences/storage-formats.mdassets/cross-platform/template-schema-evolution.mdassets/cross-platform/template-partitioning-strategy.mdreferences/ingestion-patterns.mdassets/cross-platform/template-ingestion-governance-checklist.mdassets/cross-platform/template-incremental-loading.mdreferences/transformation-patterns.mdassets/cross-platform/template-data-pipeline.mdreferences/query-engine-patterns.mdreferences/governance-catalog.mdassets/cross-platform/template-data-quality-governance.mdassets/cross-platform/template-data-quality.mdreferences/operational-playbook.mdreferences/cost-optimization.mdassets/cross-platform/template-data-quality-backfill-runbook.mdassets/cross-platform/template-cost-optimization.mdreferences/architecture-patterns.mdreferences/architecture-patterns.mdreferences/streaming-patterns.mdreferences/overview.mdpip install "dlt[clickhouse]"
dlt init rest_api clickhouse
python pipeline.pypip install sqlmesh
sqlmesh init duckdb
sqlmesh plan && sqlmesh run| Resource | Purpose |
|---|---|
| references/overview.md | Diagrams and decision flows |
| references/architecture-patterns.md | Medallion, data mesh |
| references/ingestion-patterns.md | dlt vs Airbyte, CDC |
| references/transformation-patterns.md | SQLMesh vs dbt |
| references/storage-formats.md | Iceberg vs Delta |
| references/query-engine-patterns.md | ClickHouse, DuckDB |
| references/streaming-patterns.md | Kafka, Flink |
| references/orchestration-patterns.md | Dagster, Airflow |
| references/bi-visualization-patterns.md | Metabase, Superset |
| references/cost-optimization.md | Cost levers and maintenance |
| references/operational-playbook.md | Monitoring and incident response |
| references/governance-catalog.md | Catalog, lineage, access control |
| Template | Purpose |
|---|---|
| assets/cross-platform/template-medallion-architecture.md | Baseline bronze/silver/gold plan |
| assets/cross-platform/template-data-pipeline.md | End-to-end pipeline skeleton |
| assets/cross-platform/template-ingestion-governance-checklist.md | Source onboarding checklist |
| assets/cross-platform/template-incremental-loading.md | Incremental + backfill plan |
| assets/cross-platform/template-schema-evolution.md | Schema change rules |
| assets/cross-platform/template-cost-optimization.md | Cost control checklist |
| assets/cross-platform/template-data-quality-governance.md | Quality contracts + SLOs |
| assets/cross-platform/template-data-quality-backfill-runbook.md | Backfill incident/runbook |
| Skill | Purpose |
|---|---|
| ai-mlops | ML deployment |
| ai-ml-data-science | Feature engineering |
| data-sql-optimization | OLTP optimization |