# ETL Patterns
Orchestrator for production-grade Extract-Transform-Load patterns.
## Skill Routing
| Need | Skill | Content |
|---|---|---|
| Reliability patterns | | Idempotency, checkpointing, error handling, chunking, retry, logging |
| Load strategies | | Backfill, timestamp-based, CDC, pipeline orchestration |
## Pattern Selection Guide

### By Reliability Need
| Need | Pattern | Skill |
|---|---|---|
| Repeatable runs | Idempotency | |
| Resume after failure | Checkpointing | |
| Handle bad records | Error handling + DLQ | |
| Memory management | Chunked processing | |
| Network resilience | Retry with backoff | |
| Observability | Structured logging | |
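The retry-with-backoff row above can be sketched as a small helper. This is an illustrative sketch, not the skill's actual API; the function name and parameters are assumptions:

```python
import random
import time


def retry_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=30.0):
    """Call fn, retrying on any exception with exponential backoff and jitter."""
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            # Exponential backoff, capped at max_delay, with full jitter
            # so many workers retrying at once don't stampede the source.
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            time.sleep(random.uniform(0, delay))
```

In production you would typically narrow the `except` clause to transient errors (timeouts, connection resets) rather than catching everything.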
### By Load Strategy
| Scenario | Pattern | Skill |
|---|---|---|
| Small tables (<100K) | Full refresh | |
| Large tables | Timestamp incremental | |
| Real-time sync | CDC events | |
| Historical migration | Parallel backfill | |
| Zero-downtime refresh | Swap pattern | |
| Multi-step pipelines | Pipeline orchestration | |
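As one possible shape for the timestamp-incremental row, here is a sketch against SQLite; the signature, the raw-SQL approach, and the strict `>` comparison are all assumptions, not the skill's API:

```python
import sqlite3


def incremental_by_timestamp(conn, table, ts_column, last_seen):
    """Extract only rows updated since the last checkpoint (illustrative sketch).

    Strict '>' means re-running with the same checkpoint never re-reads
    already-processed rows, at the cost of skipping rows that share the
    exact checkpoint timestamp.
    """
    # Table/column names cannot be bound parameters, so they are interpolated
    # into the SQL string; only use trusted identifiers here.
    query = f"SELECT * FROM {table} WHERE {ts_column} > ? ORDER BY {ts_column}"
    return conn.execute(query, (last_seen,)).fetchall()
```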
## Quick Reference

### Idempotency Options
```python
# Small datasets: Delete-then-insert
# Large datasets: UPSERT on conflict
# Change detection: Row hash comparison
```
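A sketch of the UPSERT option, using SQLite's `ON CONFLICT` clause (PostgreSQL accepts the same syntax); the table, columns, and function signature are illustrative assumptions:

```python
import sqlite3


def upsert_records(conn, rows):
    """Idempotent load: insert new rows, update existing ones by primary key."""
    with conn:  # wrap the batch in one transaction
        conn.executemany(
            "INSERT INTO users (id, name) VALUES (:id, :name) "
            "ON CONFLICT(id) DO UPDATE SET name = excluded.name",
            rows,
        )
```

Re-running the same batch is safe: the second run updates rows in place instead of failing on duplicate keys.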
### Load Strategy Decision
```
Is table < 100K rows?
  → Full refresh
Has reliable timestamp column?
  → Timestamp incremental
Source supports CDC?
  → CDC event processing
Need zero downtime?
  → Swap pattern (temp table → rename)
One-time historical load?
  → Parallel backfill with date ranges
```
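The decision tree can be encoded as a small helper that asks the questions in the same order; the function name, its parameters, and the return labels are illustrative, not part of the skill:

```python
def choose_load_strategy(row_count, has_timestamp=False, supports_cdc=False,
                         zero_downtime=False, one_time_backfill=False):
    """Walk the decision tree above; the first matching question wins."""
    if row_count < 100_000:
        return 'full_refresh'
    if has_timestamp:
        return 'timestamp_incremental'
    if supports_cdc:
        return 'cdc_events'
    if zero_downtime:
        return 'swap'
    if one_time_backfill:
        return 'parallel_backfill'
    return 'full_refresh'  # fallback when no other signal applies
```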
### Common Pipeline Structure
```python
# 1. Setup
checkpoint = Checkpoint('.etl_checkpoint.json')
processor = ETLProcessor()

# 2. Extract (with incremental)
df = incremental_by_timestamp(source_table, 'updated_at')

# 3. Transform (with error handling)
transformed = processor.process_batch(df.to_dict('records'))

# 4. Load (with idempotency)
upsert_records(pd.DataFrame(transformed))

# 5. Checkpoint
checkpoint.set_last_processed('sync', df['updated_at'].max())

# 6. Handle failures
processor.save_failures('failures/')
```
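The pipeline assumes a `Checkpoint` class; below is a minimal file-backed sketch of what it might look like. The JSON file format and any method beyond `set_last_processed` are assumptions:

```python
import json
import os


class Checkpoint:
    """Minimal file-backed checkpoint store (illustrative sketch)."""

    def __init__(self, path):
        self.path = path
        self.state = {}
        if os.path.exists(path):
            with open(path) as f:
                self.state = json.load(f)

    def get_last_processed(self, key, default=None):
        return self.state.get(key, default)

    def set_last_processed(self, key, value):
        self.state[key] = value
        # Write to a temp file, then rename: os.replace is atomic, so a
        # crash mid-write cannot leave a corrupt checkpoint behind.
        tmp = self.path + '.tmp'
        with open(tmp, 'w') as f:
            json.dump(self.state, f)
        os.replace(tmp, self.path)
```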
## Related Skills
- data-validation - Validate data quality during ETL
- data-quality - Monitor data quality metrics
- pandas-coder - DataFrame transformations