ETL

Definition

Extract, Transform, Load - a crucial data integration process that moves and transforms data from various sources into a data warehouse for analysis.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between ETL and ELT?
ETL transforms data before loading it into the target system (like a data warehouse). ELT loads raw data first and then transforms it inside the target system using its compute (common with modern cloud warehouses). ETL is often used when you need strict data quality checks or when the target system isn’t designed for heavy transformations.
When should I use ETL?
Use ETL when you need to clean, validate, standardize, or mask data before it reaches the destination (for example, enforcing schemas, removing duplicates, applying business rules, or protecting sensitive fields). ETL is also a good fit when integrating many legacy sources or when downstream systems require curated, consistent datasets.
How much does ETL cost?
ETL cost depends on data volume, transformation complexity, frequency (batch vs near-real-time), connector/licensing needs, and where compute runs. Managed services typically charge for orchestration and/or processing time, plus underlying storage and network egress. Costs rise with large scans, complex joins, frequent runs, and high-throughput streaming.

Category: data

Difficulty: advanced

Related Terms

See Also