Question 1

What's the difference between Azure Data Factory and Azure Synapse Pipelines?

Accepted Answer

Both offer nearly the same pipeline authoring experience. Azure Data Factory is a standalone data integration service for orchestrating data movement and transformation across many systems. Synapse Pipelines provides similar orchestration capabilities inside an Azure Synapse Analytics workspace, which makes it the convenient choice when Synapse is already your primary analytics environment.

Question 2

When should I use Data Factory?

Accepted Answer

Use Azure Data Factory when you need to regularly move data between systems (for example, on a schedule or triggered by events), orchestrate multi-step workflows with dependencies, and connect to many data sources. Common scenarios include loading data into a data warehouse/lake, copying data between on-premises and cloud, and coordinating transformations using mapping data flows or external compute like Databricks.
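To make "multi-step workflows with dependencies" concrete, here is a minimal Python sketch. It is not Data Factory's API or its exact pipeline JSON: the pipeline name, the activity names, and the simplified `dependsOn` lists are hypothetical, loosely modeled on how a pipeline declares which activities must finish before another starts. It only shows how such declarations determine a run order; the service resolves this for you.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical, simplified pipeline description (not the real ADF schema).
# Each activity lists the activities that must finish before it may start.
pipeline = {
    "name": "LoadWarehouse",
    "activities": [
        {"name": "CopyFromOnPrem", "type": "Copy", "dependsOn": []},
        {
            "name": "TransformWithDatabricks",
            "type": "DatabricksNotebook",
            "dependsOn": ["CopyFromOnPrem"],
        },
        {
            "name": "LoadWarehouseTables",
            "type": "Copy",
            "dependsOn": ["TransformWithDatabricks"],
        },
    ],
}


def execution_order(pipeline: dict) -> list[str]:
    """Return activity names in an order that respects every dependency."""
    # Map each activity to the set of activities it depends on.
    graph = {a["name"]: set(a["dependsOn"]) for a in pipeline["activities"]}
    # static_order() yields each node only after all of its predecessors.
    return list(TopologicalSorter(graph).static_order())


order = execution_order(pipeline)
print(order)
```

In this sketch the copy step runs first, the transformation second, and the warehouse load last. A circular dependency would raise `graphlib.CycleError` here instead of producing an order.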

Question 3

How much does Data Factory cost?

Accepted Answer

Pricing is usage-based. The main cost drivers are typically pipeline orchestration (activity runs), data movement (copy activity and integration runtime usage), and transformation compute (for example, mapping data flows run on managed Spark clusters billed by time and capacity). Costs vary by region, number of runs, data volume, and whether you use self-hosted or managed integration runtimes.
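As a rough illustration of how these drivers add up, the sketch below sums the three components. Everything in it is an assumption for illustration: the `Rates` fields, the billing units (per 1,000 activity runs, per integration-runtime hour, per compute hour), and especially the rate values, which are placeholders and not Azure prices. Take the real figures for your region from the official pricing page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Hypothetical unit prices; replace with figures from the pricing page."""
    per_1000_activity_runs: float
    per_data_movement_hour: float
    per_data_flow_compute_hour: float


def estimate_monthly_cost(
    activity_runs: int,
    data_movement_hours: float,
    data_flow_compute_hours: float,
    rates: Rates,
) -> dict[str, float]:
    """Break a month's usage down into the three cost drivers and a total."""
    orchestration = activity_runs / 1000 * rates.per_1000_activity_runs
    data_movement = data_movement_hours * rates.per_data_movement_hour
    transformation = data_flow_compute_hours * rates.per_data_flow_compute_hour
    return {
        "orchestration": orchestration,
        "data_movement": data_movement,
        "transformation": transformation,
        "total": orchestration + data_movement + transformation,
    }


# Placeholder rates, NOT real Azure prices.
PLACEHOLDER_RATES = Rates(
    per_1000_activity_runs=1.0,
    per_data_movement_hour=0.25,
    per_data_flow_compute_hour=0.5,
)

estimate = estimate_monthly_cost(
    activity_runs=30_000,
    data_movement_hours=100,
    data_flow_compute_hours=50,
    rates=PLACEHOLDER_RATES,
)
print(estimate)
```

Keeping the components separate makes it easy to see which driver dominates, for example whether frequent small runs or long-running data flows account for most of the bill.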
