Dear experts,
I am analysing the different options to load data into azure synapses dedicated sql pool and found relevant differences in the DWU consumption.
I tried 3 different methods:
- Transforming the data in a data flow and then insert it with a sink with staging enable (is this using polybase in the background?)
- Transforming the data in a data flow, copy it to an ADLS folder in parquet format, then moving it to the database using a copy activity with Polybase.
- Transforming the data in a data flow, copy it to an ADLS folder in parquet format, then moving it to the database using a copy activity with the copy command.
After many tests I realized the DWU usage for the first method, the insert using a sink in a data flow, consumes around 4 times more DWUs:
Could someone explain this behavior?
I repeated the tests several time, so this is not occasionally happens.