How to implement CDC from on-premises sources to Azure synapse?

Neeraj Sharma 1 Reputation point
2021-09-13T06:26:14.287+00:00

I need to implement CDC on on-premises data sources such as sql server/ excel / oracle/ api's to azure synapse
Could anyone please explain steps/demo how to implement solution.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,950 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,666 questions
{count} votes

3 answers

Sort by: Most helpful
  1. ShaikMaheer-MSFT 38,046 Reputation points Microsoft Employee
    2021-09-14T06:29:40.307+00:00

    Hi @Neeraj Sharma ,

    Welcome to Microsoft Q&A Platform. Thank you for posting your query here.

    You can leverage Synapse Pipelines or Azure data factory pipelines to implement CDC.

    There are may tutorials or articles documented to explain how to perform Incremental copy from different types of source data such as structured data or semi-structured/unstructured data.

    Below are few documentation links for same. Kindly check them for better understanding.

    Please Note, Logic explained above can also be applied on on-prem SQL as well. We should consider installing Self-hosted IR to access our on-prem data.

    Hope this will help. Please let us know if any further queries.

    ------------------------------

    • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you. Original posters help the community find answers faster by identifying the correct answer. Here is how
    • Want a reminder to come back and check responses? Here is how to subscribe to a notification
    • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators

  2. Kuldeep Chitrakar 1 Reputation point
    2021-09-14T11:09:31.427+00:00

    If source is on premise SQL Server then

    Export data using BCP utility into files. Use Last modified column to read the only updated data
    Upload these files to Azure storage using AzCopy and then use ADF to read and process it further
    You can also maintain a audit of it by logging the details into a table like

    Batch ID | SrcTableName | StartDate | EndDate | Status

    So next day when you read previous day's EndDate is your start date (> Greater than)

    0 comments No comments

  3. Junpei Liang 0 Reputation points
    2024-03-07T00:53:01.18+00:00

    Just checking in to see if there are any tools we an use to implement CDC from on-premise Oracle database to Azure Synapse

    0 comments No comments