Supported data stores in Azure Data Share
Azure Data Share provides open and flexible data sharing, including the ability to share from and to different data stores. Data providers can share data from one type of data store, and their data consumers can choose which data store to receive data into.
In this article, you'll learn about the rich set of Azure data stores that are supported in Azure Data Share. You can also find information on the combinations of data stores that can be leveraged by data providers and data consumers.
What data stores are supported in Azure Data Share?
The below table details the supported data sources for Azure Data Share.
|Data store||Snapshot-based sharing||In-place sharing|
|Azure Blob storage||✓|
|Azure Data Lake Storage Gen1||✓|
|Azure Data Lake Storage Gen2||✓|
|Azure SQL Database||Public Preview|
|Azure Synapse Analytics (formerly Azure SQL DW)||Public Preview|
|Azure Data Explorer||Public Preview|
Data store support matrix
Azure Data Share offers data consumers flexibility when deciding on a data store to accept data in to. For example, data being shared from Azure SQL Database can be received into Azure Data Lake Store Gen2, Azure SQL Database or Azure Synapse Analytics. Customers can choose which format to receive data in when configuring a received data share.
The below table details different combinations and choices that data consumers have when accepting and configuring their data share. For more information on how to configure dataset mappings, see how to configure dataset mappings.
|Azure Blob Storage||Azure Data Lake Storage Gen1||Azure Data Lake Storage Gen2||Azure SQL Database||Azure Synapse Analytics|
|Azure Blob storage||✓||✓|
|Azure Data Lake Storage Gen1||✓||✓|
|Azure Data Lake Storage Gen2||✓||✓|
|Azure SQL Database||✓||✓||✓||✓|
|Azure Synapse Analytics (formerly Azure SQL DW)||✓||✓||✓||✓|
Share from a storage account
Azure Data Share supports sharing of files, folders and file systems from Azure Data Lake Gen1 and Azure Data Lake Gen2. It also supports sharing of blobs, folders and containers from Azure Blob Storage. When folders are shared in snapshot-based sharing, data consumer can can choose to make a full copy of the share data, or leverage incremental snapshot capability to copy only new or updated files. Existing files with the same name will be overwritten.
Share from a SQL-based source
Azure Data Share supports sharing of tables or views from Azure SQL Database and Azure Synapse Analytics (formerly Azure SQL DW). Data consumer can choose to accept the data into Azure Data Lake Storage Gen2 or Azure Blob Storage as csv or parquet file. Full snapshot overwrites the content of the target file. Alternatively, data consumer can accept the data into a SQL table. If the target SQL table is not available on the data consumer side, Azure Data Share creates the SQL table with the source schema. Full snapshot appends content of the source table to the target SQL table. Incremental snapshot is currently not supported.
Share from Azure Data Explorer
Azure Data Share supports the ability to share databases in-place from Azure Data Explorer clusters. Data provider can share at the database or cluster level. When shared at database level, data consumer will only be able to access the specific database(s) shared by the data provider. When shared at cluster level, data consumer can access all the databases from the provider's cluster, including any future databases created by the data provider.
To access shared databases, data consumer needs to have its own Azure Data Explorer cluster. Data consumer's Azure Data Explorer cluster needs to locate in the same Azure data center as the data provider's Azure Data Explorer cluster. When sharing relationship is established, Azure Data Share creates a symbolic link between the provider and consumer's Azure Data Explorer clusters.
Azure Data Explorer supports two modes of data ingestion: batch and streaming. Data received from batch in the shared database will appear between a few seconds to a few minutes on the data consumer side. Data received from streaming could take up to 24 hours to appear on the data consumer side.
To learn how to start sharing data, continue to the share your data tutorial.