Datasets, regions, and sinks supported by Microsoft Graph Data Connect
Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.
Datasets
Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and use the Schema tab to view it.
Azure Active Directory
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| BasicDataSet_v0.User_v0 | Contains user information (DisplayName, UserPrincipalName, and other information). | Free | |
| BasicDataSet_v0.User_v1 | Contains user information. | Free | |
| BasicDataSet_v0.Manager_v0 | Contains user information for the manager of each user. | Free | |
| BasicDataSet_v0.DirectReport_v0 | Contains user information about the employees that directly report to each user. | Free |
Outlook and Exchange Online
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| BasicDataSet_v0.CalendarView_v0 | Contains the events from the Calendar view. | Paid | |
| BasicDataSet_v0.Contact_v0 | Contains the available information from each user’s address book. | Paid | |
| BasicDataSet_v0.Contact_v1 | Contains contact information from each user's address book. | Paid | |
| BasicDataSet_v0.Event_v0 | Contains the information from a user’s calendar events. | Paid | |
| BasicDataSet_v0.Event_v1 | Contains the events in each user's calendar. | Paid | |
| BasicDataSet_v0.Inbox_v1 | Contains the mail folders from each user's inbox. | Paid | |
| BasicDataSet_v0.MailboxSettings_v0 | Contains the mailbox settings of each user. | Free | |
| BasicDataSet_v0.MailFolder_v0 | Contains the mail folders from each user's mailbox. | Paid | |
| BasicDataSet_v0.Message_v0 | Contains the email messages from a user’s mailbox. | Paid | |
| BasicDataSet_v0.Message_v1 | Contains the email message in each user's mailbox. | Paid | |
| BasicDataSet_v0.SentItem_v0 | Contains the messages sent from each user's mailbox. | Paid | |
| BasicDataSet_v0.SentItem_v1 | Contains the message sent from each user's mailbox. | Paid | |
| BasicDataSet_v0.OutlookGroupConversations_v0 | Contains the message sent from each user's mailbox. | Paid |
Microsoft Teams
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| BasicDataSet_v0.TeamChat_v1 | Contains Teams chat messages for one-on-one and group chat messages. This dataset excludes chat messages explicitly deleted by users. | Paid | |
| BasicDataset_v0.TeamChannelDetails_v0 | Contains a Teams Channel details. | Paid | |
| BasicDataset_v0.TeamStandardChannelMessages_v0 | Contains a Teams standard channel message details. | Paid | |
| BasicDataSet_v0.TeamsCallRecords_v1 | Contains activity records from Teams calls and meetings. | Paid | |
| BasicDataSet_v0.TeamsChannelDetails_v0 | Contains details about Channels in a team. | Paid | |
| BasicDataSet_v0.TeamsStandardChannelMessages_v0 | Contains channel posts and messages from Standard Channels in Teams. | Paid | |
| BasicDataSet_v0.TeamsTranscript_v1 | Contains transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. | Paid |
Microsoft Groups
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| BasicDataSet_v0.GroupDetails_v0 | Contains a Teams Chat Group details. | Free | |
| BasicDataSet_v0.GroupMembers_v0 | Contains a Teams Chat Group member details. | Free | |
| BasicDataSet_v0.GroupOwners_v0 | Contains the Teams Chat Group owners' details. | Free |
OneDrive and SharePoint Online
Please consider that these datasets are available after 48 hours (about two days). For instance, you can query data for 07/01 starting on 07/03.
The data available is from the last 21 days (about three weeks). For instance, if you would like to query data for 07/01, you can do so from 07/03 to 07/24.
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| DocumentSharingDataset_v0_Preview | Contains information about sharing permissions of documents. | Free (under preview only). These datasets will be available to those in Preview. You must join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview. | |
| SharePointSitesDataset_v0_Preview | Contains information about SharePoint sites. | Free (under preview only). These datasets will be available to those in Preview. You will need to join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview. | |
| SharePointGroupsDataset_v0_Preview | Contains SharePoint group information, including details about group members. | Free (under preview only). These datasets will be available to those in Preview. You must join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview. |
Viva Insights
| Dataset name | Description | Sample and Schema | Billing |
|---|---|---|---|
| VivaInsightsDataset_v0 | Contains Viva Insights metrics. | Requires Viva Insights license |
Regions
Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 datacenter into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.
The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.
| Office region | Azure region |
|---|---|
| Asia-Pacific |
|
| Australia |
|
| Europe |
|
| North America |
|
| United Kingdom |
|
Sinks
Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:
The following characteristics apply to sinks:
- The output files are of format JSON lines. The output format is fixed, and there is no support for modifying the format of the output. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).
- Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.
- When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.
See also
Зворотний зв’язок
Надіслати й переглянути відгук про