Datasets, regions, and sinks supported by Microsoft Graph Data Connect

Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.

Datasets

Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and go to the Schema tab.

Activities

Dataset name Description Learn more
OutlookContactActivity_v0 Provides employees' activity with their contacts in Microsoft Outlook. OutlookContactActivity_v0 dataset
OutlookMailActivity_v0 Provides employees' activity with their email in Outlook. OutlookMailActivity_v0 dataset
OutlookMeetingActivity_v0 Provides employees' activity with their meetings in Outlook. OutlookMeetingActivity_v0 dataset
TeamsChannelActivity_v0 Providesemployees' activity with their channels in Microsoft Teams. TeamsChannelActivity_v0 dataset
TeamsConversationActivity_v0 Provides employees' activity with their teams and chats in Teams. TeamsConversationActivity_v0 dataset

Call records

Dataset name Description Learn more
TeamsCallRecords_v1 Provides activity records from Teams calls and meetings. TeamsCallRecords_v1 dataset

Channel

Dataset name Description Learn more
TeamsChannelDetails_v0 Generates a list of Microsoft Teams channels. TeamsChannelDetails_v0 dataset

Contact

Dataset name Description Learn more
Contact_v0 Provides contact details available from each user's address book. Contact_v0 dataset
Contact_v1 Provides the contact details available from each user's address book. Contact_v1 dataset

Devices and Licenses

Dataset name Description Learn more
OwnedDevices_v0 Provides detailed information related to all the devices that are owned by each user in the organization. OwnedDevices_v0 dataset
RegisteredDevices_v0 Provides detailed information related to all the devices that a user is registered on in the organization. RegisteredDevices_v0 dataset
LicenseDetails_v0 Provides details for users' licenses that are directly assigned and those transitively assigned through memberships in licensed groups. LicenseDetails_v0 dataset

Event

Dataset name Description Learn more
CalendarView_v0 Provides occurrences, exceptions and single instances of events, based on the calendar view from users' calendars. CalendarView_v0 dataset
ConferenceRoomCalendar_v0 Provides CalendarView data of the Conference Rooms created for a tenant. ConferenceRoomCalendar_v0 dataset
Event_v0 Provides all the events from users' calendars. Event_v0 dataset
Event_v1 Provides all the events from users' calendars. Event_v1 dataset

Group

Dataset name Description Learn more
GroupDetails_v0 Provides the Microsoft Entra ID (Azure AD) groups data for a tenant. GroupDetails_v0 dataset
GroupMembers_v0 Generates a list of direct members of all groups. GroupMembers_v0 dataset
GroupOwners_v0 Retrieves the list of all the group owners. GroupOwners_v0 dataset

Mail

Dataset name Description Learn more
Message_v0 Provides a collection of all the messages received by a user in mail folders. Message_v0 dataset
Message_v1 Provides a collection of all the messages received by a user in mail folders. Message_v1 dataset
SentItems_v0 Provides a collection of all the sent emails by all users of a tenant. SentItems_v0 dataset
SentItems_v1 Provides a collection of all the sent emails with some additional fields. SentItems_v1 dataset

Mail folder

Dataset name Description Learn more
Inbox_v1 Provides the messages from users' mail folders. Inbox_v1 dataset
Mailfolder_v0 Provides information on all the folders created in a user's mailbox. Mailfolder_v0 dataset
Mailfolder_v2 Provides the information on all mail folders created in a user's mailbox. Mailfolder_v2 dataset

Mailbox settings

Dataset name Description Learn more
MailboxSettings_v0 Provides details of all users' mailbox settings. MailboxSettings_v0 dataset

Message

Dataset name Description Learn more
OutlookGroupConversations_v0 Provides a collection of group conversations between users of tenant. OutlookGroupConversations_v0 dataset
TeamChat_v1 Provides Teams chat messages for one-on-one and group chat messages. TeamChat_v1 dataset
TeamChat_v2 Provides Teams chat messages for one-on-one and group chat messages. TeamChat_v2 dataset
TeamsStandardChannelMessages_v0 Provides channel posts and messages from standard channels in Teams. TeamsStandardChannelMessages_v0 dataset

Online meetings

Dataset name Description Learn more
TeamsTranscript_v1 Provides transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. TeamsTranscript_v1 dataset

Org hierarchy

Dataset name Description Learn more
DirectReport_v0 Provides details of all the direct reports for your users. DirectReport_v0 dataset
Manager_v0 Provides a list of users assigned as managers. Manager_v0 dataset

Task

Dataset name Description Learn more
TodoTaskFolders_v0 Identifies task folders in Microsoft Outlook that track user-level work items. TodoTaskFolders_v0 dataset
TodoTasks_v0 Identifies tasks in Microsoft Outlook that track user-level work items. TodoTasks_v0 dataset
PlannerTasks_v0 Identifies tasks in Planner that track user-level work items. PlannerTasks_v0 dataset

User

Dataset name Description Learn more
User_v0 Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts that are created for a particular tenant. User_v0 dataset
User_v1 Provides user details stored for all the Microsoft Entra ID (Azure AD) user accounts. User_v1 dataset

OneDrive and SharePoint Online

Dataset name Description Sample and Schema
SharePointSites_v1 Contains information about SharePoint sites. Formerly known as SharePointSitesDataset_v0_Preview. SharePointSites_v1
SharePointPermissions_v1 Contains information about sharing permissions. Formerly DocumentSharingDataset_v0_Preview. SharePointPermissions_v1
SharePointGroups_v1 Contains SharePoint group information, including details about group members. Formerly SharePointGroupsDataset_v0_Preview. SharePointGroups_v1
SharePointFiles_v1 Contains information about SharePoint files. Coming in June 2024. SharePointFiles_v1
SharePointFileActions_v1 Contains information about SharePoint file actions. Coming in March 2024. SharePointFileActions_v1
OneDriveSyncHealth_v1 Contains information about devices running OneDrive for Business. Coming in March 2024. OneDriveSyncHealth_v1

Viva Insights

Dataset name Description Sample and Schema License
VivaInsightsDataset_PersonReport_v1 Contains Viva Insights metrics. Requires Viva Insights license

Regions

Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 data center into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.

The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.

Office region Azure region
Asia-Pacific
  • East Asia
  • Southeast Asia
Australia
  • Australia East
  • Australia Southeast
Europe
  • North Europe
  • West Europe
North America
  • Central US
  • East US
  • East US 2
  • North Central US
  • South Central US
  • West Central US
  • West US
  • West US 2
Brazil
  • Brazil South
United Kingdom
  • UK South
  • UK West
Canada (CAN)
  • Canada Central
  • Canada East
Japan (JPN)
  • Japan West
  • Japan East
India (IND)
  • South India
  • Central India
Korea (KOR)
  • Korea Central
  • Korea South
Switzerland (CHE)
  • Switzerland North
Germany (DEU)
  • Germany West Central
Norway (NOR)
  • Norway East
France (FRA)
  • France Central
UAE (UAE)
  • UAE North

Sinks

Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:

The following characteristics apply to sinks:

  • Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.

  • When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.

  • For copy activity, the output files are formatted as JSON. This format is fixed and modifying the format isn't supported. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).

  • Mapping data flows: Copy and transform data from Microsoft 365 (Office 365) - Azure Data Factory & Azure Synapse | Microsoft Learn |

    • Output can be in parquet format. For details about the supported data transformations, see Flatten transformation in mapping data flow.

    • Microsoft Graph Data Connect on mapping data flows supports direct output of the data into Azure SQL DB.

 

The following table indicates the areas that are supported for the corresponding copy activity and mapping data flows.

Area Copy activity Mapping data flows
Output data formats supported JSON JSON, Parquet
Data transformation (normalization/flattening/etc.) Requires additional transformation step in the ADF/Synapse pipeline Supports inline transformations
Supported data sinks ADLS gen2, Azure Blob ADLS gen2, Azure Blob, Azure SQL DB
Azure VNET IR Not supported Supported