Connect to data sources for Power BI dataflows
With Power BI dataflows, you can connect to many different data sources to create new dataflows, or add new entities to an existing dataflow.
This article lists the many available data sources for creating or adding to dataflows, and describes how to create those dataflows using these data sources.
For an overview of how to create and use dataflows, see creating and using dataflows in Power BI.
Create a dataflow from a data source
To connect to data, from the Power BI service select the + Create menu item, then select dataflow from the menu that appears. When selected, the following image appears in the canvas of the Power BI service.
If your dataflow already exists, you can add new entities to your dataflow by selecting Add entities, shown below, or by selecting Get data in the dataflow authoring tool.
The following image shows the Get data button in the dataflow authoring tool.
Data sources for dataflows
You can view the available data sources by selecting Get Data from the dataflow authoring tool, which then displays a dialog for selecting the categories and each data source, as shown in the following image.
Data sources for dataflows are organized into the following categories, which appear across the top of the Get data dialog:
- All categories
- Power BI
- Online Services
The All categories category contains all data sources, from all categories.
The File category includes the following available data connections for dataflows:
The Database category includes the following available data connections for dataflows:
- IBM DB2 Database
- MySQL Database
- Oracle Database
- PostgreSQL Database
- SQL Server Database
- Sybase Database
- Teradata Database
The Power BI category includes the following available data connections for dataflows:
- Power BI dataflows
The Azure category includes the following available data connections for dataflows:
- Azure Blobs
- Azure Data Explorer
- Azure SQL Data Warehouse
- Azure SQL Database
- Azure Tables
The Online Services includes the following available data connections for dataflows:
- Amazon Redshift
- Common Data Service for Apps
- Microsoft Exchange Online
- Salesforce Objects
- Salesforce Reports
- SharePoint Online List
The Other category includes the following available data connections for dataflows:
- Active Directory
- SharePoint List
- Web API
- Web page
- Blank table
- Blank Query
Connecting to a data source
To connect to a data source, select the data source. We'll use one example to show how the process works, but each data connection for dataflows is similar in process. Different connectors may require specific credentials or other information, but the flow is similar. In our example, you see in the following image that Common Data Service for Apps is selected from the Online services data connection category.
A connection window for the selected data connection is displayed. If credentials are required, you're prompted to provide them. The following image shows a Server URL being entered to connect to a Common Data Service for Apps server.
Once the Server URL or resource connection information is provided, select Sign in to enter the credentials to use for the data access, then select Next.
Power Query Online initiates and establishes the connection to the data source, and then presents the available tables from that data source in the Navigator window, shown in the following image.
You can select tables and data to load by selecting the checkbox next to each in the left pane. To load the data, select OK from the bottom of the Navigator pane. A Power Query Online dialog appears, where you can edit queries and perform any other transformation you want to perform on the selected data.
That's all there is to it. Other data sources have similar flows, and use Power Query Online to edit and transform the data you bring into your dataflow.
Connecting to additional data sources
There are additional data connectors that are not shown in the Power BI dataflows user interface, but are supported with a few additional steps.
You can take the following steps to create a connection to a connector that is not displayed in the user interface:
Open Power BI Desktop and select Get Data.
Open Power Query Editor in Power BI Desktop, then right-click on the relevant query and open the Advanced Editor, as shown in the following image. From there, you can copy the M script that appears in the Advanced Editor.
Open the Power BI dataflow, and select Get data for a blank query, as shown in the following image.
Paste the copied query into the blank query for the dataflow.
And then your script connects to the data source you specified.
The following list shows which connectors you can currently use by copying and pasting the M query into a blank query:
- SAP Business Warehouse
- Azure analysis Services
- Adobe Analytics
- OLE DB
- SharePoint Online folder
- SharePoint folder
- Hadoop HDFS
- Azure HDInsight (HDFS)
- Hadoop file HDFS
- Informix (beta)
That's all there is to connecting to data sources in Power BI dataflows.
This article showed which data sources you can connect to for dataflows. The following articles go into more detail about common usage scenarios for dataflows.
- Self-service data prep in Power BI
- Create and use dataflows in Power BI
- Using computed entities on Power BI Premium
- Using dataflows with on-premises data sources
- Developer resources for Power BI dataflows
- Dataflows and Azure Data Lake integration (Preview)
For more information about Power Query and scheduled refresh, you can read these articles:
For more information about the Common Data Model, you can read its overview article: