Microsoft Power BI is a business analytics service that provides interactive visualizations with self-service business intelligence capabilities, enabling end users to create reports and dashboards by themselves without having to depend on information technology staff or database administrators.
When you use Azure Databricks as a data source with Power BI, you can bring the advantages of Azure Databricks performance and technology beyond data scientists and data engineers to all business users.
You can connect Power BI Desktop to your Azure Databricks clusters using the built-in Azure Databricks connector. You can also publish Power BI reports to the Power BI service and enable users to access the underlying Azure Databricks data using SSO, passing along the same Azure AD credentials they use to access the report.
Power BI Desktop 2.85.681.0 or above. Download the latest version.
Decide how you are going to authenticate from Power BI to Azure Databricks.
You can authenticate using Azure Active Directory (recommended) or a personal access token.
For large imports that take longer than one hour, Databricks recommends that you use personal access token authentication as there is a known token refresh issue when using Azure AD authentication.
Connect to Azure Databricks from Power BI
Step 1: Get Azure Databricks connection information
- Get the server hostname, port, and HTTP path.
- If you authenticate using a personal access token, get a token.
Step 2: Configure Azure Databricks connection in Power BI
In PowerBI Desktop, go to Get Data > Azure and select the Azure Databricks connector.
Paste the Server Hostname and HTTP Path you retrieved in Step 1.
Select your Data Connectivity mode. For information about the difference between Import and DirectQuery, see Use DirectQuery in Power BI Desktop.
At the authentication prompt, select the way to authenticate to Azure Databricks:
- Azure Active Directory: Use your Azure account credentials. Click the Sign in button. In the sign-in dialog, enter your Azure account username (email, phone, or Skype).
- Personal access token: Use the personal access token you retrieved in Step 1.
Select the Azure Databricks data to query from the Power BI Navigator.
Access Azure Databricks data source using the Power BI service
When you publish a report to the Power BI service, you can enable users to access the report and underlying Azure Databricks data source using SSO:
Publish your Power BI report from Power BI Desktop to the Power BI service.
Enable single sign on (SSO) access to the report and underlying data source.
- Go to the underlying Azure Databricks dataset for the report in the Power BI service, expand Data source credentials, and click Edit credentials.
- On the configuration dialog, select Report viewers can only access this data source with their own Power BI identities using Direct Query and click Sign in.
With this option selected, access to the data source is handled using DirectQuery and managed using the Azure AD identity of the user who is accessing the report. If you don’t select this option, only you, as the user who published the report, will have access to the Azure Databricks data source.
The Azure Databricks connector does not support web proxy.