Visualize Apache Hive data with Microsoft Power BI using ODBC in Azure HDInsight
Learn how to connect Microsoft Power BI to Azure HDInsight using ODBC and visualize Apache Hive data.
You can leverage the Hive ODBC driver to do import via the generic ODBC connector in Power BI Desktop. However it is not recommended for BI workloads given non-interactive nature of the Hive query engine. HDInsight Interactive Query connector and HDInsight Spark connector are better choices for their performance.
In this tutorial, you load the data from a hivesampletable Hive table to Power BI. The Hive table contains some mobile phone usage data. Then you plot the usage data on a world map:
The information also applies to the new Interactive Query cluster type. For how to connect to HDInsight Interactive Query using direct query, see Visualize Interactive Query Hive data with Microsoft Power BI using direct query in Azure HDInsight.
Before going through this article, you must have the following items:
- HDInsight cluster. The cluster can be either a HDInsight cluster with Hive or a newly released Interactive Query cluster. For creating clusters, see Create cluster.
- Microsoft Power BI Desktop. You can download a copy from the Microsoft Download Center.
Create Hive ODBC data source
Load data from HDInsight
The hivesampletable Hive table comes with all HDInsight clusters.
- Sign in to Power BI Desktop.
Click the Home tab, click Get Data from the External data ribbon, and then select More.
- From the Get Data pane, click Other from the left, click ODBC from the right, and then click Connect on the bottom.
- From the From ODBC pane, select the data source name you created in the last section, and then click OK.
- From the Navigator pane, expand ODBC->HIVE->default, select hivesampletable, and then click Load.
Continue from the last procedure.
From the Visualizations pane, select Map. It is a globe icon.
- From the Fields pane, select country and devicemake. You can see the data plotted on the map.
- Expand the map.
In this article, you learned how to visualize data from HDInsight using Power BI. To learn more, see the following articles:
- Use Apache Zeppelin to run Apache Hive queries in Azure HDInsight.
- Connect Excel to HDInsight with the Microsoft Hive ODBC Driver.
- Connect Excel to Apache Hadoop by using Power Query.
- Connect to Azure HDInsight and run Apache Hive queries using Data Lake Tools for Visual Studio.
- Use Azure HDInsight Tool for Visual Studio Code.
- Upload Data to HDInsight.