HDInsight on Windows: Building PowerPivot Report from Hive in a Few Easy Steps

What are we Covering:

  • Hive
    • Create table
    • Load Data
    • Query Data
  • Configure ODBC
  • Create PowerPivot
    • With DataSource as HDInsight
    • Import Data from HDInsight
  • Create Pivot Chat using imported data

 

(1)Open Hadoop Command Line

 

After installing HDInsight for Windows you will get Hadoop Command Line Shortcut on desktop

Click on Hadoop Command Line Shortcut and it will open Cmd Line for Hadoop

clip_image002

 

(2)Type Hive and it will change to Hive Command

 

 

clip_image004

 

(3)Create Tables

 

CREATE TABLE emp(id int,name string)ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

CREATE TABLE sales(id int,sales int)ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

clip_image005

 

(4)Load Data using CSV Files

 

LOAD DATA LOCAL INPATH 'c:/data/emp.csv' OVERWRITE INTO TABLE emp;

clip_image006

LOAD DATA LOCAL INPATH 'c:/data/sales.csv' OVERWRITE INTO TABLE sales;

clip_image007

 

(5) Open HDInsight Dashboard (You will find shortcut on desktop) and click on Download

 

clip_image010

 

(6)Click on the appropriate link for your Excel.

 

clip_image013

 

(7)Configure ODBC

For configuring ODBC, Click start and type in ODBC. This will bring up the Data Sources dialog. Go to the System DSN tab and click Add. Configure it with a Data Source Name, Your Host Information, and a Username. In Single Node HDInsight Installation you don’t need to provide Username but if you are using HDInsight on Azure you can provide Username .

clip_image014

clip_image016

 

 

(8) Provider Data Source Name and Default Port is 10000

 

clip_image018

 

 

(9)Open Excel Sheet, select PowerPivot in ribbon and click on Manage

 

clip_image021

 

(10)Click on Get External Data

 

clip_image025

 

(11)Select – Get External Data and Data Import window and Select Others(OLEDB/ODBC)

 

clip_image027

 

 

(12)Click Build

 

clip_image029

 

(13)Select OLE DB Provider for ODBC

 

clip_image030

 

 

(14)Select DSN which we have created earlier in step number 8, Select Database (In Single Node HDInsight installation you don’t need to provider User Name and Password in current release but in case of HDInsight on Azure you need to provide User Name and Password) and follow the wizard for selecting tables in our case its EMP and Sales

 

clip_image031

clip_image034

clip_image035

clip_image036

 

 

(15)If you Notice Relationship between Emp and Sales is absent so with one click we can create relationship

 

clip_image038

clip_image040

 

(16)Added feature like Calculated Measure name Sales and created a KPI

 

clip_image041

 

(17)Select Pivot of your Choice from PivotTable

 

clip_image042

 

 

(18) Finally - you are good to go with PowerPivot Report based on Hive

clip_image044

Refer

HDInsight – What is it?

Configuring: PowerPivot for Excel 2013