Install custom Apache Hadoop applications on Azure HDInsight

In this article, you'll learn how to install an Apache Hadoop application on Azure HDInsight, which hasn't been published to the Azure portal. The application you'll install in this article is Hue.

An HDInsight application is an application that users can install on an HDInsight cluster. These applications can be developed by Microsoft, independent software vendors (ISV) or by yourself.

Prerequisites

If you want to install HDInsight applications on an existing HDInsight cluster, you must have an HDInsight cluster. To create one, see Create clusters. You can also install HDInsight applications when you create an HDInsight cluster.

Install HDInsight applications

HDInsight applications can be installed when you create a cluster or to an existing HDInsight cluster. For defining Azure Resource Manager templates, see MSDN: Install an HDInsight application.

The files needed for deploying this application (Hue):

To install Hue to an existing HDInsight cluster

  1. Select the following image to sign in to Azure and open the Resource Manager template in the Azure portal.

    Deploy to Azure button for new cluster

    The Resource Manager template is located at https://github.com/hdinsight/Iaas-Applications/tree/master/Hue. To learn how to write this Resource Manager template, see MSDN: Install an HDInsight application.

  2. Select the existing Resource group that contains your cluster from the drop-down list. It's required to use the same resource group as the cluster.

  3. Enter the name of the cluster where you want to install the application. This cluster must be an existing cluster.

  4. Select the checkbox for I agree to the terms and conditions stated above.

  5. Select Purchase.

You can see the installation status from the tile pinned to the portal dashboard and the portal notification (click the bell icon on the top of the portal). It takes about 10 minutes to install the application.

To install Hue while creating a cluster

  1. Select the following image to sign in to Azure and open the Resource Manager template in the Azure portal.

    Deploy to Azure button for new cluster

    The Resource Manager template is located at https://hditutorialdata.blob.core.windows.net/hdinsightapps/create-linux-based-hadoop-cluster-in-hdinsight.json. To learn how to write this Resource Manager template, see MSDN: Install an HDInsight application.

  2. Follow the instruction to create cluster and install Hue. For more information on creating HDInsight clusters, see Create Linux-based Hadoop clusters in HDInsight.

Other installation methods

In addition to the Azure portal, you can also use Azure PowerShell and Azure CLI to call Resource Manager templates.

Validate the installation

You can check the application status on the Azure portal to validate the application installation. In addition, you can also validate all HTTP endpoints came up as expected and the webpage if there's one.

For Hue, you can use the following steps:

Azure portal

  1. Sign in to the Azure portal.
  2. Select the cluster where you installed the application.
  3. From the Settings menu, select Applications.
  4. Select hue from the list to view the properties.
  5. Select the Webpage link to validate the website.

Azure CLI

Replace CLUSTERNAME, and RESOURCEGROUP with the relevant values and then enter the commands below:

  • To lists all of the applications for the HDInsight cluster.

    az hdinsight application list --cluster-name CLUSTERNAME --resource-group RESOURCEGROUP
    
  • To retrieve properties of the specified application.

    az hdinsight application show --name hue --cluster-name CLUSTERNAME --resource-group RESOURCEGROUP
    

Troubleshoot the installation

You can check the application installation status from the portal notification (Click the bell icon on the top of the portal).

If an application installation failed, you can see the error messages and debug information from three places:

  • HDInsight Applications: general error information.

    Open the cluster from the portal, and select Applications from Settings:

    hdinsight applications application installation error

  • HDInsight script action: If the HDInsight Applications' error message indicates a script action failure, more details about the script failure will be presented in the script actions pane.

    Select Script Action from Settings. Script action history shows the error messages

    hdinsight applications script action error

  • Apache Ambari Web UI: If the install script was the cause of the failure, use Ambari Web UI to check full logs about the install scripts.

    For more information, see Troubleshoot script actions.

Remove HDInsight applications

Azure portal

  1. Sign in to the Azure portal.
  2. Select the cluster where you installed the application.
  3. From the Settings menu, select Applications.
  4. Right-click the application you want to remove, and then select Delete.
  5. Select Yes to confirm.

Azure CLI

Replace NAME, CLUSTERNAME, and RESOURCEGROUP with the relevant values and then enter the command below:

az hdinsight application delete --name NAME --cluster-name CLUSTERNAME --resource-group RESOURCEGROUP

Next steps