Create Linux-based clusters in HDInsight by using the Azure portalCreate Linux-based clusters in HDInsight by using the Azure portal

The Azure portal is a web-based management tool for services and resources hosted in the Microsoft Azure cloud.The Azure portal is a web-based management tool for services and resources hosted in the Microsoft Azure cloud. In this article, you learn how to create Linux-based Azure HDInsight clusters by using the portal.In this article, you learn how to create Linux-based Azure HDInsight clusters by using the portal.

Ostrzeżenie

Opłaty za klastry usługi HDInsight są naliczane proporcjonalnie za minutę, niezależnie od tego, czy są używane.Billing for HDInsight clusters is prorated per minute, whether you use them or not. Pamiętaj o usunięciu klastra po zakończeniu korzystania z niego.Be sure to delete your cluster after you finish using it. Zobacz Jak usunąć klaster usługi HDInsight.See how to delete an HDInsight cluster.

Wymagania wstępnePrerequisites

Tworzenie klastrówCreate clusters

The Azure portal exposes most of the cluster properties.The Azure portal exposes most of the cluster properties. By using Azure Resource Manager templates, you can hide many details.By using Azure Resource Manager templates, you can hide many details. For more information, see Create Apache Hadoop clusters in HDInsight by using Resource Manager templates.For more information, see Create Apache Hadoop clusters in HDInsight by using Resource Manager templates.

Uwaga

Funkcja wymagająca bezpiecznego transferu wymusza wszystkie żądania do konta przez bezpieczne połączenie.The feature that requires secure transfer enforces all requests to your account through a secure connection. Ta funkcja obsługuje tylko klaster usługi HDInsight w wersji 3,6 lub nowszej.Only HDInsight cluster version 3.6 or newer supports this feature. Aby uzyskać więcej informacji, zobacz Tworzenie klastra Apache Hadoop z kontami magazynu Secure transfer w usłudze Azure HDInsight.For more information, see Create Apache Hadoop cluster with secure transfer storage accounts in Azure HDInsight.

  1. Zaloguj się do portalu Azure.Sign in to the Azure portal.

  2. From the left menu, navigate to + Create a resource > Analytics > Azure HDInsight.From the left menu, navigate to + Create a resource > Analytics > Azure HDInsight.

    Create a new cluster in the Azure portalCreate a new cluster in the Azure portal

  3. From the Create HDInsight cluster page, select Go to classic create experience.From the Create HDInsight cluster page, select Go to classic create experience.

    Go to classic create experience

  4. On the HDInsight page, select Custom (size, settings, apps) .On the HDInsight page, select Custom (size, settings, apps).

  5. Select 1 Basics.Select 1 Basics. Then enter the following information:Then enter the following information:

    WłaściwośćProperty OpisDescription
    Nazwa klastraCluster name Ta nazwa musi być unikatowa w skali globalnej.This name must be globally unique.
    SubskrypcjaSubscription From the drop-down list, select the Azure subscription that's used for the cluster.From the drop-down list, select the Azure subscription that's used for the cluster.
    Typ klastraCluster type Select the type of cluster you want to create.Select the type of cluster you want to create. Examples are Hadoop and Apache Spark.Examples are Hadoop and Apache Spark. The Operating system will be Linux.The Operating system will be Linux. Next, select a cluster type version.Next, select a cluster type version. Use the default version if you don't know what to choose.Use the default version if you don't know what to choose. Więcej informacji można znaleźć w temacie HDInsight cluster versions (Wersje klastrów usługi HDInsight).For more information, see HDInsight cluster versions.
    Nazwa użytkownika logowania klastraCluster login username Provide the username, default is admin.Provide the username, default is admin.
    Hasło logowania klastraCluster login password Provide the password.Provide the password.
    Nazwa użytkownika protokołu SSH (Secure Shell)Secure Shell (SSH) username Default is sshuser.Default is sshuser. If you want the same SSH password as the admin password you specified earlier, select the Use cluster login password for SSH check box.If you want the same SSH password as the admin password you specified earlier, select the Use cluster login password for SSH check box. If not, provide either a PASSWORD or PUBLIC KEY to authenticate the SSH user.If not, provide either a PASSWORD or PUBLIC KEY to authenticate the SSH user. A public key is the approach we recommend.A public key is the approach we recommend. Choose Select at the bottom to save the credentials configuration.Choose Select at the bottom to save the credentials configuration. For more information, see Connect to HDInsight (Apache Hadoop) by using SSH.For more information, see Connect to HDInsight (Apache Hadoop) by using SSH.
    Grupa zasobówResource group Określ, czy chcesz utworzyć nową grupę zasobów, czy użyć istniejącej grupy.Specify whether you want to create a new resource group or use an existing one.
    LokalizacjaLocation Specify a datacenter where the cluster is created.Specify a datacenter where the cluster is created.

    HDInsight create cluster basicsHDInsight create cluster basics

    Ważne

    HDInsight clusters come in a variety of types.HDInsight clusters come in a variety of types. They correspond to the workload or technology that the cluster is tuned for.They correspond to the workload or technology that the cluster is tuned for. There's no supported method to create a cluster that combines multiple types.There's no supported method to create a cluster that combines multiple types. Examples are Storm and HBase on one cluster.Examples are Storm and HBase on one cluster.

    Select Next to move to the next page.Select Next to move to the next page.

  6. From 2 Security + networking, you can connect your cluster to a virtual network by using the provided drop-down menu.From 2 Security + networking, you can connect your cluster to a virtual network by using the provided drop-down menu. Select an Azure virtual network and the subnet if you want to place the cluster into a virtual network.Select an Azure virtual network and the subnet if you want to place the cluster into a virtual network. For information on using HDInsight with a virtual network, see Plan a virtual network deployment for Azure HDInsight clusters.For information on using HDInsight with a virtual network, see Plan a virtual network deployment for Azure HDInsight clusters. The article includes specific configuration requirements for the virtual network.The article includes specific configuration requirements for the virtual network.

    If you want to use the Enterprise Security Package, follow these instructions: Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services.If you want to use the Enterprise Security Package, follow these instructions: Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services.

    Select Next to move to the next page.Select Next to move to the next page.

  7. From 3 Storage, for Storage Account Settings, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage.From 3 Storage, for Storage Account Settings, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage. For more information, see the following table.For more information, see the following table.

    Primary Storage typePrimary Storage type OpisDescription
    Azure StorageAzure Storage * For Selection method, choose My subscriptions if you want to specify a storage account that's part of your Azure subscription.* For Selection method, choose My subscriptions if you want to specify a storage account that's part of your Azure subscription. Then select the storage account.Then select the storage account. Otherwise, select Access key.Otherwise, select Access key. Then provide the information for the storage account that you want to choose from outside your Azure subscription.Then provide the information for the storage account that you want to choose from outside your Azure subscription.
    * For Default container, choose the default container name suggested by the portal or specify your own.* For Default container, choose the default container name suggested by the portal or specify your own.

    * If Azure Blob storage is your default storage, you can also select Additional Storage Accounts to specify additional storage accounts to associate with the cluster.* If Azure Blob storage is your default storage, you can also select Additional Storage Accounts to specify additional storage accounts to associate with the cluster. For Azure Storage Keys, select Add a storage key.For Azure Storage Keys, select Add a storage key. Then you can provide a storage account from your Azure subscriptions or from other subscriptions.Then you can provide a storage account from your Azure subscriptions or from other subscriptions. Provide the storage account access key.Provide the storage account access key.

    * If Blob storage is your default storage, you can also select Data Lake Storage access to specify Azure Data Lake Storage as additional storage.* If Blob storage is your default storage, you can also select Data Lake Storage access to specify Azure Data Lake Storage as additional storage. For more information, see Quickstart: Set up clusters in HDInsight.For more information, see Quickstart: Set up clusters in HDInsight.
  8. Azure Data Lake StorageAzure Data Lake Storage Select Azure Data Lake Storage Gen1 or Azure Data Lake Storage Gen2.Select Azure Data Lake Storage Gen1 or Azure Data Lake Storage Gen2. Then refer to the article Quickstart: Set up clusters in HDInsight for instructions.Then refer to the article Quickstart: Set up clusters in HDInsight for instructions.

    Metastore Settings (optional)Metastore Settings (optional)

    As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster.As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster. For Select a SQL database for Hive, select a SQL database.For Select a SQL database for Hive, select a SQL database. Then provide the username and password for the database.Then provide the username and password for the database. Repeat these steps for Oozie metadata.Repeat these steps for Oozie metadata.

    Some considerations about using Azure SQL database for metastores are as follows:Some considerations about using Azure SQL database for metastores are as follows:

    • The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight.The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight. On the right side of the Azure SQL database dashboard, select the server name.On the right side of the Azure SQL database dashboard, select the server name. This server is the one that the SQL database instance runs on.This server is the one that the SQL database instance runs on. After you're in server view, select Configure.After you're in server view, select Configure. Then for Azure Services, select Yes.Then for Azure Services, select Yes. Następnie wybierz pozycję Zapisz.Then select Save.
    • When you create a metastore, don't name a database with dashes or hyphens.When you create a metastore, don't name a database with dashes or hyphens. These characters can cause the cluster creation process to fail.These characters can cause the cluster creation process to fail.

    HDInsight create cluster storageHDInsight create cluster storage

    Ostrzeżenie

    Using an additional storage account in a different location than the HDInsight cluster isn't supported.Using an additional storage account in a different location than the HDInsight cluster isn't supported.

    Select Next to move to the next page.Select Next to move to the next page.

  9. From 4 Applications (optional) , select any applications that you want.From 4 Applications (optional), select any applications that you want. Microsoft, independent software vendors (ISVs), or you can develop these applications.Microsoft, independent software vendors (ISVs), or you can develop these applications. For more information, see Install applications during cluster creation.For more information, see Install applications during cluster creation.

    Select Next to move to the next page.Select Next to move to the next page.

  10. 5 Cluster size displays information about the nodes that are used for this cluster.5 Cluster size displays information about the nodes that are used for this cluster. Set the number of worker nodes that you need for the cluster.Set the number of worker nodes that you need for the cluster. The estimated cost of running the cluster is also shown.The estimated cost of running the cluster is also shown.

    HDInsight create cluster nodesHDInsight create cluster nodes

    Ważne

    If you plan on more than 32 worker nodes, select a head node size with at least eight cores and 14 GB RAM.If you plan on more than 32 worker nodes, select a head node size with at least eight cores and 14 GB RAM. Plan the nodes either at cluster creation or by scaling the cluster after creation.Plan the nodes either at cluster creation or by scaling the cluster after creation.

    For more information on node sizes and associated costs, see Azure HDInsight pricing.For more information on node sizes and associated costs, see Azure HDInsight pricing.

    Select Next to move to the next page.Select Next to move to the next page.

  11. From 6 Script actions, you can customize a cluster to install custom components.From 6 Script actions, you can customize a cluster to install custom components. This option works if you want to use a custom script to customize a cluster, as the cluster is being created.This option works if you want to use a custom script to customize a cluster, as the cluster is being created. For more information about script actions, see Customize Linux-based HDInsight clusters by using script actions.For more information about script actions, see Customize Linux-based HDInsight clusters by using script actions.

    Select Next to move to the next page.Select Next to move to the next page.

  12. From 7 Summary, verify the information you entered earlier.From 7 Summary, verify the information you entered earlier. Następnie wybierz przycisk Utwórz.Then select Create.

    HDInsight create cluster summaryHDInsight create cluster summary

    Uwaga

    Tworzenie klastra zajmuje trochę czasu, zwykle około 20 minut.It takes some time for the cluster to be created, usually around 20 minutes. Monitor Notifications to check on the provisioning process.Monitor Notifications to check on the provisioning process.

  13. After the creation process finishes, select Go to Resource from the Deployment succeeded notification.After the creation process finishes, select Go to Resource from the Deployment succeeded notification. The cluster window provides the following information.The cluster window provides the following information.

    HDI Azure portal cluster overviewHDI Azure portal cluster overview

    Some of the icons in the window are explained as follows:Some of the icons in the window are explained as follows:

    WłaściwośćProperty OpisDescription
    PrzeglądOverview Provides all the essential information about the cluster.Provides all the essential information about the cluster. Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.
    Cluster dashboardsCluster dashboards Directs you to the Ambari portal associated with the cluster.Directs you to the Ambari portal associated with the cluster.
    SSH + Cluster loginSSH + Cluster login Provides information needed to access the cluster by using SSH.Provides information needed to access the cluster by using SSH.
    UsuńDelete Deletes the HDInsight cluster.Deletes the HDInsight cluster.

Dostosowywanie klastrówCustomize clusters

Usuwanie klastraDelete the cluster

Ostrzeżenie

Opłaty za klastry usługi HDInsight są naliczane proporcjonalnie za minutę, niezależnie od tego, czy są używane.Billing for HDInsight clusters is prorated per minute, whether you use them or not. Pamiętaj o usunięciu klastra po zakończeniu korzystania z niego.Be sure to delete your cluster after you finish using it. Zobacz Jak usunąć klaster usługi HDInsight.See how to delete an HDInsight cluster.

Rozwiązywanie problemówTroubleshoot

W razie problemów podczas tworzenia klastrów usługi HDInsight zapoznaj się z wymaganiami dotyczącymi kontroli dostępu.If you run into issues with creating HDInsight clusters, see access control requirements.

Następne krokiNext steps

You've successfully created an HDInsight cluster.You've successfully created an HDInsight cluster. Now learn how to work with your cluster.Now learn how to work with your cluster.

Apache Hadoop clustersApache Hadoop clusters

Apache HBase clustersApache HBase clusters

Apache Storm clustersApache Storm clusters

Apache Spark clustersApache Spark clusters