Accessing ADLS Gen2 from an ESP-enabled HDInsight cluster
Hello all, I have configured an HDI cluster with ESP enabled, and from Ambari and the Azure console I can see that all the services are running fine. But when trying to execute a sample pi job, or even to list files using hdfs dfs -ls, I am getting the…
Can we add new blob storages to an existing HDI cluster without r
Hi Team, I have a scenario where I have an existing cluster. I created 2 new blob storage accounts and want to add them to that existing HDI cluster. Is there any way to add these storage accounts to the HDI cluster and have them take effect without having to delete…
How to model thousands of files from Azure Data Lake Gen 2 into a single dataset for analysis?
Hi, I have an initial set of thousands of delimited files in an Azure Data Lake Gen 2 storage account. I need to read all these files and combine them into a single dataset for analysis. This dataset must be preserved for future files. After these files are processed,…
Unable to access 101-hdinsight-linux-add-edge-node template
I am trying to create an edge node for an HDInsight cluster using the link…
Can't access the Hadoop services on HDInsight 4.0
Hi all, I deployed ESP Hadoop clusters in my VNet, but I can't access some Hadoop services, such as the NameNode UI and the Solr Ambari UI, because I can't reach the FQDN:hn0-[clustername initial 6 characters].[AAD-DS DNS domain name] :…
Import HDI external Hive metadata DB into Synapse
I have an HDI 4.0 cluster that I am trying to turn off and move to Synapse. I have an external Hive DB. How can I import Hive tables and metadata? I know I can first recreate the metadata by hand and then read from ADLS. I want to automate/eliminate…
HDInsight cluster with ESP
I know that in order to create a Kerberized HDInsight cluster, we have to enable Azure Active Directory Domain Services. My question is: would it be possible to create a Kerberized HDInsight cluster with an external KDC server? By external KDC…
Data migration between Kerberized CDH and ESP-enabled HDInsight
Hi all, we have a requirement to transfer HDFS data residing in our on-premises Kerberized CDH cluster (MIT KDC) into an HDInsight cluster with ESP enabled. The source will be CDH and the destination will be ADLS. How can I do that, since I see issues…
Need to delete an HDI cluster that is no longer in use
Hi, I need to know whether there is any impact on storage or any other component when I delete an HDI cluster.
HDInsight cluster deletion failure
Hello, I am currently facing an issue in which my on-demand HDInsight cluster is not automatically deleted after job execution, which is causing huge costs for me. I am looking for an automated process to delete the HDInsight cluster if there is no…
PySpark HDInsight Data Factory environment variable
I'm facing a problem with PySpark, Data Factory, and HDInsight. I created an HDInsight cluster with 2 masters and 2 slaves. I created environment variables on all servers like this: sudo echo 'TEST=server' >> /etc/environment After that, on all servers I…
HDInsight cluster is failing when upgrading the version to 4.0
Hi Team, I am getting this error: Operation on target Txxxxxx failed: Hadoop job failed with exit code '2'. See…
What is the difference between Azure Synapse Analytics, Azure Databricks, and Azure HDInsight?
Hello everybody, I'm running a project where we need to propose an Azure-based architecture to import data from an on-premises data warehouse (databases) to an Azure-based data platform. The data is meant to be exposed to company operators through a web…
Error with Hive Warehouse Connector jar in Azure: java.lang.NoSuchFieldError: HIVE_STATS_JDBC_TIMEOUT
I have the following Azure env: HDP Spark version: 2.3.2.3.1.0.319-3, Hadoop version: 3.1.0.319-3, Hive version: 3.1.0. Can anyone please suggest which versions of the jars need to be used with the above env configuration? I am using the following jar…
HDInsight cluster stuck in 'delete' state
I issued a delete request for an HDInsight cluster over 12 hours ago, and it now says 'Deleting'. Is there any way to complete the deletion of the cluster?
How to do big data analysis on append blobs with HDInsight or any alternative service on Azure? (source log data produced by Azure Monitor)
Hello, I enabled the diagnostic settings under Monitoring for a storage account, and the logs are sent to another storage account. The default type of the log JSON file is append blob rather than block blob, and it seems the type cannot be changed. …
Best practices for submitting Spark batch jobs in Azure HDInsight
Hi, I'm looking to submit my PySpark scripts in HDInsight. Currently, HDInsight provides Livy for job submission, using curl. However, if I want to productionize it, what authentication mechanism should I use? Also, how can I check the progress of…
How to access an external storage account in HDInsight cluster without access key?
How can an HDInsight cluster read data from a private blob container that was not set as the default or an additional storage account during the cluster's creation? We don't want to use an access key. Can we add additional storage accounts after the cluster…
How To Create an Azure Cluster In Another Region Using Azure Data Factory
I'm trying to change the region where the HDInsight linked service creates an on-demand cluster. I want to change from East US to East US 2 (or West US), but I can't do it. I've added the Location key in the following places without success: …
LLAP is disabled and can't be enabled for some reason, and queries with joins take a long time to fetch results
LLAP was manually disabled in HDI 4.0, and now a query with joins takes 30 minutes longer to fetch 30 million rows