199 questions with Azure HDInsight tags

Sort by: Updated
4 answers

Unable to create HDInsight cluster using Microsoft tutorial

I am a student trying to learn how to setup HDInsight Hadoop cluster. I have signed up and received the free $200 credit. I am following the steps laid out in Microsoft's tutorial here:…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2020-12-21T18:46:02.527+00:00
Justin Birchard 1 Reputation point
commented 2022-07-24T05:26:36.617+00:00
Vrunda 21 Reputation points
1 answer

azure hdinsight There are not enough cores available

I want to create HDInsight in my pay as you go subscription, but I get error: There are not enough cores available to support the selected number of nodes. I checked in my subscription usage and quotas for computing and usage is for every processor…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-07-06T21:41:26.09+00:00
Ales Ventus 46 Reputation points
commented 2022-07-11T05:39:06.973+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Files not getting saved in Azure blob using Spark in HDInsights cluster

We've setup HDInsights cluster on Azure with Blob as the storage for Hadoop. We tried uploading files to the Hadoop using hadoop CLI and the files were getting uploaded to the Azure Blob. Command used to upload: Hadoop fs -put somefile…

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,436 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-06-14T11:44:29.873+00:00
Saif Ahmad 21 Reputation points
commented 2022-06-21T13:09:24.653+00:00
Saif Ahmad 21 Reputation points
1 answer One of the answers was accepted by the question author.

Connect Synapse Spark Pool with Kafka on HDInsight

I have created a Kafka on HDinsight cluster . I have also created Azure Synapse Analytics - Spark Pool on same region as HDinsight. I need guidance on how to consume topics from Kafka into Spark Structured Streaming. Any documentation or steps will be of…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,389 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-06-07T01:52:19.007+00:00
sql-seek 61 Reputation points
accepted 2022-06-15T04:38:39.04+00:00
sql-seek 61 Reputation points
1 answer One of the answers was accepted by the question author.

HDInsight - Kafka - Version 3.2

Hi all Is there a roadmap to release a cluster with a higher kafka version than 2.4.1 in the near future? Thanks for the info in advance. Best reagrads, Michael

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-05-30T13:56:47.557+00:00
Michael Ahrens 21 Reputation points
accepted 2022-06-07T09:26:48.037+00:00
Michael Ahrens 21 Reputation points
1 answer One of the answers was accepted by the question author.

Can Azure Streaming Analytics read from Kafka on HDInsight and write to Deltalake table on Synapse lake.

Hello I am looking for guidance on building a new event driven platform. The options we are exploring for processing are - Azure Stream Analytics Apache Spark Structured Streaming in Synapse Source is like going to be Kafka on HDInsight …

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,389 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
Azure Stream Analytics
Azure Stream Analytics
An Azure real-time analytics service designed for mission-critical workloads.
330 questions
asked 2022-06-01T21:16:30.287+00:00
sql-seek 61 Reputation points
commented 2022-06-03T05:46:55.343+00:00
sql-seek 61 Reputation points
1 answer

what is the best way to copy data from my hadoop on prem cluster to the azure hdinsight cluster?

hi experts, what is the best way to copy data from my hadoop on prem cluster to the azure hdinsight cluster? So we recently deployed a new hdinsight cluster and now I would like to copy some data from my onprem cluster to hdinsight. Thanks,

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-05-16T20:57:25.917+00:00
Richmond Yu 1 Reputation point
commented 2022-06-01T05:47:08.503+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer

How to run hdfs commands from my on prem cluster to azure?

Hi experts, How to run hdfs commands from my on prem cluster to azure? So I have an on prem cluster that I would like to run hdfs commands to read files that are from my Azure HDinsight cluster. How can I do this? Thanks,

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-05-16T20:56:10.407+00:00
Richmond Yu 1 Reputation point
commented 2022-06-01T05:46:30.057+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer

HDinsight monitering

Hi Friends i am new to HD insight any ida about hd insight clusters monitor What are the major things we need to observe and moniter we are using apache Ambari

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
Azure R Server for HDInsight
Azure R Server for HDInsight
An Azure service that provides predictive analytics, machine learning, and statistical modeling for big data.
13 questions
asked 2022-05-10T08:17:22.893+00:00
Anshal 1,886 Reputation points
commented 2022-05-15T11:30:21.27+00:00
Luis Rodriguez 6,191 Reputation points Microsoft Employee
1 answer

Looking for HDInsight support team alias

I am running into an error while running a spark job in Azure Data Factory's pipeline and would like to connect to the HDInsight support team for further assistance. If you can please provide the alias

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-04-29T20:18:15.62+00:00
Harsha Deshmukh 1 Reputation point Microsoft Employee
commented 2022-05-10T07:31:36+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer

Can't create HDInsight Cluster(Hadoop)

Dear All, I am struggling against creating HDInsight. After reviewing documents and other posts, I upgraded free-trial to paid subscription and created paid subscription as well. However, regardless of subscription types(both subscription paid type) I…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-05-03T15:57:25.88+00:00
Jongmin Lee 6 Reputation points
commented 2022-05-09T05:15:42.647+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How to leverage Azure key vault secrets from HD Insight Jupyter notebook?

Hi, I am trying to store the user id and password in Secrets and retrieve them in HD Insight Jupyter notebook? Any guidance.

Azure Key Vault
Azure Key Vault
An Azure service that is used to manage and protect cryptographic keys and other secrets used by cloud apps and services.
1,122 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-04-20T03:08:50.533+00:00
Jeeva 161 Reputation points
accepted 2022-05-02T12:04:55.727+00:00
Jeeva 161 Reputation points
1 answer One of the answers was accepted by the question author.

How to use GnuPG in HDInsight for encryption and decryption?

Hi, I am working with the HDInsight Spark cluster on Azure. Trying to encrypt files with pgp encryption using our private key. Is there a way that this can achieve rather than using the inbuilt encryption mechanism? How to set the home for GnuPG…

Azure Disk Encryption
Azure Disk Encryption
An Azure service for virtual machines (VMs) that helps address organizational security and compliance requirements by encrypting the VM boot and data disks with keys and policies that are controlled in Azure Key Vault.
160 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-04-17T03:02:56.34+00:00
vijay singh parmar 26 Reputation points
accepted 2022-04-28T07:21:51.667+00:00
vijay singh parmar 26 Reputation points
1 answer

Not enough cores error while deploying resource group on Azure for Students

Hello! I am very new to Azure. I have an Azure for Students subscription and I'm trying to create an Apache Kafka cluster using Azure HDInsight. I selected West Europe as my region. I'm using this resource as a guide:…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-04-15T00:28:01.03+00:00
Leila Moussa 1 Reputation point
commented 2022-04-22T03:57:47.553+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer

Zeppelin notebook - sc.textFile does not work for HDI with ESP

We have HDI cluster with ESP enabled. From our zeppelin notebook, when I read data to a dataset (spark.read.text) it works but when I try to read it to an RDD (sc.textFile), I get an authentication exception: Note that, while sc.textFile…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2021-02-11T15:14:14.283+00:00
Steven Lai 1 Reputation point
commented 2022-04-01T04:03:14.057+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
0 answers

unable to access index for repository https://mran.microsoft.com/snapshot/2017-03-15/src/contrib

The last time I did same thing last month, it was still ok, but today When I tried to install R package from MRAN repository; I got this error Checking the repository via browser also error could not find repository Could you please help me in the…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-03-19T10:46:16.717+00:00
Suryanto 16 Reputation points
commented 2022-03-21T22:31:04.06+00:00
Saurabh Sharma 23,751 Reputation points Microsoft Employee
1 answer

HDInsight: Commands to clean up the space

Hi, at my workplace, we are using HDInsight 3.6. We have encountered space issues before, but we were able to resolve them by simply executing the simple cleanup commands from the edge node. Unfortunately, these commands have not been useful recently.…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-03-10T19:51:28.77+00:00
vijay singh parmar 26 Reputation points
commented 2022-03-17T10:07:15.783+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

run job in HDInsight compute linked service

is username and password is the only way to submit job to HDInsight cluster? is managed identity or msi or service principal supported? Added question: can HDI team build API which uses AAD tokens as password instead of user input password? we have…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,587 questions
asked 2022-03-04T21:52:05.777+00:00
Bill Kan 21 Reputation points
commented 2022-03-11T03:03:01.097+00:00
PRADEEPCHEEKATLA-MSFT 77,516 Reputation points Microsoft Employee
1 answer

Kafka REST Proxy Authentication

We would like to develop a HDInsight Kafka cluster to share real time data with a subcontractor. The REST proxy documentation indicates that "Kafka clients that need access to the REST proxy should be registered to a group by the group owner."…

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-02-18T18:30:46.19+00:00
Mike McNulty 1 Reputation point
commented 2022-02-27T17:35:16.017+00:00
ShaikMaheer-MSFT 37,896 Reputation points Microsoft Employee
0 answers

Run C# mapreduce job

I am a beginner to Hadoop MapReduce. I have implemented a MapReduce job in visual C# and want to run it locally. As I understood, the HDInsight emulator hasn't been updated for a long time. What else options I have, to run the job locally?

Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
asked 2022-02-17T17:59:18.127+00:00
Lilukshi Silva 1 Reputation point
commented 2022-02-25T20:30:24.82+00:00
Lilukshi Silva 1 Reputation point