Building Open Source Software (OSS) Analytical Solutions with Azure HDInsight

Data Engineer
Data Scientist

In this learning path, you will be introduced to Azure HDInsight and learn how to apply it to solve a range of real-world challenges.


The following prerequisites should be completed:

  • Successfully log in to the Azure portal
  • Understand the Azure storage options
  • Understand the Azure compute options

Modules in this learning path

By the end of this module, you will understand that Azure HDInsight is a fully managed cloud service that enables you to efficiently process massive amounts of data using the most popular open source frameworks.

In this module, you will learn the different configuration options for ensuring optimal use of HDInsight from both a performance and a cost perspective.

In this module, you will learn how to create an HDInsight cluster, monitor a cluster, and recognize common provisioning issues.

By the end of this module, you will be able to perform ad hoc queries on a big data set. HDInsight Interactive Query helps to achieve sub-second query latencies.