Azure HDInsight Documentation
Learn how to use Azure HDInsight to analyze streaming or historical data. Tutorials and other documentation show you how to create clusters, process and analyze big data, and develop solutions using the most popular open-source frameworks, like Apache Hadoop, Apache Spark, Apache Hive, Apache LLAP, Apache Kafka, Apache Storm, and Microsoft Machine Learning Server.
Azure HDInsight is a managed, full-spectrum, open-source analytics service for enterprises. HDInsight is a cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. HDInsight also supports a broad range of scenarios, like extract, transform, and load (ETL); data warehousing; machine learning; and IoT.
Learn how to create an HDInsight cluster and run jobs:
Learn how to use Azure HDInsight in different scenarios:
- Apache Hadoop : Perform ETL operations | Create on-demand clusters
- Apache Spark : Run interactive queries | Visualize data | Machine learning
- Apache Kafka : Structured streaming with Kafka | Use with Storm on HDInsight | Use Kafka Producer and Consumer APIs
- Apache HBase : Create HBase clusters in a VNET | Use Apache Phoenix | Connect to Spark
- Interactive Query : Connect with Power BI using Direct Query
- Apache Storm : Create Storm topology in Java | Deploy Storm topologies on HDInsight | Write from Storm to Data Lake Storage
- ML Services : Use R Tools for Visual Studio