Release notes for Azure HDInsight

This article provides information about the most recent Azure HDInsight release updates. For information on earlier releases, see HDInsight Release Notes Archive.

Important

Linux is the only operating system used on HDInsight version 3.4 or greater. For more information, see HDInsight versioning article.

Summary

Azure HDInsight is one of the most popular services among enterprise customers for open-source Apache Hadoop and Apache Spark analytics on Azure.

New features

For more information on important changes with HDInsight 4.0., see What's new in HDI 4.0?.

Component versions

The official Apache versions of all HDInsight 4.0 components are given below. The components listed are releases of the most recent stable versions available.

  • Apache Ambari 2.7.1
  • Apache Hadoop 3.1.1
  • Apache HBase 2.0.0
  • Apache Hive 3.1.0
  • Apache Kafka 1.1.1
  • Apache Mahout 0.9.0+
  • Apache Oozie 4.2.0
  • Apache Phoenix 4.7.0
  • Apache Pig 0.16.0
  • Apache Ranger 0.7.0
  • Apache Slider 0.92.0
  • Apache Spark 2.3.1
  • Apache Sqoop 1.4.7
  • Apache TEZ 0.9.1
  • Apache Zeppelin 0.8.0
  • Apache ZooKeeper 3.4.6

Later versions of Apache components are sometimes bundled in the HDP distribution in addition to the versions listed above. In this case, these later versions are listed in the Technical Previews table and should not substitute for the Apache component versions of the above list in a production environment.

Apache patch information

For more information on patches available in HDInsight 4.0, see the patch listing for each product in the table below.

Product name Patch information
Ambari Ambari patch information
Hadoop Hadoop patch information
HBase HBase patch information
Hive This release provides Hive 3.1.0 with no additional Apache patches.
Kafka This release provides Kafka 1.1.1 with no additional Apache patches.
Oozie Oozie patch information
Phoenix Phoenix patch information
Pig Pig patch information
Ranger Ranger patch information
Spark Spark patch information
Sqoop This release provides Sqoop 1.4.7 with no additional Apache patches.
Tez This release provides Tez 0.9.1 with no additional Apache patches.
Zeppelin This release provides Zeppelin 0.8.0 with no additional Apache patches.
Zookeeper Zookeeper patch information

Fixed Common Vulnerabilities and Exposures

For more information on security issues resolved in this release, see Hortonworks' Fixed Common Vulnerabilities and Exposures for HDP 3.0.1.

Known issues

Replication is broken for Secure HBase with default installation

For HDInsight 4.0, do the following steps:

  1. Enable inter-cluster communication.

  2. Sign in to the active headnode.

  3. Download a script to enable replication with the following command:

    sudo wget https://raw.githubusercontent.com/Azure/hbase-utils/master/replication/hdi_enable_replication.sh
    
  4. Type the command sudo kinit <domainuser>.

  5. Type the following command to run the script:

    sudo bash hdi_enable_replication.sh -m <hn0> -s <srclusterdns> -d <dstclusterdns> -sp <srcclusterpasswd> -dp <dstclusterpasswd> -copydata
    

For HDInsight 3.6, do the following:

  1. Sign in to active HMaster ZK.

  2. Download a script to enable replication with the following command:

    sudo wget https://raw.githubusercontent.com/Azure/hbase-utils/master/replication/hdi_enable_replication.sh
    
  3. Type the command sudo kinit -k -t /etc/security/keytabs/hbase.service.keytab hbase/<FQDN>@<DOMAIN>.

  4. Type the following command:

    sudo bash hdi_enable_replication.sh -s <srclusterdns> -d <dstclusterdns> -sp <srcclusterpasswd> -dp <dstclusterpasswd> -copydata
    

Phoenix Sqlline stops working after migrating HBase cluster to HDInsight 4.0

Do the following steps:

  1. Drop the following Phoenix tables:
    1. SYSTEM.FUNCTION
    2. SYSTEM.SEQUENCE
    3. SYSTEM.STATS
    4. SYSTEM.MUTEX
    5. SYSTEM.CATALOG
  2. If you can't delete any of the tables, restart HBase to clear any connections to the tables.
  3. Run sqlline.py again. Phoenix will re-create all of the tables that were deleted in step 1.
  4. Regenerate Phoenix tables and views for your HBase data.

Phoenix Sqlline stops working after replicating HBase Phoenix metadata from HDInsight 3.6 to 4.0

Do the following steps:

  1. Before doing the replication, go to the destination 4.0 cluster and execute sqlline.py. This command will generate Phoenix tables like SYSTEM.MUTEX and SYSTEM.LOG that only exist in 4.0.
  2. Drop the following tables:
    1. SYSTEM.FUNCTION
    2. SYSTEM.SEQUENCE
    3. SYSTEM.STATS
    4. SYSTEM.CATALOG
  3. Start the HBase replication

Deprecation

Apache Storm and ML services aren't available in HDInsight 4.0.