HDInsight SDK Reference Documentation

The topics in this section describe the public namespaces, classes, and interfaces that support the Microsoft Azure HDInsight (Hadoop) Service, Hadoop on Parallel Data Warehouse (PDW) appliances, and the Microsoft Avro Library for data serialization.

The client classes are used to manage Hadoop jobs on Azure and on PDW appliances. The management classes are used to configure the clusters, storage accounts, MapReduce programs, and the HBase, Hive, Pig, Sqoop, and Oozie components associated with the HDInsight clusters for a Windows Azure subscription.

The Microsoft Avro Library is an implementation of the Avro data serialization system. It employs rich, JSON-defined data structures and an object container to store persistent data. The Avro data format can be processed by many languages: C, C++, C#, Java, PHP, Python, and Ruby are currently supported.

Namespaces for the Windows Azure HDInsight Service API

Namespaces

Descriptions

Microsoft.Hadoop.Client

Provides classes related the creation, configuration, submission, and monitoring of to Hadoop jobs managed by Azure HDInsight.

Microsoft.Hadoop.Client.HadoopStorageClientLayer

Provides the interfaces associated with the Hadoop storage client.

Microsoft.HadoopAppliance.Client

Provides credentials and the means to connect to client storage in Hadoop on appliance.

Microsoft.HadoopAppliance.Client.HadoopStorageClientLayer

Provides the interface for the Hadoop appliance storage client.

Microsoft.WindowsAzure.Management.HDInsight

Provides classes used to manage Microsoft Azure subscriptions using HDInsight and to configure the clusters, storage accounts, MapReduce programs, and the Hive, Pig, and Oozie components associated with the HDInsight clusters managed by a Windows Azure subscription.

Microsoft.WindowsAzure.Management.HDInsight.ClusterProvisioning.Data

Provides information used to provision clusters.

Microsoft.WindowsAzure.Management.HDInsight.JobSubmission

Provides classes used to submit jobs to HDInsight

Microsoft.Hadoop.Avro

Provides the main library of classes used for Avro for the serialization and deserialization of objects.

Microsoft.Hadoop.Avro.Container

Provides classes used by Avro stream-based containers.

Microsoft.Hadoop.Avro.Schema

Provides operations related to Avro serialization schema.

Microsoft.Hadoop.Avro.Utils.Templates

Provides the template interface used by Avro serialization

Windows Azure HDInsight PowerShell

The HDInsight Service uses Azure PowerShell to configure, run, and post-process Hadoop jobs. The documentation for the cmdlets is available at HDInsight Cmdlet Reference Documentation.

See Also

Other Resources

Windows Azure HDInsight Documentation