MSDN Library

HDInsight SDK Reference Documentation

Updated: January 28, 2015

The topics in this section describe the public namespaces, classes, and interfaces that support the Microsoft Azure HDInsight (Hadoop) Service, Hadoop on Parallel Data Warehouse (PDW) appliances, and the Microsoft Avro Library for data serialization.

The client classes are used to manage Hadoop jobs on Azure and on PDW appliances. The management classes are used to configure the clusters, storage accounts, MapReduce programs, and the HBase, Hive, Pig, Sqoop, and Oozie components associated with the HDInsight clusters for a Windows Azure subscription.

The Microsoft Avro Library is an implementation of the Avro data serialization system. It uses schemas defined in JSON to describe rich data structures, and an object container format to store persistent data. The Avro data format can be processed by many languages: C, C++, C#, Java, PHP, Python, and Ruby are currently supported.
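As a rough illustration of what a JSON-defined Avro structure looks like, the following sketch declares a record schema and reads its field names using only the Python standard library. It does not call the Microsoft Avro Library or any other Avro implementation. The record name `SensorReading`, its namespace, its fields, and the helper `field_names` are hypothetical names chosen for this example.

```python
import json

# Hypothetical record schema written in Avro's JSON schema syntax.
# The record name, namespace, and fields are illustrative assumptions.
SENSOR_SCHEMA = """
{
  "type": "record",
  "name": "SensorReading",
  "namespace": "example.avro",
  "fields": [
    {"name": "deviceId", "type": "string"},
    {"name": "timestamp", "type": "long"},
    {"name": "temperature", "type": ["null", "double"], "default": null}
  ]
}
"""


def field_names(schema_json):
    """Return the field names declared by a JSON-encoded record schema."""
    schema = json.loads(schema_json)
    if schema.get("type") != "record":
        raise ValueError("expected an Avro record schema")
    return [field["name"] for field in schema["fields"]]


print(field_names(SENSOR_SCHEMA))
```

Because the schema is ordinary JSON, any of the supported languages can parse the same definition, which is what allows data written by one language to be read by another.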




Provides classes related to the creation, configuration, submission, and monitoring of Hadoop jobs managed by Azure HDInsight.


Provides the interfaces associated with the Hadoop storage client.


Provides credentials and the means to connect to client storage in Hadoop on a PDW appliance.


Provides the interface for the Hadoop appliance storage client.


Provides classes used to manage Microsoft Azure subscriptions using HDInsight and to configure the clusters, storage accounts, MapReduce programs, and the Hive, Pig, and Oozie components associated with the HDInsight clusters managed by a Windows Azure subscription.


Provides information used to provision clusters.


Provides classes used to submit jobs to HDInsight.


Provides the main library of Avro classes used for the serialization and deserialization of objects.


Provides classes used by Avro stream-based containers.


Provides operations related to Avro serialization schema.


Provides the template interface used by Avro serialization.

Windows Azure HDInsight PowerShell

The HDInsight Service uses Azure PowerShell to configure, run, and post-process Hadoop jobs. The documentation for the cmdlets is available at HDInsight Cmdlet Reference Documentation.

© 2015 Microsoft