Azure Machine Learning Studio Algorithm and Module Reference

Azure Machine Learning is a cloud predictive analytics service that makes it possible to quickly create and deploy predictive models as analytics solutions. The Machine Learning service is cloud-based, and eliminates setup and installation concerns because you can work through your web browser on any Internet-connected PC.

What is Azure Machine Learning Studio?

Microsoft Azure Machine Learning Studio is a collaborative visual development environment that helps you build, test, and deploy predictive analytics solutions in the cloud. You upload data or connect to data already in the cloud, choose an algorithm from a ready-to-use library of algorithms, and build an end-to-end predictive workflow. You can then quickly deploy the model and integrate the workflow in applications by calling a web service.

If you have not used machine learning before, see these resources:

Help for Machine Learning Modules

This documentation contains detailed technical and how-to information for the modules that are available in Azure Machine Learning Studio.

What Is a Module?

Each module represents a set of code that can run independently and perform a machine learning task, given the required inputs. A module might contain a particular algorithm, or perform a task that is important in machine learning, such as missing value replacement, or statistical analysis.

In Studio, you'll find modules organized by functionality:

  • Machine learning algorithms, such as clustering, SVM, or neural networks, are contained within modules that let you customize the machine learning task with appropriate parameters. For classification tasks, you can choose from binary or multiclass algorithms. After you've configured a particular type of model, you use another module to perform training, scoring, or evaluation.
  • Use the Data Input and Output modules to get data from cloud sources. You can write your results or intermediate data to Azure storage, SQL database, or Hive, while running an experiment, or use cloud storage to exchange data between experiments.
  • Data transformation modules support operations on data that are unique to machine learning, such as normalizing or binning data, feature selection, and dimensionality reduction.
  • Text analytics modules include support for Vowpal Wabbit and various natural language proessing (NLP) tasks.
  • Support for external languages is provided by Python and R language modules.
  • Categories for specific machine learning tasks, such as using the OpenCV library for image recognition, time series analysis, and anomaly detection.
    • Other modules support a wide range of statistical functions and statistical methods that are core to data science, or create data summaries.

In this reference section, you'll find technical background on the machine learning algorithms, implementation details if available, and links to sample experiments that demonstrate how the module is used. All examples are available for public use, in the Cortana Intelligence Gallery.

Tip

If you are logged into Studio and have created an experiment, you can get information about a specific module by selecting the module and then clicking the more help link in the Quick Help pane.

Other Technical Reference

Section Description
Data Types List This section contains reference topics describing the learner interfaces, and the DataTable format used for datasets.
Module Parameter Types This section contains reference topics describing the parameter API.
Exceptions List This section lists the errors that can be generated by modules, with causes and possible workarounds.

For the list of error codes related to the Web service API, see Machine Learning REST API Error Codes

See Also

Azure Machine Learning Documentation Center