Azure Machine Learning Studio: algorithm and module help

Azure Machine Learning Studio is a cloud predictive analytics service that makes it possible to quickly create and deploy predictive models as analytics solutions. The machine learning tools are mostly cloud-based services, which eliminates setup and installation concerns because you can work through your web browser on any internet-connected PC. See the article, "What is Studio?" for more details.

This documentation contains detailed technical and how-to information for the modules that are available in Machine Learning Studio.

What is a module?

Each module in Machine Learning Studio represents a set of code that can run independently and perform a machine learning task, given the required inputs. A module might contain a particular algorithm, or perform a task that is important in machine learning, such as missing value replacement, or statistical analysis.

In Studio, modules are organized by functionality:

  • Data input and output modules do the work of moving data from cloud sources into your experiment. You can write your results or intermediate data to Azure Storage, a SQL database, or Hive, while running an experiment, or use cloud storage to exchange data between experiments.

  • Data transformation modules support operations on data that are unique to machine learning, such as normalizing or binning data, feature selection, and dimensionality reduction.

  • Machine learning algorithms, such as clustering, support vector machine, or neural networks, are available within individual modules that let you customize the machine learning task with appropriate parameters. For classification tasks, you can choose from binary or multiclass algorithms.

    After you've configured the model, use a training module to run data through the algorithm, and measure the accuracy of the trained model by using one of the evaluation modules. To get predictions from the model you've just trained, use one of the scoring modules.

  • Anomaly detection: Machine Learning Studio includes multiple algorithms specialized for these tasks.

  • Text analytics modules support various natural language processing tasks.

  • Vowpal Wabbit support makes it easy to use this scalable platform.

  • Python and R language modules make it easy to run a custom function. You write the code, and embed it in a module, to integrate Python and R with an experiment service.

  • OpenCV library provides modules to use in specific image recognition tasks.

  • Time series analysis supports anomaly detection in time series.

  • Statistical modules provide a wide variety of numerical methods related to data science. Look in this group for correlation methods, data summaries, and statistical and math operations.

In this reference section, you'll find technical background on the machine learning algorithms, implementation details if available, and links to sample experiments that demonstrate how the module is used. You can download examples in the Azure AI Gallery to your workspace. These examples are for public use.

Tip

If you are signed in to Machine Learning Studio and have created an experiment, you can get information about a specific module. Select the module, then select the more help link in the Quick Help pane.

Other technical references

Section Description
Data Types List This section contains reference topics describing the learner interfaces, and the DataTable format used for datasets.
Module Parameter Types This section contains reference topics describing the parameter API.
Exceptions List This section lists the errors that modules can generate, with causes and possible workarounds.

For the list of error codes related to the web service API, see Machine Learning REST API error codes.

See also

Azure Machine Learning Studio documentation