ML Studio (classic) module descriptions

Article
05/06/2019

Important

Support for Machine Learning Studio (classic) will end on 31 August 2024. We recommend you transition to Azure Machine Learning by that date.

Beginning 1 December 2021, you will not be able to create new Machine Learning Studio (classic) resources. Through 31 August 2024, you can continue to use the existing Machine Learning Studio (classic) resources.

See information on moving machine learning projects from ML Studio (classic) to Azure Machine Learning.
Learn more about Azure Machine Learning.

ML Studio (classic) documentation is being retired and may not be updated in the future.

This topic provides an overview of all the modules included in Machine Learning Studio (classic), which is an interactive, visual workspace to easily build and test predictive models.

Note

Applies to: Machine Learning Studio (classic) only

Similar drag-and-drop modules are available in Azure Machine Learning designer.

What is a module?

In Machine Learning Studio (classic), a module is a building block for creating experiments. Each module encapsulates a specific machine learning algorithm, function, or code library that can act on data in your workspace. The modules are designed to accept connections from other modules, to share and modify data.

The code that runs in each module comes from many sources. These include open source libraries and languages, algorithms developed by Microsoft Research, and tools for working with Azure and other cloud services.

Tip

Looking for machine learning algorithms? See the Machine Learning category, which contains modules for decision trees, clustering, neural networks, among others. The Train and Evaluate categories include modules to help train and test your models.

By connecting and configuring modules, you can create a workflow that reads data from external sources, prepares it for analysis, applies machine learning algorithms, and generates results.

When an experiment is open in Machine Learning Studio (classic), you can see the complete list of current modules in the navigation pane at left. You drag these building blocks into your experiment, and then connect them to create a complete machine learning workflow, called an experiment.

Sometimes modules are updated to add new functionality, or to remove older code. When this happens, any experiments that you created that use the module continue to run. But the next time you open the experiment, you are prompted to upgrade the module, or to use a different module.

Examples

For an example of how to build a complete machine learning experiment, see these tutorials:

Module categories

To make it easier to find related modules, the machine learning tools in Machine Learning Studio (classic) are grouped by these categories.

Data Format conversions

Use these modules to convert data to one of the formats used by other machine learning tools or formats.

Data Input and Output

Use these modules to read data and models from cloud data sources, including Hadoop clusters, Azure Table storage, and web URLs. You can also use these modules to write results to storage or to a database.
Data Transformation

Use these modules to prepare data for analysis. You can change data types, flag columns as features or labels, generate features, and scale or normalize data.
Filter

Transform numeric data derived from digital signal processing.
Learning With Counts

Use joint probability distributions to build features that compactly describe large datasets.
Manipulation

This group provides a variety of tools for data science. For example, you can remove or replace missing values, choose a subset of columns, add a column, or concatenate two datasets.
Sample and Split

Divide a dataset by criteria or by size, to create training and test sets, or to isolate certain rows.
Scale and Reduce

Transform numerical data.

Feature Selection

Use these modules to identify the best features in your data, using widely researched statistical methods.

Machine Learning

This group contains most of the machine learning algorithms supported by Machine Learning.

It also contains modules intended to support the algorithms by training models, generating scores, and evaluating model performance.

Evaluate

After you have trained a model, use these tools to measure the model’s accuracy.
Initialize

These modules provide the machine learning algorithms, which you can customize by setting parameters. The algorithms in this section are grouped by type:
Score

Use these modules to pass new data through the algorithm, and generate a set of results for evaluation. You can also use the results of scoring as part of a predictive service.
Train

These modules train an initialized machine learning model on data you provide.

Normalization, grouping, and scaling of data
Computing statistical distribution of data
Conversion to other machine learning formats
Import of data used for machine learning experiments and export of results
Text analytics, feature selection, and dimensionality reduction

If you need more sophisticated facilities for data manipulation and storage, see the following:

Azure Data Factory: Enterprise-ready, cloud data processing pipelines.
Azure SQL Database: Scalable storage, with integrated access to machine learning.
CosmosDB: NoSQL data store; import data to Machine Learning Studio (classic).
Azure Data Lake Analytics: Distributed analytics on big data.
Stream Analytics: Event processing for the Internet of Things.
Azure Text Analytics: Multiple options for text processing, and related Cognitive Services for speech, image, and facial recognition.
Azure Databricks: Spark-based analytics platform.

ML Studio (classic) module descriptions

What is a module?

Examples

Module categories

Data Format conversions

Feature Selection

Machine Learning

OpenCV Library Modules

R Language Modules

Python Language Modules

Statistical Functions

Text Analytics

Time Series

See also

Additional resources

ML Studio (classic) module descriptions

What is a module?

Examples

Module categories

Related tasks

See also

Additional resources