Apply Transformation module

This article describes a module in Azure Machine Learning designer.

Use this module to modify an input dataset based on a previously computed transformation. This module is necessary in if you need to update transformations in inference pipelines.

For example, if you used z-scores to normalize your training data by using the Normalize Data module, you would want to use the z-score value that was computed for training during the scoring phase as well. In Azure Machine Learning, you can save the normalization method as a transform, and then using Apply Transformation to apply the z-score to the input data before scoring.

How to save transformations

The designer lets you save data transformations as datasets so that you can use them in other pipelines.

  1. Select a data transformation module that has successfully run.

  2. Select the Outputs + logs tab.

  3. Find the transformation output, and select the Register dataset to save it as a module under Datasets category in the module palette.

How to use Apply Transformation

  1. Add the Apply Transformation module to your pipeline. You can find this module in the Model Scoring & Evaluation section of the module palette.

  2. Find the saved transformation you want to use under Datasets in the module palette.

  3. Connect the output of the saved transformation to the left input port of the Apply Transformation module.

    The dataset should have exactly the same schema (number of columns, column names, data types) as the dataset for which the transformation was first designed.

  4. Connect the dataset output of the desired module to the right input port of the Apply Transformation module.

  5. To apply a transformation to the new dataset, submit the pipeline.

Important

To make sure the updated transformation in training pipelines is also feasible in inference pipelines, you need to follow the steps below each time there is updated transformation in the training pipeline:

  1. In the training pipeline, register the output of the Select Columns Transform as a dataset. Register dataset of module output
  2. In the inference pipeline, remove the TD- module, and replace it with the registered dataset in the previous step. Replace TD module

Next steps

See the set of modules available to Azure Machine Learning.