Prepare data for modeling with Azure Machine Learning

Data preparation is an important part of a machine learning workflow. Your models will be more accurate and efficient if they have access to clean data in a format that is easier to consume.

You can prepare your data in Python using the Azure Machine Learning Data Prep SDK.

Data preparation pipeline

The main data preparation steps are:

  1. Load data, which can be in various formats
  2. Transform it into a more usable structure
  3. Write that data to a location accessible to your models

Data preparation process

Next steps

Review an example notebook of data preparation using the Azure Machine Learning Data Prep SDK.