Appends a set of rows from an input dataset to the end of another dataset
Category: Data Transformation / Manipulation
Applies to: Machine Learning Studio (classic)
This content pertains only to Studio (classic). Similar drag and drop modules have been added to Azure Machine Learning designer (preview). Learn more in this article comparing the two versions.
This article describes how to use the Add Rows module in Azure Machine Learning to concatenate two datasets. In concatenation, the rows of the second dataset are added to the end of the first dataset.
Concatenation of rows is useful in scenarios such as these:
You have generated a series of evaluation statistics, and you want to combine them into one table for easier reporting.
You have been working with different datasets, and you want to combine the datasets to create a final dataset.
How to use Add Rows
To concatenate rows from two datasets, the rows must have exactly the same schema. This means, the same number of columns, and the same type of data in the columns.
Drag the Add Rows module into your experiment, You can find it under Data Transformation, in the Manipulate category.
Connect the datasets to the two input ports. The dataset that you want to append should be connected to the second (right) port.
Run the experiment. The number of rows in the output dataset should equal the sum of the rows of both input datasets.
If you add the same dataset to both inputs of the Add Rows module, the dataset is duplicated.
This section describes implementation details and common questions.
You cannot filter the source dataset when adding rows. All the rows from both datasets provided as inputs are concatenated when you use Add Rows.
If you want to add only a few rows, use Partition and Sample to define a condition by which to filter the rows and generate a dataset with only the rows you want.
To see examples of how this module is used, see the Azure AI Gallery:
Demand estimation: Combines the result of evaluating multiple models into a single dataset and passes it to an Execute R Script for custom processing
Time Series Forecasting: Uses R scripts to generate custom metrics and then combines them in a single table by using Add Rows.
|Dataset1||Data Table||Dataset rows to be added to the output dataset first|
|Dataset2||Data Table||Dataset rows to be appended to the first dataset|
|Results dataset||Data Table||Dataset that contains all rows of input datasets|
|Error 0003||An exception occurs if one or more of input datasets is null or empty.|
|Error 0010||An exception occurs if input datasets have column names that should match but do not.|
|Error 0016||An exception occurs if input datasets passed to the module should have compatible column types but do not.|
|Error 0008||An exception occurs if the parameter is not in range.|
For a list of errors specific to Studio (classic) modules, see Machine Learning Error codes.
For a list of API exceptions, see Machine Learning REST API Error Codes.