PipelineOutputAbstractDataset Class
Represents the base class for promoting intermediate data to an Azure Machine Learning Dataset.
Once intermediate data is promoted to an Azure Machine Learning Dataset, subsequent steps consume it as a Dataset instead of a DataReference.
Creates intermediate data that will be promoted to an Azure Machine Learning Dataset.
Inheritance
builtins.object > PipelineOutputAbstractDataset
Constructor
PipelineOutputAbstractDataset(pipeline_data)
Parameters
Name | Description |
---|---|
pipeline_data (Required) | The PipelineData that represents the intermediate output which will be promoted to a Dataset. |
Methods
Method | Description |
---|---|
as_named_input | Set the name of the dataset when it is used as input for subsequent steps. |
create_input_binding | Create an input binding. |
register | Register the output dataset to the workspace. |
as_named_input
Set the name of the dataset when it is used as input for subsequent steps.
as_named_input(name)
Parameters
Name | Description |
---|---|
name (Required) | The name of the dataset for the input. |
Returns
Type | Description |
---|---|
The intermediate data with the new input name. |
create_input_binding
Create an input binding.
create_input_binding()
Returns
Type | Description |
---|---|
The InputPortBinding with this PipelineData as the source. |
register
Register the output dataset to the workspace.
register(name, create_new_version=True)
Parameters
Name | Description |
---|---|
name (Required) | The name of the registered dataset once the intermediate data is produced. |
create_new_version | Whether to create a new version of the dataset if the data source changes. By default, every intermediate output is written to a new location each time a pipeline runs, so keeping this flag set to True is highly recommended. Default value: True |
Remarks
Registration applies only to outputs, not inputs. If you pass the object returned by this method only to the inputs parameter of a pipeline step, nothing is registered. You must pass the object to the outputs parameter of a pipeline step for the registration to happen.
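The remark above can be sketched as follows. This is a minimal sketch rather than an official sample: it assumes `pipeline_data` is an `azureml.pipeline.core.PipelineData` whose `as_dataset()` method returns a concrete subclass of this class, and the helper name `promote_and_register` and its arguments are hypothetical.

```python
def promote_and_register(pipeline_data, dataset_name, input_name):
    """Return (output_for_producer, input_for_consumer).

    Assumption: `pipeline_data` behaves like azureml.pipeline.core.PipelineData,
    i.e. as_dataset() returns an object exposing register() and
    as_named_input() as documented on this page.
    """
    # Promote the intermediate output so later steps see a Dataset.
    promoted = pipeline_data.as_dataset()

    # register() only takes effect if the returned object is passed to the
    # `outputs` parameter of the step that produces the data.
    output_for_producer = promoted.register(
        name=dataset_name, create_new_version=True
    )

    # Name under which the consuming step's script looks up the dataset.
    input_for_consumer = output_for_producer.as_named_input(input_name)
    return output_for_producer, input_for_consumer
```

In a pipeline definition, `output_for_producer` would go in the producing step's `outputs` list and `input_for_consumer` in the consuming step's `inputs` list.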
Attributes
input_name
Get the input name of the PipelineOutputDataset.
You can use this name to retrieve the materialized dataset through an environment variable or the Run class input_datasets property.
Returns
Type | Description |
---|---|
Input name of the PipelineOutputDataset. |
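Inside the consuming step's script, the lookup described above might look like the sketch below. The helper `lookup_input` is hypothetical; `run` is assumed to be the object returned by `Run.get_context()`, and the environment-variable fallback assumes the variable is named exactly after the input name, which this page does not spell out.

```python
import os


def lookup_input(input_name, run=None, environ=None):
    """Return the materialized dataset (or its location) for `input_name`.

    Prefers the run's input_datasets mapping when a run object is given;
    otherwise falls back to an environment variable lookup.
    """
    if run is not None:
        # Run.input_datasets maps input names to the materialized datasets.
        return run.input_datasets[input_name]

    # Assumption: the environment variable carries the same name as the input.
    environ = os.environ if environ is None else environ
    return environ.get(input_name)
```

A script would typically call this with the same name that was passed to `as_named_input` when the pipeline was defined.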
name
Get the output name of the PipelineData.
Returns
Type | Description |
---|---|
The output name of the PipelineData. |