dataset_factory Module

Contains functionality to create datasets for Azure Machine Learning.

Classes

DataType

Configures column data types for a dataset created in Azure Machine Learning.

DataType methods are used in the TabularDatasetFactory class from_* methods, which are used to create new TabularDataset objects.

FileDatasetFactory

Contains methods to create a file dataset for Azure Machine Learning.

A FileDataset is created from the from_files method defined in this class.

For more information on working with file datasets, see the notebook https://aka.ms/filedataset-samplenotebook.

TabularDatasetFactory

Contains methods to create a tabular dataset for Azure Machine Learning.

A TabularDataset is created using the from_* methods in this class, for example, the method from_delimited_files.

For more information on working with tabular datasets, see the notebook https://aka.ms/tabulardataset-samplenotebook.