dataset_partition_prep Module

Contains functionality for specifying dataset partition preparation.

Partition preparation occurs automatically when you use an opendatasets class that requires a partition of data, such as the NycTlcGreen class.

Functions

prep_partition_datetime

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_datetime(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, pattern: List[str])

Parameters

dflow
<xref:azureml.dataprep.Dataflow>
Required

An instance of dataprep.Dataflow.

start_date
datetime
Required

The start datetime of the Dataset.

end_date
datetime
Required

The end datetime of the Dataset.

pattern
list
Required

The datetime pattern, given as a list of partition key names such as ['year', 'month'].
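As an illustration of how a pattern list relates to a partition path such as 'year=\d+/month=\d+/', the following is a minimal sketch using only the standard library. The helper partition_regex and the sample file path are hypothetical and are not part of azureml.opendatasets; the actual function operates on a Dataflow and may behave differently.

```python
import re
from typing import List


def partition_regex(pattern: List[str]) -> str:
    """Build a regex such as 'year=\\d+/month=\\d+/' from partition key names."""
    # Hypothetical helper for illustration only; not the library's implementation.
    return "".join(f"{re.escape(key)}=\\d+/" for key in pattern)


regex = partition_regex(["year", "month"])
print(regex)  # year=\d+/month=\d+/

# Locate the partition segment inside a made-up file path.
match = re.search(regex, "green/year=2018/month=5/part-0000.parquet")
print(match.group(0) if match else None)  # year=2018/month=5/
```

Each key contributes one 'key=\d+/' segment, so a longer pattern list yields a deeper partition path.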

prep_partition_puYear_puMonth

Prepare a partition path built from the pattern; with the default pattern this corresponds to 'puYear=\d+/puMonth=\d+/'.

prep_partition_puYear_puMonth(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['puYear', 'puMonth'])

Parameters

dflow
<xref:azureml.dataprep.Dataflow>
Required

An instance of dataprep.Dataflow.

start_date
datetime
Required

The start datetime of the Dataset.

end_date
datetime
Required

The end datetime of the Dataset.

pattern
list
Default value: ['puYear', 'puMonth']

The datetime pattern, given as a list of partition key names. Keyword-only.
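To make the relationship between a date range and puYear/puMonth partitions concrete, here is a rough sketch that lists the month partitions a range would span. It assumes inclusive endpoints and month values without zero padding; neither assumption is confirmed by this reference, and iter_year_months is a hypothetical helper rather than a library function.

```python
from datetime import datetime
from typing import Iterator, Tuple


def iter_year_months(start: datetime, end: datetime) -> Iterator[Tuple[int, int]]:
    """Yield (year, month) pairs from start to end, inclusive (assumed)."""
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield year, month
        # Advance one month, rolling over into the next year after December.
        year, month = (year + 1, 1) if month == 12 else (year, month + 1)


paths = [
    f"puYear={y}/puMonth={m}/"
    for y, m in iter_year_months(datetime(2017, 11, 15), datetime(2018, 2, 1))
]
print(paths)
# ['puYear=2017/puMonth=11/', 'puYear=2017/puMonth=12/',
#  'puYear=2018/puMonth=1/', 'puYear=2018/puMonth=2/']
```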

prep_partition_year

Prepare a partition path built from the pattern; with the default pattern this corresponds to 'year=\d+/'.

prep_partition_year(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year'])

Parameters

dflow
<xref:azureml.dataprep.Dataflow>
Required

An instance of dataprep.Dataflow.

start_date
datetime
Required

The start datetime of the Dataset.

end_date
datetime
Required

The end datetime of the Dataset.

pattern
list
Default value: ['year']

The datetime pattern, given as a list of partition key names. Keyword-only.
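One way to picture year-level partition selection is a regular expression that accepts only the years inside the requested range. The sketch below is an assumption about the general idea, not the library's implementation; year_range_regex and the sample paths are hypothetical.

```python
import re
from datetime import datetime


def year_range_regex(start: datetime, end: datetime) -> "re.Pattern[str]":
    """Compile a regex matching 'year=YYYY/' for every year from start to end."""
    # Hypothetical helper: enumerate the years and join them as alternatives.
    years = "|".join(str(y) for y in range(start.year, end.year + 1))
    return re.compile(rf"year=(?:{years})/")


selector = year_range_regex(datetime(2016, 6, 1), datetime(2017, 1, 31))

# Made-up paths for illustration.
candidates = ["year=2015/a.parquet", "year=2016/a.parquet", "year=2017/a.parquet"]
print([p for p in candidates if selector.search(p)])
# ['year=2016/a.parquet', 'year=2017/a.parquet']
```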

prep_partition_year_month

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_year_month(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month'])

Parameters

dflow
<xref:azureml.dataprep.Dataflow>
Required

An instance of dataprep.Dataflow.

start_date
datetime
Required

The start datetime of the Dataset.

end_date
datetime
Required

The end datetime of the Dataset.

pattern
list
Default value: ['year', 'month']

The datetime pattern, given as a list of partition key names. Keyword-only.
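The following sketch shows one possible way to decide whether a 'year=\d+/month=\d+/' path falls between start_date and end_date, by comparing (year, month) tuples. It assumes both endpoints are inclusive; in_range and the sample paths are hypothetical and do not come from azureml.opendatasets.

```python
import re
from datetime import datetime

YEAR_MONTH = re.compile(r"year=(\d+)/month=(\d+)/")


def in_range(path: str, start: datetime, end: datetime) -> bool:
    """Return True if the path's year/month partition lies within [start, end]."""
    found = YEAR_MONTH.search(path)
    if found is None:
        return False  # The path carries no year/month partition segment.
    key = (int(found.group(1)), int(found.group(2)))
    # Tuples compare element by element, so this orders by year, then month.
    return (start.year, start.month) <= key <= (end.year, end.month)


start, end = datetime(2019, 11, 1), datetime(2020, 1, 31)
print(in_range("year=2019/month=12/part.parquet", start, end))  # True
print(in_range("year=2020/month=2/part.parquet", start, end))   # False
```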

prep_partition_year_month_day

Prepare a partition path built from the pattern; with the default pattern this corresponds to 'year=\d+/month=\d+/day=\d+/'.

prep_partition_year_month_day(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month', 'day'])

Parameters

dflow
<xref:azureml.dataprep.Dataflow>
Required

An instance of dataprep.Dataflow.

start_date
datetime
Required

The start datetime of the Dataset.

end_date
datetime
Required

The end datetime of the Dataset.

pattern
list
Default value: ['year', 'month', 'day']

The datetime pattern, given as a list of partition key names. Keyword-only.
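For day-level partitions, a date range can be expanded into one path per calendar day. This is only a sketch under the assumptions that both endpoints are inclusive and that values are not zero padded; daily_partitions is a hypothetical helper, not a function of this module.

```python
from datetime import datetime, timedelta
from typing import List


def daily_partitions(start: datetime, end: datetime) -> List[str]:
    """List 'year=Y/month=M/day=D/' paths for each day from start to end."""
    day = start.date()
    paths = []
    while day <= end.date():
        paths.append(f"year={day.year}/month={day.month}/day={day.day}/")
        day += timedelta(days=1)  # Calendar arithmetic handles month and leap-year rollover.
    return paths


print(daily_partitions(datetime(2020, 2, 28), datetime(2020, 3, 1)))
# ['year=2020/month=2/day=28/', 'year=2020/month=2/day=29/', 'year=2020/month=3/day=1/']
```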