dataset_partition_prep Module

Contains functionality for specifying dataset partition preparation.

Partition preparation occurs automatically, when you use a opendatasets classe that requires a partition of data, such as the NycTlcGreen class.

Functions

prep_partition_datetime

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_datetime(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, pattern: List[str])

Parameters

Name Description
dflow
Required
<xref:azureml.dataprep.Dataflow>

An instance of dataprep.Dataflow.

start_date
Required

The start datetime of the Dataset.

end_date
Required

The end datetime of the Dataset.

pattern
Required

The datetime pattern.

prep_partition_puYear_puMonth

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_puYear_puMonth(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['puYear', 'puMonth'])

Parameters

Name Description
dflow
Required
<xref:azureml.dataprep.Dataflow>

An instance of dataprep.Dataflow.

start_date
Required

The start datetime of the Dataset.

end_date
Required

The end datetime of the Dataset.

pattern
Required

The datetime pattern.

Keyword-Only Parameters

Name Description
pattern
Default value: ['puYear', 'puMonth']

prep_partition_year

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_year(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year'])

Parameters

Name Description
dflow
Required
<xref:azureml.dataprep.Dataflow>

An instance of dataprep.Dataflow.

start_date
Required

The start datetime of the Dataset.

end_date
Required

The end datetime of the Dataset.

pattern
Required

The datetime pattern.

Keyword-Only Parameters

Name Description
pattern
Default value: ['year']

prep_partition_year_month

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_year_month(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month'])

Parameters

Name Description
dflow
Required
<xref:azureml.dataprep.Dataflow>

An instance of dataprep.Dataflow.

start_date
Required

The start datetime of the Dataset.

end_date
Required

The end datetime of the Dataset.

pattern
Required

The datetime pattern.

Keyword-Only Parameters

Name Description
pattern
Default value: ['year', 'month']

prep_partition_year_month_day

Prepare partition path 'year=\d+/month=\d+/'.

prep_partition_year_month_day(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month', 'day'])

Parameters

Name Description
dflow
Required
<xref:azureml.dataprep.Dataflow>

An instance of dataprep.Dataflow.

start_date
Required

The start datetime of the Dataset.

end_date
Required

The end datetime of the Dataset.

pattern
Required

The datetime pattern.

Keyword-Only Parameters

Name Description
pattern
Default value: ['year', 'month', 'day']