dataset_partition_prep Module
Contains functionality for specifying dataset partition preparation.
Partition preparation occurs automatically, when you use a opendatasets classe that requires a partition of data, such as the NycTlcGreen class.
Functions
prep_partition_datetime
Prepare partition path 'year=\d+/month=\d+/'.
prep_partition_datetime(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, pattern: List[str])
Parameters
Name | Description |
---|---|
dflow
Required
|
<xref:azureml.dataprep.Dataflow>
An instance of dataprep.Dataflow. |
start_date
Required
|
The start datetime of the Dataset. |
end_date
Required
|
The end datetime of the Dataset. |
pattern
Required
|
The datetime pattern. |
prep_partition_puYear_puMonth
Prepare partition path 'year=\d+/month=\d+/'.
prep_partition_puYear_puMonth(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['puYear', 'puMonth'])
Parameters
Name | Description |
---|---|
dflow
Required
|
<xref:azureml.dataprep.Dataflow>
An instance of dataprep.Dataflow. |
start_date
Required
|
The start datetime of the Dataset. |
end_date
Required
|
The end datetime of the Dataset. |
pattern
Required
|
The datetime pattern. |
Keyword-Only Parameters
Name | Description |
---|---|
pattern
|
Default value: ['puYear', 'puMonth']
|
prep_partition_year
Prepare partition path 'year=\d+/month=\d+/'.
prep_partition_year(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year'])
Parameters
Name | Description |
---|---|
dflow
Required
|
<xref:azureml.dataprep.Dataflow>
An instance of dataprep.Dataflow. |
start_date
Required
|
The start datetime of the Dataset. |
end_date
Required
|
The end datetime of the Dataset. |
pattern
Required
|
The datetime pattern. |
Keyword-Only Parameters
Name | Description |
---|---|
pattern
|
Default value: ['year']
|
prep_partition_year_month
Prepare partition path 'year=\d+/month=\d+/'.
prep_partition_year_month(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month'])
Parameters
Name | Description |
---|---|
dflow
Required
|
<xref:azureml.dataprep.Dataflow>
An instance of dataprep.Dataflow. |
start_date
Required
|
The start datetime of the Dataset. |
end_date
Required
|
The end datetime of the Dataset. |
pattern
Required
|
The datetime pattern. |
Keyword-Only Parameters
Name | Description |
---|---|
pattern
|
Default value: ['year', 'month']
|
prep_partition_year_month_day
Prepare partition path 'year=\d+/month=\d+/'.
prep_partition_year_month_day(dflow: EnginelessDataflow, start_date: datetime, end_date: datetime, *, pattern: List[str] = ['year', 'month', 'day'])
Parameters
Name | Description |
---|---|
dflow
Required
|
<xref:azureml.dataprep.Dataflow>
An instance of dataprep.Dataflow. |
start_date
Required
|
The start datetime of the Dataset. |
end_date
Required
|
The end datetime of the Dataset. |
pattern
Required
|
The datetime pattern. |
Keyword-Only Parameters
Name | Description |
---|---|
pattern
|
Default value: ['year', 'month', 'day']
|