2021/04/26 - Amazon Personalize - 3 new api methods
Changes: Added support for exporting data imported into an Amazon Personalize dataset to a specified data source (Amazon S3 bucket).
Describes the dataset export job created by CreateDatasetExportJob, including the export job status.
See also: AWS API Documentation
Request Syntax
client.describe_dataset_export_job( datasetExportJobArn='string' )
datasetExportJobArn (string) -- [REQUIRED]
The Amazon Resource Name (ARN) of the dataset export job to describe.
Return type: dict
Response Syntax
{
    'datasetExportJob': {
        'jobName': 'string',
        'datasetExportJobArn': 'string',
        'datasetArn': 'string',
        'ingestionMode': 'BULK'|'PUT'|'ALL',
        'roleArn': 'string',
        'status': 'string',
        'jobOutput': {
            's3DataDestination': {
                'path': 'string',
                'kmsKeyArn': 'string'
            }
        },
        'creationDateTime': datetime(2015, 1, 1),
        'lastUpdatedDateTime': datetime(2015, 1, 1),
        'failureReason': 'string'
    }
}
Response Structure
(dict) --
datasetExportJob (dict) --
Information about the dataset export job, including the status.
The status is one of the following values:
CREATE PENDING
CREATE IN_PROGRESS
ACTIVE
CREATE FAILED
jobName (string) --
The name of the export job.
datasetExportJobArn (string) --
The Amazon Resource Name (ARN) of the dataset export job.
datasetArn (string) --
The Amazon Resource Name (ARN) of the dataset to export.
ingestionMode (string) --
The data to export, based on how you imported the data. You can choose to export only BULK data that you imported using a dataset import job, only PUT data that you imported incrementally (using the console or the PutEvents, PutUsers, and PutItems operations), or ALL for both types. The default value is PUT.
roleArn (string) --
The Amazon Resource Name (ARN) of the AWS Identity and Access Management service role that has permissions to add data to your output Amazon S3 bucket.
status (string) --
The status of the dataset export job.
A dataset export job can be in one of the following states:
CREATE PENDING > CREATE IN_PROGRESS > ACTIVE -or- CREATE FAILED
jobOutput (dict) --
The path to the Amazon S3 bucket where the job's output is stored. For example:
s3://bucket-name/folder-name/
s3DataDestination (dict) --
The configuration details of an Amazon S3 input or output bucket.
path (string) --
The file path of the Amazon S3 bucket.
kmsKeyArn (string) --
The Amazon Resource Name (ARN) of the AWS Key Management Service (KMS) key that Amazon Personalize uses to encrypt or decrypt the input and output files of a batch inference job.
creationDateTime (datetime) --
The creation date and time (in Unix time) of the dataset export job.
lastUpdatedDateTime (datetime) --
The date and time (in Unix time) the status of the dataset export job was last updated.
failureReason (string) --
If a dataset export job fails, provides the reason why.
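The response structure above nests the output path two levels down, and a field such as failureReason only carries information when the job has failed. The following is a minimal sketch of reading those fields defensively. The helper name summarize_export_job is hypothetical and not part of boto3; it only assumes the response shape documented above.

```python
def summarize_export_job(response):
    """Flatten a describe_dataset_export_job response into a small dict.

    `response` is expected to have the documented shape:
    {'datasetExportJob': {...}}. Missing optional keys become None.
    """
    job = response["datasetExportJob"]
    # jobOutput -> s3DataDestination -> path, tolerating absent levels
    destination = job.get("jobOutput", {}).get("s3DataDestination", {})
    return {
        "name": job.get("jobName"),
        "status": job.get("status"),
        "output_path": destination.get("path"),
        # Only meaningful when the status is CREATE FAILED
        "failure_reason": job.get("failureReason"),
    }
```

With a real client, this could be called as summarize_export_job(client.describe_dataset_export_job(datasetExportJobArn=arn)).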
Creates a job that exports data from your dataset to an Amazon S3 bucket. To allow Amazon Personalize to export the training data, you must specify a service-linked AWS Identity and Access Management (IAM) role that gives Amazon Personalize PutObject permissions for your Amazon S3 bucket. For more information, see Dataset export job permissions requirements in the Amazon Personalize developer guide.
Status
A dataset export job can be in one of the following states:
CREATE PENDING > CREATE IN_PROGRESS > ACTIVE -or- CREATE FAILED
To get the status of the export job, call DescribeDatasetExportJob, and specify the Amazon Resource Name (ARN) of the dataset export job. The dataset export is complete when the status shows as ACTIVE. If the status shows as CREATE FAILED, the response includes a failureReason key, which describes why the job failed.
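The status check described above can be sketched as a simple polling loop. This is an illustration under stated assumptions, not an official waiter: the function name wait_for_export_job, the polling interval, and the attempt limit are all hypothetical choices, and `client` is assumed to be a Personalize client such as the one returned by boto3.client('personalize').

```python
import time


def wait_for_export_job(client, job_arn, poll_seconds=30, max_attempts=60,
                        sleep=time.sleep):
    """Poll DescribeDatasetExportJob until the job is ACTIVE or has failed.

    Returns the datasetExportJob dict on success. The interval and attempt
    limit are illustrative defaults, not values recommended by AWS.
    """
    for _ in range(max_attempts):
        response = client.describe_dataset_export_job(
            datasetExportJobArn=job_arn
        )
        job = response["datasetExportJob"]
        status = job["status"]
        if status == "ACTIVE":
            return job  # export is complete
        if status == "CREATE FAILED":
            reason = job.get("failureReason", "no reason given")
            raise RuntimeError(f"Dataset export job failed: {reason}")
        # CREATE PENDING or CREATE IN_PROGRESS: wait and ask again
        sleep(poll_seconds)
    raise TimeoutError(
        f"Export job {job_arn} did not finish after {max_attempts} checks"
    )
```

The `sleep` parameter exists only so the delay can be replaced in tests; in normal use the default is enough.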
See also: AWS API Documentation
Request Syntax
client.create_dataset_export_job(
    jobName='string',
    datasetArn='string',
    ingestionMode='BULK'|'PUT'|'ALL',
    roleArn='string',
    jobOutput={
        's3DataDestination': {
            'path': 'string',
            'kmsKeyArn': 'string'
        }
    }
)
jobName (string) -- [REQUIRED]
The name for the dataset export job.
datasetArn (string) -- [REQUIRED]
The Amazon Resource Name (ARN) of the dataset that contains the data to export.
ingestionMode (string) --
The data to export, based on how you imported the data. You can choose to export only BULK data that you imported using a dataset import job, only PUT data that you imported incrementally (using the console or the PutEvents, PutUsers, and PutItems operations), or ALL for both types. The default value is PUT.
roleArn (string) -- [REQUIRED]
The Amazon Resource Name (ARN) of the AWS Identity and Access Management service role that has permissions to add data to your output Amazon S3 bucket.
jobOutput (dict) -- [REQUIRED]
The path to the Amazon S3 bucket where the job's output is stored.
s3DataDestination (dict) -- [REQUIRED]
The configuration details of an Amazon S3 input or output bucket.
path (string) -- [REQUIRED]
The file path of the Amazon S3 bucket.
kmsKeyArn (string) --
The Amazon Resource Name (ARN) of the AWS Key Management Service (KMS) key that Amazon Personalize uses to encrypt or decrypt the input and output files of a batch inference job.
Return type: dict
Response Syntax
{ 'datasetExportJobArn': 'string' }
Response Structure
(dict) --
datasetExportJobArn (string) --
The Amazon Resource Name (ARN) of the dataset export job.
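As a rough illustration of the request shape above, the sketch below assembles the nested jobOutput argument and returns the new job's ARN. The wrapper name start_dataset_export and its parameter names are hypothetical; `client` is assumed to be a Personalize client such as boto3.client('personalize'). The optional kmsKeyArn is only sent when one is supplied.

```python
def start_dataset_export(client, job_name, dataset_arn, role_arn, s3_path,
                         ingestion_mode="PUT", kms_key_arn=None):
    """Call CreateDatasetExportJob and return the datasetExportJobArn.

    ingestion_mode must be one of 'BULK', 'PUT', or 'ALL'; 'PUT' mirrors
    the documented default.
    """
    if ingestion_mode not in ("BULK", "PUT", "ALL"):
        raise ValueError(f"Unsupported ingestion mode: {ingestion_mode!r}")

    # path is required; kmsKeyArn is optional and omitted when not given
    destination = {"path": s3_path}
    if kms_key_arn is not None:
        destination["kmsKeyArn"] = kms_key_arn

    response = client.create_dataset_export_job(
        jobName=job_name,
        datasetArn=dataset_arn,
        ingestionMode=ingestion_mode,
        roleArn=role_arn,
        jobOutput={"s3DataDestination": destination},
    )
    return response["datasetExportJobArn"]
```

The returned ARN is the value that DescribeDatasetExportJob expects when checking the job's status.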
Returns a list of dataset export jobs that use the given dataset. When a dataset is not specified, all the dataset export jobs associated with the account are listed. The response provides the properties for each dataset export job, including the Amazon Resource Name (ARN). For more information on dataset export jobs, see CreateDatasetExportJob. For more information on datasets, see CreateDataset.
See also: AWS API Documentation
Request Syntax
client.list_dataset_export_jobs( datasetArn='string', nextToken='string', maxResults=123 )
datasetArn (string) --
The Amazon Resource Name (ARN) of the dataset to list the dataset export jobs for.
nextToken (string) --
A token returned from the previous call to ListDatasetExportJobs for getting the next set of dataset export jobs (if they exist).
maxResults (integer) --
The maximum number of dataset export jobs to return.
Return type: dict
Response Syntax
{
    'datasetExportJobs': [
        {
            'datasetExportJobArn': 'string',
            'jobName': 'string',
            'status': 'string',
            'creationDateTime': datetime(2015, 1, 1),
            'lastUpdatedDateTime': datetime(2015, 1, 1),
            'failureReason': 'string'
        },
    ],
    'nextToken': 'string'
}
Response Structure
(dict) --
datasetExportJobs (list) --
The list of dataset export jobs.
(dict) --
Provides a summary of the properties of a dataset export job. For a complete listing, call the DescribeDatasetExportJob API.
datasetExportJobArn (string) --
The Amazon Resource Name (ARN) of the dataset export job.
jobName (string) --
The name of the dataset export job.
status (string) --
The status of the dataset export job.
A dataset export job can be in one of the following states:
CREATE PENDING > CREATE IN_PROGRESS > ACTIVE -or- CREATE FAILED
creationDateTime (datetime) --
The date and time (in Unix time) that the dataset export job was created.
lastUpdatedDateTime (datetime) --
The date and time (in Unix time) that the dataset export job status was last updated.
failureReason (string) --
If a dataset export job fails, the reason behind the failure.
nextToken (string) --
A token for getting the next set of dataset export jobs (if they exist).
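Because results are returned a page at a time, a caller typically passes each nextToken back in until none is returned. The generator below is a minimal sketch of that loop; the name iter_dataset_export_jobs and the page_size default are hypothetical, and `client` is assumed to be a Personalize client such as boto3.client('personalize').

```python
def iter_dataset_export_jobs(client, dataset_arn=None, page_size=50):
    """Yield every dataset export job summary, following nextToken.

    When dataset_arn is None, the datasetArn argument is omitted so that
    jobs for all datasets in the account are listed.
    """
    kwargs = {"maxResults": page_size}
    if dataset_arn is not None:
        kwargs["datasetArn"] = dataset_arn

    while True:
        page = client.list_dataset_export_jobs(**kwargs)
        yield from page.get("datasetExportJobs", [])
        token = page.get("nextToken")
        if not token:
            return  # no further pages
        kwargs["nextToken"] = token
```

For example, failed jobs could be collected with [j for j in iter_dataset_export_jobs(client) if j["status"] == "CREATE FAILED"].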