2025/12/02 - Amazon DataZone - 2 new, 3 updated api methods
Changes: Amazon DataZone now supports exporting Catalog datasets as Amazon S3 tables, and provides automatic business glossary term suggestions for data assets.
Gets data export configuration details.
See also: AWS API Documentation
Request Syntax
client.get_data_export_configuration(
domainIdentifier='string'
)
domainIdentifier (string) -- [REQUIRED]
The ID of the domain where you want to get the data export configuration details.
dict
Response Syntax
{
'createdAt': datetime(2015, 1, 1),
'encryptionConfiguration': {
'kmsKeyArn': 'string',
'sseAlgorithm': 'string'
},
'isExportEnabled': True|False,
's3TableBucketArn': 'string',
'status': 'COMPLETED'|'FAILED',
'updatedAt': datetime(2015, 1, 1)
}
Response Structure
(dict) --
createdAt (datetime) --
The timestamp at which the data export configuration report was created.
encryptionConfiguration (dict) --
The encryption configuration as part of the data export configuration details.
kmsKeyArn (string) --
The Amazon Resource Name (ARN) of the KMS key to use for encryption. This field is required only when sseAlgorithm is set to aws:kms.
sseAlgorithm (string) --
The server-side encryption algorithm to use. Valid values are AES256 for S3-managed encryption keys, or aws:kms for Amazon Web Services KMS-managed encryption keys. If you choose SSE-KMS encryption you must grant the S3 Tables maintenance principal access to your KMS key. For more information, see Permissions requirements for S3 Tables SSE-KMS encryption.
isExportEnabled (boolean) --
Specifies whether the export is enabled.
s3TableBucketArn (string) --
The Amazon S3 table bucket ARN as part of the data export configuration details.
status (string) --
The status of the data export configuration.
updatedAt (datetime) --
The timestamp at which the data export configuration report was updated.
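As a minimal sketch of reading the response structure above, the helper below condenses a response dict into one line. The function name summarize_export_config is hypothetical, and the sketch assumes the response is the plain dict returned by client.get_data_export_configuration, with any field possibly absent.

```python
def summarize_export_config(response):
    """Condense a get_data_export_configuration response into one line.

    Hypothetical helper: it only reads fields documented in the
    response structure and tolerates missing ones.
    """
    encryption = response.get("encryptionConfiguration", {})
    algorithm = encryption.get("sseAlgorithm", "unknown")
    state = "enabled" if response.get("isExportEnabled") else "disabled"
    parts = [
        f"export {state}",
        f"status={response.get('status', 'unknown')}",
        f"bucket={response.get('s3TableBucketArn', 'n/a')}",
        f"sse={algorithm}",
    ]
    # kmsKeyArn is only meaningful when sseAlgorithm is aws:kms.
    if algorithm == "aws:kms" and encryption.get("kmsKeyArn"):
        parts.append(f"kms={encryption['kmsKeyArn']}")
    return ", ".join(parts)
```

With a client created elsewhere (for example boto3.client("datazone")), this could be called as summarize_export_config(client.get_data_export_configuration(domainIdentifier=domain_id)).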
Creates data export configuration details.
In the current release, you can enable exporting asset metadata for only one domain per Amazon Web Services account per region. If you disable the asset metadata export feature for a domain where it's already enabled, you cannot enable this feature for another domain in the same Amazon Web Services account and region.
See also: AWS API Documentation
Request Syntax
client.put_data_export_configuration(
clientToken='string',
domainIdentifier='string',
enableExport=True|False,
encryptionConfiguration={
'kmsKeyArn': 'string',
'sseAlgorithm': 'string'
}
)
clientToken (string) --
A unique, case-sensitive identifier to ensure idempotency of the request. This field is automatically populated if not provided.
domainIdentifier (string) -- [REQUIRED]
The domain ID where you want to create data export configuration details.
enableExport (boolean) -- [REQUIRED]
Specifies that the export is to be enabled as part of creating data export configuration details.
encryptionConfiguration (dict) --
The encryption configuration as part of creating data export configuration details.
The KMS key provided here as part of encryptionConfiguration must have the required permissions as described in KMS permissions for exporting asset metadata in Amazon SageMaker Unified Studio.
kmsKeyArn (string) --
The Amazon Resource Name (ARN) of the KMS key to use for encryption. This field is required only when sseAlgorithm is set to aws:kms.
sseAlgorithm (string) --
The server-side encryption algorithm to use. Valid values are AES256 for S3-managed encryption keys, or aws:kms for Amazon Web Services KMS-managed encryption keys. If you choose SSE-KMS encryption you must grant the S3 Tables maintenance principal access to your KMS key. For more information, see Permissions requirements for S3 Tables SSE-KMS encryption.
dict
Response Syntax
{}
Response Structure
(dict) --
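The rule that kmsKeyArn is required only when sseAlgorithm is aws:kms can be checked before the request is sent. The following is a sketch under stated assumptions: build_export_request is a hypothetical helper (not part of the SDK), and omitting encryptionConfiguration entirely when no algorithm is given is a choice made here, not documented behavior.

```python
VALID_SSE_ALGORITHMS = ("AES256", "aws:kms")


def build_export_request(domain_id, enable, sse_algorithm=None, kms_key_arn=None):
    """Build keyword arguments for put_data_export_configuration.

    Hypothetical helper: validates the encryption settings locally
    and returns a plain dict; it does not call the service.
    """
    if sse_algorithm is not None and sse_algorithm not in VALID_SSE_ALGORITHMS:
        raise ValueError(f"unsupported sseAlgorithm: {sse_algorithm!r}")
    if sse_algorithm == "aws:kms" and not kms_key_arn:
        raise ValueError("kmsKeyArn is required when sseAlgorithm is aws:kms")

    request = {"domainIdentifier": domain_id, "enableExport": enable}
    if sse_algorithm is not None:
        encryption = {"sseAlgorithm": sse_algorithm}
        # Only pass the key ARN for SSE-KMS; AES256 uses S3-managed keys.
        if sse_algorithm == "aws:kms":
            encryption["kmsKeyArn"] = kms_key_arn
        request["encryptionConfiguration"] = encryption
    # clientToken is left out on purpose: it is populated automatically.
    return request
```

The result can be passed as client.put_data_export_configuration(**build_export_request(...)), where client is assumed to be a DataZone client created elsewhere.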
Request
{'type': 'BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS'}
Response
{'status': {'PARTIALLY_SUCCEEDED'},
'type': {'BUSINESS_GLOSSARY_ASSOCIATIONS', 'BUSINESS_NAMES'},
'typeStats': [{'errorMessage': 'string',
'status': 'SUBMITTED | IN_PROGRESS | CANCELED | SUCCEEDED | FAILED | PARTIALLY_SUCCEEDED',
'type': 'BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS'}],
'types': ['BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS']}
Gets a metadata generation run in Amazon DataZone.
Prerequisites:
Valid domain and run identifier.
The metadata generation run must exist.
User must have read access to the metadata run.
See also: AWS API Documentation
Request Syntax
client.get_metadata_generation_run(
domainIdentifier='string',
identifier='string',
type='BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS'
)
domainIdentifier (string) -- [REQUIRED]
The ID of the Amazon DataZone domain that contains the metadata generation run you want to get.
identifier (string) -- [REQUIRED]
The identifier of the metadata generation run.
type (string) --
The type of the metadata generation run.
dict
Response Syntax
{
'createdAt': datetime(2015, 1, 1),
'createdBy': 'string',
'domainId': 'string',
'id': 'string',
'owningProjectId': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELED'|'SUCCEEDED'|'FAILED'|'PARTIALLY_SUCCEEDED',
'target': {
'identifier': 'string',
'revision': 'string',
'type': 'ASSET'
},
'type': 'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
'typeStats': [
{
'errorMessage': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELED'|'SUCCEEDED'|'FAILED'|'PARTIALLY_SUCCEEDED',
'type': 'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS'
},
],
'types': [
'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
]
}
Response Structure
(dict) --
createdAt (datetime) --
The timestamp at which the metadata generation run was started.
createdBy (string) --
The Amazon DataZone user who started the metadata generation run.
domainId (string) --
The ID of the Amazon DataZone domain that contains the metadata generation run.
id (string) --
The ID of the metadata generation run.
owningProjectId (string) --
The ID of the project that owns the assets for which you're running metadata generation.
status (string) --
The status of the metadata generation run.
target (dict) --
The asset for which you're generating metadata.
identifier (string) --
The ID of the metadata generation run's target.
revision (string) --
The revision of the asset for which metadata was generated.
type (string) --
The type of the asset for which metadata was generated.
type (string) --
The type of metadata generation run.
typeStats (list) --
The type stats included in the metadata generation run output details.
(dict) --
The statistics of the metadata generation run type.
errorMessage (string) --
The error message displayed if the action fails to run.
status (string) --
The status of the metadata generation run type statistics.
type (string) --
The type of the metadata generation run type statistics.
types (list) --
The types of the metadata generation run.
(string) --
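A run's status and per-type typeStats lend themselves to a simple polling loop. The sketch below is illustrative only: wait_for_run and failed_types are hypothetical helpers, client is assumed to be a DataZone client created elsewhere, and treating CANCELED, SUCCEEDED, FAILED, and PARTIALLY_SUCCEEDED as terminal states is an assumption that the reference above does not state.

```python
import time

# Assumption: these statuses mean the run will not change any further.
TERMINAL_STATUSES = {"CANCELED", "SUCCEEDED", "FAILED", "PARTIALLY_SUCCEEDED"}


def wait_for_run(client, domain_id, run_id, poll_seconds=5.0, max_polls=60,
                 sleep=time.sleep):
    """Poll get_metadata_generation_run until the run reaches a terminal status."""
    for attempt in range(max_polls):
        run = client.get_metadata_generation_run(
            domainIdentifier=domain_id,
            identifier=run_id,
        )
        if run["status"] in TERMINAL_STATUSES:
            return run
        if attempt < max_polls - 1:
            sleep(poll_seconds)
    raise TimeoutError(f"run {run_id} not finished after {max_polls} polls")


def failed_types(run):
    """Map each FAILED entry in typeStats to its error message."""
    return {
        stat["type"]: stat.get("errorMessage", "")
        for stat in run.get("typeStats", [])
        if stat.get("status") == "FAILED"
    }
```

For a PARTIALLY_SUCCEEDED run, failed_types(run) shows which metadata types did not complete. The sleep parameter exists so the loop can be exercised without real waiting.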
Request
{'status': {'PARTIALLY_SUCCEEDED'},
'targetIdentifier': 'string',
'type': {'BUSINESS_GLOSSARY_ASSOCIATIONS', 'BUSINESS_NAMES'}}
Response
{'items': {'status': {'PARTIALLY_SUCCEEDED'},
'type': {'BUSINESS_GLOSSARY_ASSOCIATIONS', 'BUSINESS_NAMES'},
'types': ['BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS']}}
Lists all metadata generation runs.
Metadata generation runs represent automated processes that leverage AI/ML capabilities to create or enhance asset metadata at scale. This feature helps organizations maintain comprehensive and consistent metadata across large numbers of assets without manual intervention. It can automatically generate business descriptions, tags, and other metadata elements, significantly reducing the time and effort required for metadata management while improving consistency and completeness.
Prerequisites:
Valid domain identifier.
User must have access to metadata generation runs in the domain.
See also: AWS API Documentation
Request Syntax
client.list_metadata_generation_runs(
domainIdentifier='string',
maxResults=123,
nextToken='string',
status='SUBMITTED'|'IN_PROGRESS'|'CANCELED'|'SUCCEEDED'|'FAILED'|'PARTIALLY_SUCCEEDED',
targetIdentifier='string',
type='BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS'
)
domainIdentifier (string) -- [REQUIRED]
The ID of the Amazon DataZone domain where you want to list metadata generation runs.
maxResults (integer) --
The maximum number of metadata generation runs to return in a single call to ListMetadataGenerationRuns. When the number of metadata generation runs to be listed is greater than the value of MaxResults, the response contains a NextToken value that you can use in a subsequent call to ListMetadataGenerationRuns to list the next set of metadata generation runs.
nextToken (string) --
When the number of metadata generation runs is greater than the default value for the MaxResults parameter, or if you explicitly specify a value for MaxResults that is less than the number of metadata generation runs, the response includes a pagination token named NextToken. You can specify this NextToken value in a subsequent call to ListMetadataGenerationRuns to list the next set of metadata generation runs.
status (string) --
The status of the metadata generation runs.
targetIdentifier (string) --
The target ID for which you want to list metadata generation runs.
type (string) --
The type of the metadata generation runs.
dict
Response Syntax
{
'items': [
{
'createdAt': datetime(2015, 1, 1),
'createdBy': 'string',
'domainId': 'string',
'id': 'string',
'owningProjectId': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELED'|'SUCCEEDED'|'FAILED'|'PARTIALLY_SUCCEEDED',
'target': {
'identifier': 'string',
'revision': 'string',
'type': 'ASSET'
},
'type': 'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
'types': [
'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
]
},
],
'nextToken': 'string'
}
Response Structure
(dict) --
items (list) --
The results of the ListMetadataGenerationRuns action.
(dict) --
The metadata generation run.
createdAt (datetime) --
The timestamp at which the metadata generation run was created.
createdBy (string) --
The user who created the metadata generation run.
domainId (string) --
The ID of the Amazon DataZone domain in which the metadata generation run was created.
id (string) --
The ID of the metadata generation run.
owningProjectId (string) --
The ID of the project that owns the asset for which the metadata generation was run.
status (string) --
The status of the metadata generation run.
target (dict) --
The asset for which metadata was generated.
identifier (string) --
The ID of the metadata generation run's target.
revision (string) --
The revision of the asset for which metadata was generated.
type (string) --
The type of the asset for which metadata was generated.
type (string) --
The type of the metadata generation run.
types (list) --
The types of the metadata generation run.
(string) --
nextToken (string) --
When the number of metadata generation runs is greater than the default value for the MaxResults parameter, or if you explicitly specify a value for MaxResults that is less than the number of metadata generation runs, the response includes a pagination token named NextToken. You can specify this NextToken value in a subsequent call to ListMetadataGenerationRuns to list the next set of metadata generation runs.
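The nextToken pagination described above can be followed with a short generator. This is a sketch, not an SDK feature: iter_metadata_generation_runs is a hypothetical name, client is assumed to be a DataZone client created elsewhere, and any extra keyword arguments (status, targetIdentifier, type, maxResults) are passed through unchanged.

```python
def iter_metadata_generation_runs(client, domain_id, **filters):
    """Yield every run in a domain, following nextToken across pages."""
    kwargs = {"domainIdentifier": domain_id, **filters}
    while True:
        page = client.list_metadata_generation_runs(**kwargs)
        yield from page.get("items", [])
        token = page.get("nextToken")
        if not token:
            # No token in the response means this was the last page.
            return
        kwargs["nextToken"] = token
```

For example, list(iter_metadata_generation_runs(client, domain_id, status="FAILED")) would collect the failed runs from every page.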
Request
{'type': {'BUSINESS_GLOSSARY_ASSOCIATIONS', 'BUSINESS_NAMES'},
'types': ['BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS']}
Response
{'status': {'PARTIALLY_SUCCEEDED'},
'type': {'BUSINESS_GLOSSARY_ASSOCIATIONS', 'BUSINESS_NAMES'},
'types': ['BUSINESS_DESCRIPTIONS | BUSINESS_NAMES | BUSINESS_GLOSSARY_ASSOCIATIONS']}
Starts the metadata generation run.
Prerequisites:
Asset must be created and belong to the specified domain and project.
Asset type must be supported for metadata generation (e.g., Amazon Web Services Glue table).
Asset must have a structured schema with valid rows and columns.
Valid values for --type: BUSINESS_DESCRIPTIONS, BUSINESS_NAMES, BUSINESS_GLOSSARY_ASSOCIATIONS.
The user must have permission to run metadata generation in the domain/project.
See also: AWS API Documentation
Request Syntax
client.start_metadata_generation_run(
clientToken='string',
domainIdentifier='string',
owningProjectIdentifier='string',
target={
'identifier': 'string',
'revision': 'string',
'type': 'ASSET'
},
type='BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
types=[
'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
]
)
clientToken (string) --
A unique, case-sensitive identifier to ensure idempotency of the request. This field is automatically populated if not provided.
domainIdentifier (string) -- [REQUIRED]
The ID of the Amazon DataZone domain where you want to start a metadata generation run.
owningProjectIdentifier (string) -- [REQUIRED]
The ID of the project that owns the asset for which you want to start a metadata generation run.
target (dict) -- [REQUIRED]
The asset for which you want to start a metadata generation run.
identifier (string) -- [REQUIRED]
The ID of the metadata generation run's target.
revision (string) --
The revision of the asset for which metadata was generated.
type (string) -- [REQUIRED]
The type of the asset for which metadata was generated.
type (string) --
The type of the metadata generation run.
types (list) --
The types of the metadata generation run.
(string) --
dict
Response Syntax
{
'createdAt': datetime(2015, 1, 1),
'createdBy': 'string',
'domainId': 'string',
'id': 'string',
'owningProjectId': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELED'|'SUCCEEDED'|'FAILED'|'PARTIALLY_SUCCEEDED',
'type': 'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
'types': [
'BUSINESS_DESCRIPTIONS'|'BUSINESS_NAMES'|'BUSINESS_GLOSSARY_ASSOCIATIONS',
]
}
Response Structure
(dict) --
createdAt (datetime) --
The timestamp at which the metadata generation run was started.
createdBy (string) --
The ID of the user who started the metadata generation run.
domainId (string) --
The ID of the Amazon DataZone domain in which the metadata generation run was started.
id (string) --
The ID of the metadata generation run.
owningProjectId (string) --
The ID of the project that owns the asset for which the metadata generation run was started.
status (string) --
The status of the metadata generation run.
type (string) --
The type of the metadata generation run.
types (list) --
The types of the metadata generation run.
(string) --
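To show how the request parameters for start_metadata_generation_run fit together, here is a minimal sketch that assembles and validates the keyword arguments. build_start_run_request is a hypothetical helper, and sending the types list rather than the single type parameter is a choice made for this example; the reference above does not say which of the two is preferred.

```python
VALID_TYPES = (
    "BUSINESS_DESCRIPTIONS",
    "BUSINESS_NAMES",
    "BUSINESS_GLOSSARY_ASSOCIATIONS",
)


def build_start_run_request(domain_id, project_id, asset_id, types, revision=None):
    """Build keyword arguments for start_metadata_generation_run.

    Hypothetical helper: validates the requested types locally and
    returns a plain dict; it does not call the service.
    """
    types = list(types)
    if not types:
        raise ValueError("at least one metadata generation type is required")
    unknown = [t for t in types if t not in VALID_TYPES]
    if unknown:
        raise ValueError(f"unsupported types: {unknown}")

    # 'ASSET' is the only target type listed in the request syntax.
    target = {"identifier": asset_id, "type": "ASSET"}
    if revision is not None:
        target["revision"] = revision

    # clientToken is omitted on purpose: it is populated automatically.
    return {
        "domainIdentifier": domain_id,
        "owningProjectIdentifier": project_id,
        "target": target,
        "types": types,
    }
```

With a client created elsewhere, the run could then be started with client.start_metadata_generation_run(**build_start_run_request(...)), and the id field of the response identifies the new run.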