Amazon Athena

2023/12/05 - Amazon Athena - 12 updated api methods

Changes  Adding IdentityCenter enabled request for interactive query

BatchGetQueryExecution (updated) Link ¶
Changes (response)
{'QueryExecutions': {'QueryResultsS3AccessGrantsConfiguration': {'AuthenticationType': 'DIRECTORY_IDENTITY',
                                                                 'CreateUserLevelPrefix': 'boolean',
                                                                 'EnableS3AccessGrants': 'boolean'}}}

Returns the details of a single query execution or a list of up to 50 query executions, which you provide as an array of query execution ID strings. Requires you to have access to the workgroup in which the queries ran. To get a list of query execution IDs, use ListQueryExecutionsInput$WorkGroup. Query executions differ from named (saved) queries. Use BatchGetNamedQueryInput to get details about named queries.

See also: AWS API Documentation

Request Syntax

client.batch_get_query_execution(
    QueryExecutionIds=[
        'string',
    ]
)
type QueryExecutionIds

list

param QueryExecutionIds

[REQUIRED]

An array of query execution IDs.

  • (string) --

rtype

dict

returns

Response Syntax

{
    'QueryExecutions': [
        {
            'QueryExecutionId': 'string',
            'Query': 'string',
            'StatementType': 'DDL'|'DML'|'UTILITY',
            'ResultConfiguration': {
                'OutputLocation': 'string',
                'EncryptionConfiguration': {
                    'EncryptionOption': 'SSE_S3'|'SSE_KMS'|'CSE_KMS',
                    'KmsKey': 'string'
                },
                'ExpectedBucketOwner': 'string',
                'AclConfiguration': {
                    'S3AclOption': 'BUCKET_OWNER_FULL_CONTROL'
                }
            },
            'ResultReuseConfiguration': {
                'ResultReuseByAgeConfiguration': {
                    'Enabled': True|False,
                    'MaxAgeInMinutes': 123
                }
            },
            'QueryExecutionContext': {
                'Database': 'string',
                'Catalog': 'string'
            },
            'Status': {
                'State': 'QUEUED'|'RUNNING'|'SUCCEEDED'|'FAILED'|'CANCELLED',
                'StateChangeReason': 'string',
                'SubmissionDateTime': datetime(2015, 1, 1),
                'CompletionDateTime': datetime(2015, 1, 1),
                'AthenaError': {
                    'ErrorCategory': 123,
                    'ErrorType': 123,
                    'Retryable': True|False,
                    'ErrorMessage': 'string'
                }
            },
            'Statistics': {
                'EngineExecutionTimeInMillis': 123,
                'DataScannedInBytes': 123,
                'DataManifestLocation': 'string',
                'TotalExecutionTimeInMillis': 123,
                'QueryQueueTimeInMillis': 123,
                'ServicePreProcessingTimeInMillis': 123,
                'QueryPlanningTimeInMillis': 123,
                'ServiceProcessingTimeInMillis': 123,
                'ResultReuseInformation': {
                    'ReusedPreviousResult': True|False
                }
            },
            'WorkGroup': 'string',
            'EngineVersion': {
                'SelectedEngineVersion': 'string',
                'EffectiveEngineVersion': 'string'
            },
            'ExecutionParameters': [
                'string',
            ],
            'SubstatementType': 'string',
            'QueryResultsS3AccessGrantsConfiguration': {
                'EnableS3AccessGrants': True|False,
                'CreateUserLevelPrefix': True|False,
                'AuthenticationType': 'DIRECTORY_IDENTITY'
            }
        },
    ],
    'UnprocessedQueryExecutionIds': [
        {
            'QueryExecutionId': 'string',
            'ErrorCode': 'string',
            'ErrorMessage': 'string'
        },
    ]
}

Response Structure

  • (dict) --

    • QueryExecutions (list) --

      Information about a query execution.

      • (dict) --

        Information about a single instance of a query execution.

        • QueryExecutionId (string) --

          The unique identifier for each query execution.

        • Query (string) --

          The SQL query statements which the query execution ran.

        • StatementType (string) --

          The type of query statement that was run. DDL indicates DDL query statements. DML indicates DML (Data Manipulation Language) query statements, such as CREATE TABLE AS SELECT . UTILITY indicates query statements other than DDL and DML, such as SHOW CREATE TABLE , or DESCRIBE TABLE .

        • ResultConfiguration (dict) --

          The location in Amazon S3 where query and calculation results are stored and the encryption option, if any, used for query results. These are known as "client-side settings". If workgroup settings override client-side settings, then the query uses the location for the query results and the encryption configuration that are specified for the workgroup.

          • OutputLocation (string) --

            The location in Amazon S3 where your query and calculation results are stored, such as s3://path/to/query/bucket/ . To run the query, you must specify the query results location using one of the ways: either for individual queries using either this setting (client-side), or in the workgroup, using WorkGroupConfiguration. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files. If workgroup settings override client-side settings, then the query uses the settings specified for the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

          • EncryptionConfiguration (dict) --

            If query and calculation results are encrypted in Amazon S3, indicates the encryption option used (for example, SSE_KMS or CSE_KMS ) and key information. This is a client-side setting. If workgroup settings override client-side settings, then the query uses the encryption configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

            • EncryptionOption (string) --

              Indicates whether Amazon S3 server-side encryption with Amazon S3-managed keys ( SSE_S3 ), server-side encryption with KMS-managed keys ( SSE_KMS ), or client-side encryption with KMS-managed keys ( CSE_KMS ) is used.

              If a query runs in a workgroup and the workgroup overrides client-side settings, then the workgroup's setting for encryption is used. It specifies whether query results must be encrypted, for all queries that run in this workgroup.

            • KmsKey (string) --

              For SSE_KMS and CSE_KMS , this is the KMS key ARN or ID.

          • ExpectedBucketOwner (string) --

            The Amazon Web Services account ID that you expect to be the owner of the Amazon S3 bucket specified by ResultConfiguration$OutputLocation. If set, Athena uses the value for ExpectedBucketOwner when it makes Amazon S3 calls to your specified output location. If the ExpectedBucketOwner Amazon Web Services account ID does not match the actual owner of the Amazon S3 bucket, the call fails with a permissions error.

            This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ExpectedBucketOwner setting that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

          • AclConfiguration (dict) --

            Indicates that an Amazon S3 canned ACL should be set to control ownership of stored query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ACL configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. For more information, see WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

            • S3AclOption (string) --

              The Amazon S3 canned ACL that Athena should specify when storing query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . If a query runs in a workgroup and the workgroup overrides client-side settings, then the Amazon S3 canned ACL specified in the workgroup's settings is used for all queries that run in the workgroup. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 User Guide .

        • ResultReuseConfiguration (dict) --

          Specifies the query result reuse behavior that was used for the query.

          • ResultReuseByAgeConfiguration (dict) --

            Specifies whether previous query results are reused, and if so, their maximum age.

            • Enabled (boolean) --

              True if previous query results can be reused when the query is run; otherwise, false. The default is false.

            • MaxAgeInMinutes (integer) --

              Specifies, in minutes, the maximum age of a previous query result that Athena should consider for reuse. The default is 60.

        • QueryExecutionContext (dict) --

          The database in which the query execution occurred.

          • Database (string) --

            The name of the database used in the query execution. The database must exist in the catalog.

          • Catalog (string) --

            The name of the data catalog used in the query execution.

        • Status (dict) --

          The completion date, current state, submission time, and state change reason (if applicable) for the query execution.

          • State (string) --

            The state of query execution. QUEUED indicates that the query has been submitted to the service, and Athena will execute the query as soon as resources are available. RUNNING indicates that the query is in execution phase. SUCCEEDED indicates that the query completed without errors. FAILED indicates that the query experienced an error and did not complete processing. CANCELLED indicates that a user input interrupted query execution.

            Note

            Athena automatically retries your queries in cases of certain transient errors. As a result, you may see the query state transition from RUNNING or FAILED to QUEUED .

          • StateChangeReason (string) --

            Further detail about the status of the query.

          • SubmissionDateTime (datetime) --

            The date and time that the query was submitted.

          • CompletionDateTime (datetime) --

            The date and time that the query completed.

          • AthenaError (dict) --

            Provides information about an Athena query error.

            • ErrorCategory (integer) --

              An integer value that specifies the category of a query failure error. The following list shows the category for each integer value.

              1 - System

              2 - User

              3 - Other

            • ErrorType (integer) --

              An integer value that provides specific information about an Athena query error. For the meaning of specific values, see the Error Type Reference in the Amazon Athena User Guide .

            • Retryable (boolean) --

              True if the query might succeed if resubmitted.

            • ErrorMessage (string) --

              Contains a short description of the error that occurred.

        • Statistics (dict) --

          Query execution statistics, such as the amount of data scanned, the amount of time that the query took to process, and the type of statement that was run.

          • EngineExecutionTimeInMillis (integer) --

            The number of milliseconds that the query took to execute.

          • DataScannedInBytes (integer) --

            The number of bytes in the data that was queried.

          • DataManifestLocation (string) --

            The location and file name of a data manifest file. The manifest file is saved to the Athena query results location in Amazon S3. The manifest file tracks files that the query wrote to Amazon S3. If the query fails, the manifest file also tracks files that the query intended to write. The manifest is useful for identifying orphaned files resulting from a failed query. For more information, see Working with Query Results, Output Files, and Query History in the Amazon Athena User Guide .

          • TotalExecutionTimeInMillis (integer) --

            The number of milliseconds that Athena took to run the query.

          • QueryQueueTimeInMillis (integer) --

            The number of milliseconds that the query was in your query queue waiting for resources. Note that if transient errors occur, Athena might automatically add the query back to the queue.

          • ServicePreProcessingTimeInMillis (integer) --

            The number of milliseconds that Athena took to preprocess the query before submitting the query to the query engine.

          • QueryPlanningTimeInMillis (integer) --

            The number of milliseconds that Athena took to plan the query processing flow. This includes the time spent retrieving table partitions from the data source. Note that because the query engine performs the query planning, query planning time is a subset of engine processing time.

          • ServiceProcessingTimeInMillis (integer) --

            The number of milliseconds that Athena took to finalize and publish the query results after the query engine finished running the query.

          • ResultReuseInformation (dict) --

            Contains information about whether previous query results were reused for the query.

            • ReusedPreviousResult (boolean) --

              True if a previous query result was reused; false if the result was generated from a new run of the query.

        • WorkGroup (string) --

          The name of the workgroup in which the query ran.

        • EngineVersion (dict) --

          The engine version that executed the query.

          • SelectedEngineVersion (string) --

            The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

          • EffectiveEngineVersion (string) --

            Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

        • ExecutionParameters (list) --

          A list of values for the parameters in a query. The values are applied sequentially to the parameters in the query in the order in which the parameters occur. The list of parameters is not returned in the response.

          • (string) --

        • SubstatementType (string) --

          The kind of query statement that was run.

        • QueryResultsS3AccessGrantsConfiguration (dict) --

          Specifies whether Amazon S3 access grants are enabled for query results.

          • EnableS3AccessGrants (boolean) --

            Specifies whether Amazon S3 access grants are enabled for query results.

          • CreateUserLevelPrefix (boolean) --

            When enabled, appends the user ID as an Amazon S3 path prefix to the query result output location.

          • AuthenticationType (string) --

            The authentication type used for Amazon S3 access grants. Currently, only DIRECTORY_IDENTITY is supported.

    • UnprocessedQueryExecutionIds (list) --

      Information about the query executions that failed to run.

      • (dict) --

        Describes a query execution that failed to process.

        • QueryExecutionId (string) --

          The unique identifier of the query execution.

        • ErrorCode (string) --

          The error code returned when the query execution failed to process, if applicable.

        • ErrorMessage (string) --

          The error message returned when the query execution failed to process, if applicable.

CreateWorkGroup (updated) Link ¶
Changes (request)
{'Configuration': {'IdentityCenterConfiguration': {'EnableIdentityCenter': 'boolean',
                                                   'IdentityCenterInstanceArn': 'string'},
                   'QueryResultsS3AccessGrantsConfiguration': {'AuthenticationType': 'DIRECTORY_IDENTITY',
                                                               'CreateUserLevelPrefix': 'boolean',
                                                               'EnableS3AccessGrants': 'boolean'}}}

Creates a workgroup with the specified name. A workgroup can be an Apache Spark enabled workgroup or an Athena SQL workgroup.

See also: AWS API Documentation

Request Syntax

client.create_work_group(
    Name='string',
    Configuration={
        'ResultConfiguration': {
            'OutputLocation': 'string',
            'EncryptionConfiguration': {
                'EncryptionOption': 'SSE_S3'|'SSE_KMS'|'CSE_KMS',
                'KmsKey': 'string'
            },
            'ExpectedBucketOwner': 'string',
            'AclConfiguration': {
                'S3AclOption': 'BUCKET_OWNER_FULL_CONTROL'
            }
        },
        'EnforceWorkGroupConfiguration': True|False,
        'PublishCloudWatchMetricsEnabled': True|False,
        'BytesScannedCutoffPerQuery': 123,
        'RequesterPaysEnabled': True|False,
        'EngineVersion': {
            'SelectedEngineVersion': 'string',
            'EffectiveEngineVersion': 'string'
        },
        'AdditionalConfiguration': 'string',
        'ExecutionRole': 'string',
        'CustomerContentEncryptionConfiguration': {
            'KmsKey': 'string'
        },
        'EnableMinimumEncryptionConfiguration': True|False,
        'IdentityCenterConfiguration': {
            'EnableIdentityCenter': True|False,
            'IdentityCenterInstanceArn': 'string'
        },
        'QueryResultsS3AccessGrantsConfiguration': {
            'EnableS3AccessGrants': True|False,
            'CreateUserLevelPrefix': True|False,
            'AuthenticationType': 'DIRECTORY_IDENTITY'
        }
    },
    Description='string',
    Tags=[
        {
            'Key': 'string',
            'Value': 'string'
        },
    ]
)
type Name

string

param Name

[REQUIRED]

The workgroup name.

type Configuration

dict

param Configuration

Contains configuration information for creating an Athena SQL workgroup or Spark enabled Athena workgroup. Athena SQL workgroup configuration includes the location in Amazon S3 where query and calculation results are stored, the encryption configuration, if any, used for encrypting query results, whether the Amazon CloudWatch Metrics are enabled for the workgroup, the limit for the amount of bytes scanned (cutoff) per query, if it is specified, and whether workgroup's settings (specified with EnforceWorkGroupConfiguration ) in the WorkGroupConfiguration override client-side settings. See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

  • ResultConfiguration (dict) --

    The configuration for the workgroup, which includes the location in Amazon S3 where query and calculation results are stored and the encryption option, if any, used for query and calculation results. To run the query, you must specify the query results location using one of the ways: either in the workgroup using this setting, or for individual queries (client-side), using ResultConfiguration$OutputLocation. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files.

    • OutputLocation (string) --

      The location in Amazon S3 where your query and calculation results are stored, such as s3://path/to/query/bucket/ . To run the query, you must specify the query results location using one of the ways: either for individual queries using either this setting (client-side), or in the workgroup, using WorkGroupConfiguration. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files. If workgroup settings override client-side settings, then the query uses the settings specified for the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

    • EncryptionConfiguration (dict) --

      If query and calculation results are encrypted in Amazon S3, indicates the encryption option used (for example, SSE_KMS or CSE_KMS ) and key information. This is a client-side setting. If workgroup settings override client-side settings, then the query uses the encryption configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

      • EncryptionOption (string) -- [REQUIRED]

        Indicates whether Amazon S3 server-side encryption with Amazon S3-managed keys ( SSE_S3 ), server-side encryption with KMS-managed keys ( SSE_KMS ), or client-side encryption with KMS-managed keys ( CSE_KMS ) is used.

        If a query runs in a workgroup and the workgroup overrides client-side settings, then the workgroup's setting for encryption is used. It specifies whether query results must be encrypted, for all queries that run in this workgroup.

      • KmsKey (string) --

        For SSE_KMS and CSE_KMS , this is the KMS key ARN or ID.

    • ExpectedBucketOwner (string) --

      The Amazon Web Services account ID that you expect to be the owner of the Amazon S3 bucket specified by ResultConfiguration$OutputLocation. If set, Athena uses the value for ExpectedBucketOwner when it makes Amazon S3 calls to your specified output location. If the ExpectedBucketOwner Amazon Web Services account ID does not match the actual owner of the Amazon S3 bucket, the call fails with a permissions error.

      This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ExpectedBucketOwner setting that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

    • AclConfiguration (dict) --

      Indicates that an Amazon S3 canned ACL should be set to control ownership of stored query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ACL configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. For more information, see WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

      • S3AclOption (string) -- [REQUIRED]

        The Amazon S3 canned ACL that Athena should specify when storing query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . If a query runs in a workgroup and the workgroup overrides client-side settings, then the Amazon S3 canned ACL specified in the workgroup's settings is used for all queries that run in the workgroup. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 User Guide .

  • EnforceWorkGroupConfiguration (boolean) --

    If set to "true", the settings for the workgroup override client-side settings. If set to "false", client-side settings are used. For more information, see Workgroup Settings Override Client-Side Settings.

  • PublishCloudWatchMetricsEnabled (boolean) --

    Indicates that the Amazon CloudWatch metrics are enabled for the workgroup.

  • BytesScannedCutoffPerQuery (integer) --

    The upper data usage limit (cutoff) for the amount of bytes a single query in a workgroup is allowed to scan.

  • RequesterPaysEnabled (boolean) --

    If set to true , allows members assigned to a workgroup to reference Amazon S3 Requester Pays buckets in queries. If set to false , workgroup members cannot query data from Requester Pays buckets, and queries that retrieve data from Requester Pays buckets cause an error. The default is false . For more information about Requester Pays buckets, see Requester Pays Buckets in the Amazon Simple Storage Service Developer Guide .

  • EngineVersion (dict) --

    The engine version that all queries running on the workgroup use. Queries on the AmazonAthenaPreviewFunctionality workgroup run on the preview engine regardless of this setting.

    • SelectedEngineVersion (string) --

      The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

    • EffectiveEngineVersion (string) --

      Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

  • AdditionalConfiguration (string) --

    Specifies a user defined JSON string that is passed to the notebook engine.

  • ExecutionRole (string) --

    The ARN of the execution role used to access user resources for Spark sessions and Identity Center enabled workgroups. This property applies only to Spark enabled workgroups and Identity Center enabled workgroups.

  • CustomerContentEncryptionConfiguration (dict) --

    Specifies the KMS key that is used to encrypt the user's data stores in Athena. This setting does not apply to Athena SQL workgroups.

    • KmsKey (string) -- [REQUIRED]

      The customer managed KMS key that is used to encrypt the user's data stores in Athena.

  • EnableMinimumEncryptionConfiguration (boolean) --

    Enforces a minimal level of encryption for the workgroup for query and calculation results that are written to Amazon S3. When enabled, workgroup users can set encryption only to the minimum level set by the administrator or higher when they submit queries.

    The EnforceWorkGroupConfiguration setting takes precedence over the EnableMinimumEncryptionConfiguration flag. This means that if EnforceWorkGroupConfiguration is true, the EnableMinimumEncryptionConfiguration flag is ignored, and the workgroup configuration for encryption is used.

  • IdentityCenterConfiguration (dict) --

    Specifies whether the workgroup is IAM Identity Center supported.

    • EnableIdentityCenter (boolean) --

      Specifies whether the workgroup is IAM Identity Center supported.

    • IdentityCenterInstanceArn (string) --

      The IAM Identity Center instance ARN that the workgroup associates to.

  • QueryResultsS3AccessGrantsConfiguration (dict) --

    Specifies whether Amazon S3 access grants are enabled for query results.

    • EnableS3AccessGrants (boolean) -- [REQUIRED]

      Specifies whether Amazon S3 access grants are enabled for query results.

    • CreateUserLevelPrefix (boolean) --

      When enabled, appends the user ID as an Amazon S3 path prefix to the query result output location.

    • AuthenticationType (string) -- [REQUIRED]

      The authentication type used for Amazon S3 access grants. Currently, only DIRECTORY_IDENTITY is supported.

type Description

string

param Description

The workgroup description.

type Tags

list

param Tags

A list of comma separated tags to add to the workgroup that is created.

  • (dict) --

    A label that you assign to a resource. Athena resources include workgroups, data catalogs, and capacity reservations. Each tag consists of a key and an optional value, both of which you define. For example, you can use tags to categorize Athena resources by purpose, owner, or environment. Use a consistent set of tag keys to make it easier to search and filter the resources in your account. For best practices, see Tagging Best Practices. Tag keys can be from 1 to 128 UTF-8 Unicode characters, and tag values can be from 0 to 256 UTF-8 Unicode characters. Tags can use letters and numbers representable in UTF-8, and the following characters: + - = . _ : / @. Tag keys and values are case-sensitive. Tag keys must be unique per resource. If you specify more than one tag, separate them by commas.

    • Key (string) --

      A tag key. The tag key length is from 1 to 128 Unicode characters in UTF-8. You can use letters and numbers representable in UTF-8, and the following characters: + - = . _ : / @. Tag keys are case-sensitive and must be unique per resource.

    • Value (string) --

      A tag value. The tag value length is from 0 to 256 Unicode characters in UTF-8. You can use letters and numbers representable in UTF-8, and the following characters: + - = . _ : / @. Tag values are case-sensitive.

rtype

dict

returns

Response Syntax

{}

Response Structure

  • (dict) --

GetDataCatalog (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Returns the specified data catalog.

See also: AWS API Documentation

Request Syntax

client.get_data_catalog(
    Name='string',
    WorkGroup='string'
)
type Name

string

param Name

[REQUIRED]

The name of the data catalog to return.

type WorkGroup

string

param WorkGroup

The name of the workgroup. Required if making an IAM Identity Center request.

rtype

dict

returns

Response Syntax

{
    'DataCatalog': {
        'Name': 'string',
        'Description': 'string',
        'Type': 'LAMBDA'|'GLUE'|'HIVE',
        'Parameters': {
            'string': 'string'
        }
    }
}

Response Structure

  • (dict) --

    • DataCatalog (dict) --

      The data catalog returned.

      • Name (string) --

        The name of the data catalog. The catalog name must be unique for the Amazon Web Services account and can use a maximum of 127 alphanumeric, underscore, at sign, or hyphen characters. The remainder of the length constraint of 256 is reserved for use by Athena.

      • Description (string) --

        An optional description of the data catalog.

      • Type (string) --

        The type of data catalog to create: LAMBDA for a federated catalog, HIVE for an external hive metastore, or GLUE for an Glue Data Catalog.

      • Parameters (dict) --

        Specifies the Lambda function or functions to use for the data catalog. This is a mapping whose values depend on the catalog type.

        • For the HIVE data catalog type, use the following syntax. The metadata-function parameter is required. The sdk-version parameter is optional and defaults to the currently supported version. metadata-function=lambda_arn, sdk-version=version_number

        • For the LAMBDA data catalog type, use one of the following sets of required parameters, but not both.

          • If you have one Lambda function that processes metadata and another for reading the actual data, use the following syntax. Both parameters are required. metadata-function=lambda_arn, record-function=lambda_arn

          • If you have a composite Lambda function that processes both metadata and data, use the following syntax to specify your Lambda function. function=lambda_arn

        • The GLUE type takes a catalog ID parameter and is required. The catalog_id is the account ID of the Amazon Web Services account to which the Glue catalog belongs. catalog-id=catalog_id

          • The GLUE data catalog type also applies to the default AwsDataCatalog that already exists in your account, of which you can have only one and cannot modify.

        • (string) --

          • (string) --

GetDatabase (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Returns a database object for the specified database and data catalog.

See also: AWS API Documentation

Request Syntax

client.get_database(
    CatalogName='string',
    DatabaseName='string',
    WorkGroup='string'
)
type CatalogName

string

param CatalogName

[REQUIRED]

The name of the data catalog that contains the database to return.

type DatabaseName

string

param DatabaseName

[REQUIRED]

The name of the database to return.

type WorkGroup

string

param WorkGroup

The name of the workgroup for which the metadata is being fetched. Required if requesting an IAM Identity Center enabled Glue Data Catalog.

rtype

dict

returns

Response Syntax

{
    'Database': {
        'Name': 'string',
        'Description': 'string',
        'Parameters': {
            'string': 'string'
        }
    }
}

Response Structure

  • (dict) --

    • Database (dict) --

      The database returned.

      • Name (string) --

        The name of the database.

      • Description (string) --

        An optional description of the database.

      • Parameters (dict) --

        A set of custom key/value pairs.

        • (string) --

          • (string) --

GetQueryExecution (updated) Link ¶
Changes (response)
{'QueryExecution': {'QueryResultsS3AccessGrantsConfiguration': {'AuthenticationType': 'DIRECTORY_IDENTITY',
                                                                'CreateUserLevelPrefix': 'boolean',
                                                                'EnableS3AccessGrants': 'boolean'}}}

Returns information about a single execution of a query if you have access to the workgroup in which the query ran. Each time a query executes, information about the query execution is saved with a unique ID.

See also: AWS API Documentation

Request Syntax

client.get_query_execution(
    QueryExecutionId='string'
)
type QueryExecutionId

string

param QueryExecutionId

[REQUIRED]

The unique ID of the query execution.

rtype

dict

returns

Response Syntax

{
    'QueryExecution': {
        'QueryExecutionId': 'string',
        'Query': 'string',
        'StatementType': 'DDL'|'DML'|'UTILITY',
        'ResultConfiguration': {
            'OutputLocation': 'string',
            'EncryptionConfiguration': {
                'EncryptionOption': 'SSE_S3'|'SSE_KMS'|'CSE_KMS',
                'KmsKey': 'string'
            },
            'ExpectedBucketOwner': 'string',
            'AclConfiguration': {
                'S3AclOption': 'BUCKET_OWNER_FULL_CONTROL'
            }
        },
        'ResultReuseConfiguration': {
            'ResultReuseByAgeConfiguration': {
                'Enabled': True|False,
                'MaxAgeInMinutes': 123
            }
        },
        'QueryExecutionContext': {
            'Database': 'string',
            'Catalog': 'string'
        },
        'Status': {
            'State': 'QUEUED'|'RUNNING'|'SUCCEEDED'|'FAILED'|'CANCELLED',
            'StateChangeReason': 'string',
            'SubmissionDateTime': datetime(2015, 1, 1),
            'CompletionDateTime': datetime(2015, 1, 1),
            'AthenaError': {
                'ErrorCategory': 123,
                'ErrorType': 123,
                'Retryable': True|False,
                'ErrorMessage': 'string'
            }
        },
        'Statistics': {
            'EngineExecutionTimeInMillis': 123,
            'DataScannedInBytes': 123,
            'DataManifestLocation': 'string',
            'TotalExecutionTimeInMillis': 123,
            'QueryQueueTimeInMillis': 123,
            'ServicePreProcessingTimeInMillis': 123,
            'QueryPlanningTimeInMillis': 123,
            'ServiceProcessingTimeInMillis': 123,
            'ResultReuseInformation': {
                'ReusedPreviousResult': True|False
            }
        },
        'WorkGroup': 'string',
        'EngineVersion': {
            'SelectedEngineVersion': 'string',
            'EffectiveEngineVersion': 'string'
        },
        'ExecutionParameters': [
            'string',
        ],
        'SubstatementType': 'string',
        'QueryResultsS3AccessGrantsConfiguration': {
            'EnableS3AccessGrants': True|False,
            'CreateUserLevelPrefix': True|False,
            'AuthenticationType': 'DIRECTORY_IDENTITY'
        }
    }
}

Response Structure

  • (dict) --

    • QueryExecution (dict) --

      Information about the query execution.

      • QueryExecutionId (string) --

        The unique identifier for each query execution.

      • Query (string) --

        The SQL query statements which the query execution ran.

      • StatementType (string) --

        The type of query statement that was run. DDL indicates DDL query statements. DML indicates DML (Data Manipulation Language) query statements, such as CREATE TABLE AS SELECT . UTILITY indicates query statements other than DDL and DML, such as SHOW CREATE TABLE , or DESCRIBE TABLE .

      • ResultConfiguration (dict) --

        The location in Amazon S3 where query and calculation results are stored and the encryption option, if any, used for query results. These are known as "client-side settings". If workgroup settings override client-side settings, then the query uses the location for the query results and the encryption configuration that are specified for the workgroup.

        • OutputLocation (string) --

          The location in Amazon S3 where your query and calculation results are stored, such as s3://path/to/query/bucket/ . To run the query, you must specify the query results location using one of the ways: either for individual queries using either this setting (client-side), or in the workgroup, using WorkGroupConfiguration. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files. If workgroup settings override client-side settings, then the query uses the settings specified for the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

        • EncryptionConfiguration (dict) --

          If query and calculation results are encrypted in Amazon S3, indicates the encryption option used (for example, SSE_KMS or CSE_KMS ) and key information. This is a client-side setting. If workgroup settings override client-side settings, then the query uses the encryption configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

          • EncryptionOption (string) --

            Indicates whether Amazon S3 server-side encryption with Amazon S3-managed keys ( SSE_S3 ), server-side encryption with KMS-managed keys ( SSE_KMS ), or client-side encryption with KMS-managed keys ( CSE_KMS ) is used.

            If a query runs in a workgroup and the workgroup overrides client-side settings, then the workgroup's setting for encryption is used. It specifies whether query results must be encrypted, for all queries that run in this workgroup.

          • KmsKey (string) --

            For SSE_KMS and CSE_KMS , this is the KMS key ARN or ID.

        • ExpectedBucketOwner (string) --

          The Amazon Web Services account ID that you expect to be the owner of the Amazon S3 bucket specified by ResultConfiguration$OutputLocation. If set, Athena uses the value for ExpectedBucketOwner when it makes Amazon S3 calls to your specified output location. If the ExpectedBucketOwner Amazon Web Services account ID does not match the actual owner of the Amazon S3 bucket, the call fails with a permissions error.

          This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ExpectedBucketOwner setting that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

        • AclConfiguration (dict) --

          Indicates that an Amazon S3 canned ACL should be set to control ownership of stored query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ACL configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. For more information, see WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

          • S3AclOption (string) --

            The Amazon S3 canned ACL that Athena should specify when storing query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . If a query runs in a workgroup and the workgroup overrides client-side settings, then the Amazon S3 canned ACL specified in the workgroup's settings is used for all queries that run in the workgroup. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 User Guide .

      • ResultReuseConfiguration (dict) --

        Specifies the query result reuse behavior that was used for the query.

        • ResultReuseByAgeConfiguration (dict) --

          Specifies whether previous query results are reused, and if so, their maximum age.

          • Enabled (boolean) --

            True if previous query results can be reused when the query is run; otherwise, false. The default is false.

          • MaxAgeInMinutes (integer) --

            Specifies, in minutes, the maximum age of a previous query result that Athena should consider for reuse. The default is 60.

      • QueryExecutionContext (dict) --

        The database in which the query execution occurred.

        • Database (string) --

          The name of the database used in the query execution. The database must exist in the catalog.

        • Catalog (string) --

          The name of the data catalog used in the query execution.

      • Status (dict) --

        The completion date, current state, submission time, and state change reason (if applicable) for the query execution.

        • State (string) --

          The state of query execution. QUEUED indicates that the query has been submitted to the service, and Athena will execute the query as soon as resources are available. RUNNING indicates that the query is in execution phase. SUCCEEDED indicates that the query completed without errors. FAILED indicates that the query experienced an error and did not complete processing. CANCELLED indicates that a user input interrupted query execution.

          Note

          Athena automatically retries your queries in cases of certain transient errors. As a result, you may see the query state transition from RUNNING or FAILED to QUEUED .

        • StateChangeReason (string) --

          Further detail about the status of the query.

        • SubmissionDateTime (datetime) --

          The date and time that the query was submitted.

        • CompletionDateTime (datetime) --

          The date and time that the query completed.

        • AthenaError (dict) --

          Provides information about an Athena query error.

          • ErrorCategory (integer) --

            An integer value that specifies the category of a query failure error. The following list shows the category for each integer value.

            1 - System

            2 - User

            3 - Other

          • ErrorType (integer) --

            An integer value that provides specific information about an Athena query error. For the meaning of specific values, see the Error Type Reference in the Amazon Athena User Guide .

          • Retryable (boolean) --

            True if the query might succeed if resubmitted.

          • ErrorMessage (string) --

            Contains a short description of the error that occurred.

      • Statistics (dict) --

        Query execution statistics, such as the amount of data scanned, the amount of time that the query took to process, and the type of statement that was run.

        • EngineExecutionTimeInMillis (integer) --

          The number of milliseconds that the query took to execute.

        • DataScannedInBytes (integer) --

          The number of bytes in the data that was queried.

        • DataManifestLocation (string) --

          The location and file name of a data manifest file. The manifest file is saved to the Athena query results location in Amazon S3. The manifest file tracks files that the query wrote to Amazon S3. If the query fails, the manifest file also tracks files that the query intended to write. The manifest is useful for identifying orphaned files resulting from a failed query. For more information, see Working with Query Results, Output Files, and Query History in the Amazon Athena User Guide .

        • TotalExecutionTimeInMillis (integer) --

          The number of milliseconds that Athena took to run the query.

        • QueryQueueTimeInMillis (integer) --

          The number of milliseconds that the query was in your query queue waiting for resources. Note that if transient errors occur, Athena might automatically add the query back to the queue.

        • ServicePreProcessingTimeInMillis (integer) --

          The number of milliseconds that Athena took to preprocess the query before submitting the query to the query engine.

        • QueryPlanningTimeInMillis (integer) --

          The number of milliseconds that Athena took to plan the query processing flow. This includes the time spent retrieving table partitions from the data source. Note that because the query engine performs the query planning, query planning time is a subset of engine processing time.

        • ServiceProcessingTimeInMillis (integer) --

          The number of milliseconds that Athena took to finalize and publish the query results after the query engine finished running the query.

        • ResultReuseInformation (dict) --

          Contains information about whether previous query results were reused for the query.

          • ReusedPreviousResult (boolean) --

            True if a previous query result was reused; false if the result was generated from a new run of the query.

      • WorkGroup (string) --

        The name of the workgroup in which the query ran.

      • EngineVersion (dict) --

        The engine version that executed the query.

        • SelectedEngineVersion (string) --

          The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

        • EffectiveEngineVersion (string) --

          Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

      • ExecutionParameters (list) --

        A list of values for the parameters in a query. The values are applied sequentially to the parameters in the query in the order in which the parameters occur. The list of parameters is not returned in the response.

        • (string) --

      • SubstatementType (string) --

        The kind of query statement that was run.

      • QueryResultsS3AccessGrantsConfiguration (dict) --

        Specifies whether Amazon S3 access grants are enabled for query results.

        • EnableS3AccessGrants (boolean) --

          Specifies whether Amazon S3 access grants are enabled for query results.

        • CreateUserLevelPrefix (boolean) --

          When enabled, appends the user ID as an Amazon S3 path prefix to the query result output location.

        • AuthenticationType (string) --

          The authentication type used for Amazon S3 access grants. Currently, only DIRECTORY_IDENTITY is supported.

GetTableMetadata (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Returns table metadata for the specified catalog, database, and table.

See also: AWS API Documentation

Request Syntax

client.get_table_metadata(
    CatalogName='string',
    DatabaseName='string',
    TableName='string',
    WorkGroup='string'
)
type CatalogName

string

param CatalogName

[REQUIRED]

The name of the data catalog that contains the database and table metadata to return.

type DatabaseName

string

param DatabaseName

[REQUIRED]

The name of the database that contains the table metadata to return.

type TableName

string

param TableName

[REQUIRED]

The name of the table for which metadata is returned.

type WorkGroup

string

param WorkGroup

The name of the workgroup for which the metadata is being fetched. Required if requesting an IAM Identity Center enabled Glue Data Catalog.

rtype

dict

returns

Response Syntax

{
    'TableMetadata': {
        'Name': 'string',
        'CreateTime': datetime(2015, 1, 1),
        'LastAccessTime': datetime(2015, 1, 1),
        'TableType': 'string',
        'Columns': [
            {
                'Name': 'string',
                'Type': 'string',
                'Comment': 'string'
            },
        ],
        'PartitionKeys': [
            {
                'Name': 'string',
                'Type': 'string',
                'Comment': 'string'
            },
        ],
        'Parameters': {
            'string': 'string'
        }
    }
}

Response Structure

  • (dict) --

    • TableMetadata (dict) --

      An object that contains table metadata.

      • Name (string) --

        The name of the table.

      • CreateTime (datetime) --

        The time that the table was created.

      • LastAccessTime (datetime) --

        The last time the table was accessed.

      • TableType (string) --

        The type of table. In Athena, only EXTERNAL_TABLE is supported.

      • Columns (list) --

        A list of the columns in the table.

        • (dict) --

          Contains metadata for a column in a table.

          • Name (string) --

            The name of the column.

          • Type (string) --

            The data type of the column.

          • Comment (string) --

            Optional information about the column.

      • PartitionKeys (list) --

        A list of the partition keys in the table.

        • (dict) --

          Contains metadata for a column in a table.

          • Name (string) --

            The name of the column.

          • Type (string) --

            The data type of the column.

          • Comment (string) --

            Optional information about the column.

      • Parameters (dict) --

        A set of custom key/value pairs for table properties.

        • (string) --

          • (string) --

GetWorkGroup (updated) Link ¶
Changes (response)
{'WorkGroup': {'Configuration': {'IdentityCenterConfiguration': {'EnableIdentityCenter': 'boolean',
                                                                 'IdentityCenterInstanceArn': 'string'},
                                 'QueryResultsS3AccessGrantsConfiguration': {'AuthenticationType': 'DIRECTORY_IDENTITY',
                                                                             'CreateUserLevelPrefix': 'boolean',
                                                                             'EnableS3AccessGrants': 'boolean'}},
               'IdentityCenterApplicationArn': 'string'}}

Returns information about the workgroup with the specified name.

See also: AWS API Documentation

Request Syntax

client.get_work_group(
    WorkGroup='string'
)
type WorkGroup

string

param WorkGroup

[REQUIRED]

The name of the workgroup.

rtype

dict

returns

Response Syntax

{
    'WorkGroup': {
        'Name': 'string',
        'State': 'ENABLED'|'DISABLED',
        'Configuration': {
            'ResultConfiguration': {
                'OutputLocation': 'string',
                'EncryptionConfiguration': {
                    'EncryptionOption': 'SSE_S3'|'SSE_KMS'|'CSE_KMS',
                    'KmsKey': 'string'
                },
                'ExpectedBucketOwner': 'string',
                'AclConfiguration': {
                    'S3AclOption': 'BUCKET_OWNER_FULL_CONTROL'
                }
            },
            'EnforceWorkGroupConfiguration': True|False,
            'PublishCloudWatchMetricsEnabled': True|False,
            'BytesScannedCutoffPerQuery': 123,
            'RequesterPaysEnabled': True|False,
            'EngineVersion': {
                'SelectedEngineVersion': 'string',
                'EffectiveEngineVersion': 'string'
            },
            'AdditionalConfiguration': 'string',
            'ExecutionRole': 'string',
            'CustomerContentEncryptionConfiguration': {
                'KmsKey': 'string'
            },
            'EnableMinimumEncryptionConfiguration': True|False,
            'IdentityCenterConfiguration': {
                'EnableIdentityCenter': True|False,
                'IdentityCenterInstanceArn': 'string'
            },
            'QueryResultsS3AccessGrantsConfiguration': {
                'EnableS3AccessGrants': True|False,
                'CreateUserLevelPrefix': True|False,
                'AuthenticationType': 'DIRECTORY_IDENTITY'
            }
        },
        'Description': 'string',
        'CreationTime': datetime(2015, 1, 1),
        'IdentityCenterApplicationArn': 'string'
    }
}

Response Structure

  • (dict) --

    • WorkGroup (dict) --

      Information about the workgroup.

      • Name (string) --

        The workgroup name.

      • State (string) --

        The state of the workgroup: ENABLED or DISABLED.

      • Configuration (dict) --

        The configuration of the workgroup, which includes the location in Amazon S3 where query and calculation results are stored, the encryption configuration, if any, used for query and calculation results; whether the Amazon CloudWatch Metrics are enabled for the workgroup; whether workgroup settings override client-side settings; and the data usage limits for the amount of data scanned per query or per workgroup. The workgroup settings override is specified in EnforceWorkGroupConfiguration (true/false) in the WorkGroupConfiguration . See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

        • ResultConfiguration (dict) --

          The configuration for the workgroup, which includes the location in Amazon S3 where query and calculation results are stored and the encryption option, if any, used for query and calculation results. To run the query, you must specify the query results location using one of the ways: either in the workgroup using this setting, or for individual queries (client-side), using ResultConfiguration$OutputLocation. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files.

          • OutputLocation (string) --

            The location in Amazon S3 where your query and calculation results are stored, such as s3://path/to/query/bucket/ . To run the query, you must specify the query results location using one of the ways: either for individual queries using either this setting (client-side), or in the workgroup, using WorkGroupConfiguration. If none of them is set, Athena issues an error that no output location is provided. For more information, see Working with query results, recent queries, and output files. If workgroup settings override client-side settings, then the query uses the settings specified for the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

          • EncryptionConfiguration (dict) --

            If query and calculation results are encrypted in Amazon S3, indicates the encryption option used (for example, SSE_KMS or CSE_KMS ) and key information. This is a client-side setting. If workgroup settings override client-side settings, then the query uses the encryption configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

            • EncryptionOption (string) --

              Indicates whether Amazon S3 server-side encryption with Amazon S3-managed keys ( SSE_S3 ), server-side encryption with KMS-managed keys ( SSE_KMS ), or client-side encryption with KMS-managed keys ( CSE_KMS ) is used.

              If a query runs in a workgroup and the workgroup overrides client-side settings, then the workgroup's setting for encryption is used. It specifies whether query results must be encrypted, for all queries that run in this workgroup.

            • KmsKey (string) --

              For SSE_KMS and CSE_KMS , this is the KMS key ARN or ID.

          • ExpectedBucketOwner (string) --

            The Amazon Web Services account ID that you expect to be the owner of the Amazon S3 bucket specified by ResultConfiguration$OutputLocation. If set, Athena uses the value for ExpectedBucketOwner when it makes Amazon S3 calls to your specified output location. If the ExpectedBucketOwner Amazon Web Services account ID does not match the actual owner of the Amazon S3 bucket, the call fails with a permissions error.

            This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ExpectedBucketOwner setting that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

          • AclConfiguration (dict) --

            Indicates that an Amazon S3 canned ACL should be set to control ownership of stored query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . This is a client-side setting. If workgroup settings override client-side settings, then the query uses the ACL configuration that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. For more information, see WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

            • S3AclOption (string) --

              The Amazon S3 canned ACL that Athena should specify when storing query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . If a query runs in a workgroup and the workgroup overrides client-side settings, then the Amazon S3 canned ACL specified in the workgroup's settings is used for all queries that run in the workgroup. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 User Guide .

        • EnforceWorkGroupConfiguration (boolean) --

          If set to "true", the settings for the workgroup override client-side settings. If set to "false", client-side settings are used. For more information, see Workgroup Settings Override Client-Side Settings.

        • PublishCloudWatchMetricsEnabled (boolean) --

          Indicates that the Amazon CloudWatch metrics are enabled for the workgroup.

        • BytesScannedCutoffPerQuery (integer) --

          The upper data usage limit (cutoff) for the amount of bytes a single query in a workgroup is allowed to scan.

        • RequesterPaysEnabled (boolean) --

          If set to true , allows members assigned to a workgroup to reference Amazon S3 Requester Pays buckets in queries. If set to false , workgroup members cannot query data from Requester Pays buckets, and queries that retrieve data from Requester Pays buckets cause an error. The default is false . For more information about Requester Pays buckets, see Requester Pays Buckets in the Amazon Simple Storage Service Developer Guide .

        • EngineVersion (dict) --

          The engine version that all queries running on the workgroup use. Queries on the AmazonAthenaPreviewFunctionality workgroup run on the preview engine regardless of this setting.

          • SelectedEngineVersion (string) --

            The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

          • EffectiveEngineVersion (string) --

            Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

        • AdditionalConfiguration (string) --

          Specifies a user defined JSON string that is passed to the notebook engine.

        • ExecutionRole (string) --

          The ARN of the execution role used to access user resources for Spark sessions and Identity Center enabled workgroups. This property applies only to Spark enabled workgroups and Identity Center enabled workgroups.

        • CustomerContentEncryptionConfiguration (dict) --

          Specifies the KMS key that is used to encrypt the user's data stores in Athena. This setting does not apply to Athena SQL workgroups.

          • KmsKey (string) --

            The customer managed KMS key that is used to encrypt the user's data stores in Athena.

        • EnableMinimumEncryptionConfiguration (boolean) --

          Enforces a minimal level of encryption for the workgroup for query and calculation results that are written to Amazon S3. When enabled, workgroup users can set encryption only to the minimum level set by the administrator or higher when they submit queries.

          The EnforceWorkGroupConfiguration setting takes precedence over the EnableMinimumEncryptionConfiguration flag. This means that if EnforceWorkGroupConfiguration is true, the EnableMinimumEncryptionConfiguration flag is ignored, and the workgroup configuration for encryption is used.

        • IdentityCenterConfiguration (dict) --

          Specifies whether the workgroup is IAM Identity Center supported.

          • EnableIdentityCenter (boolean) --

            Specifies whether the workgroup is IAM Identity Center supported.

          • IdentityCenterInstanceArn (string) --

            The IAM Identity Center instance ARN that the workgroup associates to.

        • QueryResultsS3AccessGrantsConfiguration (dict) --

          Specifies whether Amazon S3 access grants are enabled for query results.

          • EnableS3AccessGrants (boolean) --

            Specifies whether Amazon S3 access grants are enabled for query results.

          • CreateUserLevelPrefix (boolean) --

            When enabled, appends the user ID as an Amazon S3 path prefix to the query result output location.

          • AuthenticationType (string) --

            The authentication type used for Amazon S3 access grants. Currently, only DIRECTORY_IDENTITY is supported.

      • Description (string) --

        The workgroup description.

      • CreationTime (datetime) --

        The date and time the workgroup was created.

      • IdentityCenterApplicationArn (string) --

        The ARN of the IAM Identity Center enabled application associated with the workgroup.

ListDataCatalogs (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Lists the data catalogs in the current Amazon Web Services account.

Note

In the Athena console, data catalogs are listed as "data sources" on the Data sources page under the Data source name column.

See also: AWS API Documentation

Request Syntax

client.list_data_catalogs(
    NextToken='string',
    MaxResults=123,
    WorkGroup='string'
)
type NextToken

string

param NextToken

A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

type MaxResults

integer

param MaxResults

Specifies the maximum number of data catalogs to return.

type WorkGroup

string

param WorkGroup

The name of the workgroup. Required if making an IAM Identity Center request.

rtype

dict

returns

Response Syntax

{
    'DataCatalogsSummary': [
        {
            'CatalogName': 'string',
            'Type': 'LAMBDA'|'GLUE'|'HIVE'
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • DataCatalogsSummary (list) --

      A summary list of data catalogs.

      • (dict) --

        The summary information for the data catalog, which includes its name and type.

        • CatalogName (string) --

          The name of the data catalog. The catalog name is unique for the Amazon Web Services account and can use a maximum of 127 alphanumeric, underscore, at sign, or hyphen characters. The remainder of the length constraint of 256 is reserved for use by Athena.

        • Type (string) --

          The data catalog type.

    • NextToken (string) --

      A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

ListDatabases (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Lists the databases in the specified data catalog.

See also: AWS API Documentation

Request Syntax

client.list_databases(
    CatalogName='string',
    NextToken='string',
    MaxResults=123,
    WorkGroup='string'
)
type CatalogName

string

param CatalogName

[REQUIRED]

The name of the data catalog that contains the databases to return.

type NextToken

string

param NextToken

A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

type MaxResults

integer

param MaxResults

Specifies the maximum number of results to return.

type WorkGroup

string

param WorkGroup

The name of the workgroup for which the metadata is being fetched. Required if requesting an IAM Identity Center enabled Glue Data Catalog.

rtype

dict

returns

Response Syntax

{
    'DatabaseList': [
        {
            'Name': 'string',
            'Description': 'string',
            'Parameters': {
                'string': 'string'
            }
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • DatabaseList (list) --

      A list of databases from a data catalog.

      • (dict) --

        Contains metadata information for a database in a data catalog.

        • Name (string) --

          The name of the database.

        • Description (string) --

          An optional description of the database.

        • Parameters (dict) --

          A set of custom key/value pairs.

          • (string) --

            • (string) --

    • NextToken (string) --

      A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

ListTableMetadata (updated) Link ¶
Changes (request)
{'WorkGroup': 'string'}

Lists the metadata for the tables in the specified data catalog database.

See also: AWS API Documentation

Request Syntax

client.list_table_metadata(
    CatalogName='string',
    DatabaseName='string',
    Expression='string',
    NextToken='string',
    MaxResults=123,
    WorkGroup='string'
)
type CatalogName

string

param CatalogName

[REQUIRED]

The name of the data catalog for which table metadata should be returned.

type DatabaseName

string

param DatabaseName

[REQUIRED]

The name of the database for which table metadata should be returned.

type Expression

string

param Expression

A regex filter that pattern-matches table names. If no expression is supplied, metadata for all tables are listed.

type NextToken

string

param NextToken

A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

type MaxResults

integer

param MaxResults

Specifies the maximum number of results to return.

type WorkGroup

string

param WorkGroup

The name of the workgroup for which the metadata is being fetched. Required if requesting an IAM Identity Center enabled Glue Data Catalog.

rtype

dict

returns

Response Syntax

{
    'TableMetadataList': [
        {
            'Name': 'string',
            'CreateTime': datetime(2015, 1, 1),
            'LastAccessTime': datetime(2015, 1, 1),
            'TableType': 'string',
            'Columns': [
                {
                    'Name': 'string',
                    'Type': 'string',
                    'Comment': 'string'
                },
            ],
            'PartitionKeys': [
                {
                    'Name': 'string',
                    'Type': 'string',
                    'Comment': 'string'
                },
            ],
            'Parameters': {
                'string': 'string'
            }
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • TableMetadataList (list) --

      A list of table metadata.

      • (dict) --

        Contains metadata for a table.

        • Name (string) --

          The name of the table.

        • CreateTime (datetime) --

          The time that the table was created.

        • LastAccessTime (datetime) --

          The last time the table was accessed.

        • TableType (string) --

          The type of table. In Athena, only EXTERNAL_TABLE is supported.

        • Columns (list) --

          A list of the columns in the table.

          • (dict) --

            Contains metadata for a column in a table.

            • Name (string) --

              The name of the column.

            • Type (string) --

              The data type of the column.

            • Comment (string) --

              Optional information about the column.

        • PartitionKeys (list) --

          A list of the partition keys in the table.

          • (dict) --

            Contains metadata for a column in a table.

            • Name (string) --

              The name of the column.

            • Type (string) --

              The data type of the column.

            • Comment (string) --

              Optional information about the column.

        • Parameters (dict) --

          A set of custom key/value pairs for table properties.

          • (string) --

            • (string) --

    • NextToken (string) --

      A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

ListWorkGroups (updated) Link ¶
Changes (response)
{'WorkGroups': {'IdentityCenterApplicationArn': 'string'}}

Lists available workgroups for the account.

See also: AWS API Documentation

Request Syntax

client.list_work_groups(
    NextToken='string',
    MaxResults=123
)
type NextToken

string

param NextToken

A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

type MaxResults

integer

param MaxResults

The maximum number of workgroups to return in this request.

rtype

dict

returns

Response Syntax

{
    'WorkGroups': [
        {
            'Name': 'string',
            'State': 'ENABLED'|'DISABLED',
            'Description': 'string',
            'CreationTime': datetime(2015, 1, 1),
            'EngineVersion': {
                'SelectedEngineVersion': 'string',
                'EffectiveEngineVersion': 'string'
            },
            'IdentityCenterApplicationArn': 'string'
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • WorkGroups (list) --

      A list of WorkGroupSummary objects that include the names, descriptions, creation times, and states for each workgroup.

      • (dict) --

        The summary information for the workgroup, which includes its name, state, description, and the date and time it was created.

        • Name (string) --

          The name of the workgroup.

        • State (string) --

          The state of the workgroup.

        • Description (string) --

          The workgroup description.

        • CreationTime (datetime) --

          The workgroup creation date and time.

        • EngineVersion (dict) --

          The engine version setting for all queries on the workgroup. Queries on the AmazonAthenaPreviewFunctionality workgroup run on the preview engine regardless of this setting.

          • SelectedEngineVersion (string) --

            The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

          • EffectiveEngineVersion (string) --

            Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

        • IdentityCenterApplicationArn (string) --

          The ARN of the IAM Identity Center enabled application associated with the workgroup.

    • NextToken (string) --

      A token generated by the Athena service that specifies where to continue pagination if a previous request was truncated. To obtain the next set of pages, pass in the NextToken from the response object of the previous page call.

UpdateWorkGroup (updated) Link ¶
Changes (request)
{'ConfigurationUpdates': {'QueryResultsS3AccessGrantsConfiguration': {'AuthenticationType': 'DIRECTORY_IDENTITY',
                                                                      'CreateUserLevelPrefix': 'boolean',
                                                                      'EnableS3AccessGrants': 'boolean'}}}

Updates the workgroup with the specified name. The workgroup's name cannot be changed. Only ConfigurationUpdates can be specified.

See also: AWS API Documentation

Request Syntax

client.update_work_group(
    WorkGroup='string',
    Description='string',
    ConfigurationUpdates={
        'EnforceWorkGroupConfiguration': True|False,
        'ResultConfigurationUpdates': {
            'OutputLocation': 'string',
            'RemoveOutputLocation': True|False,
            'EncryptionConfiguration': {
                'EncryptionOption': 'SSE_S3'|'SSE_KMS'|'CSE_KMS',
                'KmsKey': 'string'
            },
            'RemoveEncryptionConfiguration': True|False,
            'ExpectedBucketOwner': 'string',
            'RemoveExpectedBucketOwner': True|False,
            'AclConfiguration': {
                'S3AclOption': 'BUCKET_OWNER_FULL_CONTROL'
            },
            'RemoveAclConfiguration': True|False
        },
        'PublishCloudWatchMetricsEnabled': True|False,
        'BytesScannedCutoffPerQuery': 123,
        'RemoveBytesScannedCutoffPerQuery': True|False,
        'RequesterPaysEnabled': True|False,
        'EngineVersion': {
            'SelectedEngineVersion': 'string',
            'EffectiveEngineVersion': 'string'
        },
        'RemoveCustomerContentEncryptionConfiguration': True|False,
        'AdditionalConfiguration': 'string',
        'ExecutionRole': 'string',
        'CustomerContentEncryptionConfiguration': {
            'KmsKey': 'string'
        },
        'EnableMinimumEncryptionConfiguration': True|False,
        'QueryResultsS3AccessGrantsConfiguration': {
            'EnableS3AccessGrants': True|False,
            'CreateUserLevelPrefix': True|False,
            'AuthenticationType': 'DIRECTORY_IDENTITY'
        }
    },
    State='ENABLED'|'DISABLED'
)
type WorkGroup

string

param WorkGroup

[REQUIRED]

The specified workgroup that will be updated.

type Description

string

param Description

The workgroup description.

type ConfigurationUpdates

dict

param ConfigurationUpdates

Contains configuration updates for an Athena SQL workgroup.

  • EnforceWorkGroupConfiguration (boolean) --

    If set to "true", the settings for the workgroup override client-side settings. If set to "false" client-side settings are used. For more information, see Workgroup Settings Override Client-Side Settings.

  • ResultConfigurationUpdates (dict) --

    The result configuration information about the queries in this workgroup that will be updated. Includes the updated results location and an updated option for encrypting query results.

    • OutputLocation (string) --

      The location in Amazon S3 where your query and calculation results are stored, such as s3://path/to/query/bucket/ . For more information, see Working with query results, recent queries, and output files. If workgroup settings override client-side settings, then the query uses the location for the query results and the encryption configuration that are specified for the workgroup. The "workgroup settings override" is specified in EnforceWorkGroupConfiguration (true/false) in the WorkGroupConfiguration . See WorkGroupConfiguration$EnforceWorkGroupConfiguration.

    • RemoveOutputLocation (boolean) --

      If set to "true", indicates that the previously-specified query results location (also known as a client-side setting) for queries in this workgroup should be ignored and set to null. If set to "false" or not set, and a value is present in the OutputLocation in ResultConfigurationUpdates (the client-side setting), the OutputLocation in the workgroup's ResultConfiguration will be updated with the new value. For more information, see Workgroup Settings Override Client-Side Settings.

    • EncryptionConfiguration (dict) --

      The encryption configuration for query and calculation results.

      • EncryptionOption (string) -- [REQUIRED]

        Indicates whether Amazon S3 server-side encryption with Amazon S3-managed keys ( SSE_S3 ), server-side encryption with KMS-managed keys ( SSE_KMS ), or client-side encryption with KMS-managed keys ( CSE_KMS ) is used.

        If a query runs in a workgroup and the workgroup overrides client-side settings, then the workgroup's setting for encryption is used. It specifies whether query results must be encrypted, for all queries that run in this workgroup.

      • KmsKey (string) --

        For SSE_KMS and CSE_KMS , this is the KMS key ARN or ID.

    • RemoveEncryptionConfiguration (boolean) --

      If set to "true", indicates that the previously-specified encryption configuration (also known as the client-side setting) for queries in this workgroup should be ignored and set to null. If set to "false" or not set, and a value is present in the EncryptionConfiguration in ResultConfigurationUpdates (the client-side setting), the EncryptionConfiguration in the workgroup's ResultConfiguration will be updated with the new value. For more information, see Workgroup Settings Override Client-Side Settings.

    • ExpectedBucketOwner (string) --

      The Amazon Web Services account ID that you expect to be the owner of the Amazon S3 bucket specified by ResultConfiguration$OutputLocation. If set, Athena uses the value for ExpectedBucketOwner when it makes Amazon S3 calls to your specified output location. If the ExpectedBucketOwner Amazon Web Services account ID does not match the actual owner of the Amazon S3 bucket, the call fails with a permissions error.

      If workgroup settings override client-side settings, then the query uses the ExpectedBucketOwner setting that is specified for the workgroup, and also uses the location for storing query results specified in the workgroup. See WorkGroupConfiguration$EnforceWorkGroupConfiguration and Workgroup Settings Override Client-Side Settings.

    • RemoveExpectedBucketOwner (boolean) --

      If set to "true", removes the Amazon Web Services account ID previously specified for ResultConfiguration$ExpectedBucketOwner. If set to "false" or not set, and a value is present in the ExpectedBucketOwner in ResultConfigurationUpdates (the client-side setting), the ExpectedBucketOwner in the workgroup's ResultConfiguration is updated with the new value. For more information, see Workgroup Settings Override Client-Side Settings.

    • AclConfiguration (dict) --

      The ACL configuration for the query results.

      • S3AclOption (string) -- [REQUIRED]

        The Amazon S3 canned ACL that Athena should specify when storing query results. Currently the only supported canned ACL is BUCKET_OWNER_FULL_CONTROL . If a query runs in a workgroup and the workgroup overrides client-side settings, then the Amazon S3 canned ACL specified in the workgroup's settings is used for all queries that run in the workgroup. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 User Guide .

    • RemoveAclConfiguration (boolean) --

      If set to true , indicates that the previously-specified ACL configuration for queries in this workgroup should be ignored and set to null. If set to false or not set, and a value is present in the AclConfiguration of ResultConfigurationUpdates , the AclConfiguration in the workgroup's ResultConfiguration is updated with the new value. For more information, see Workgroup Settings Override Client-Side Settings.

  • PublishCloudWatchMetricsEnabled (boolean) --

    Indicates whether this workgroup enables publishing metrics to Amazon CloudWatch.

  • BytesScannedCutoffPerQuery (integer) --

    The upper limit (cutoff) for the amount of bytes a single query in a workgroup is allowed to scan.

  • RemoveBytesScannedCutoffPerQuery (boolean) --

    Indicates that the data usage control limit per query is removed. WorkGroupConfiguration$BytesScannedCutoffPerQuery

  • RequesterPaysEnabled (boolean) --

    If set to true , allows members assigned to a workgroup to specify Amazon S3 Requester Pays buckets in queries. If set to false , workgroup members cannot query data from Requester Pays buckets, and queries that retrieve data from Requester Pays buckets cause an error. The default is false . For more information about Requester Pays buckets, see Requester Pays Buckets in the Amazon Simple Storage Service Developer Guide .

  • EngineVersion (dict) --

    The engine version requested when a workgroup is updated. After the update, all queries on the workgroup run on the requested engine version. If no value was previously set, the default is Auto. Queries on the AmazonAthenaPreviewFunctionality workgroup run on the preview engine regardless of this setting.

    • SelectedEngineVersion (string) --

      The engine version requested by the user. Possible values are determined by the output of ListEngineVersions , including AUTO. The default is AUTO.

    • EffectiveEngineVersion (string) --

      Read only. The engine version on which the query runs. If the user requests a valid engine version other than Auto, the effective engine version is the same as the engine version that the user requested. If the user requests Auto, the effective engine version is chosen by Athena. When a request to update the engine version is made by a CreateWorkGroup or UpdateWorkGroup operation, the EffectiveEngineVersion field is ignored.

  • RemoveCustomerContentEncryptionConfiguration (boolean) --

    Removes content encryption configuration from an Apache Spark-enabled Athena workgroup.

  • AdditionalConfiguration (string) --

    Contains a user defined string in JSON format for a Spark-enabled workgroup.

  • ExecutionRole (string) --

    The ARN of the execution role used to access user resources for Spark sessions and Identity Center enabled workgroups. This property applies only to Spark enabled workgroups and Identity Center enabled workgroups.

  • CustomerContentEncryptionConfiguration (dict) --

    Specifies the customer managed KMS key that is used to encrypt the user's data stores in Athena. When an Amazon Web Services managed key is used, this value is null. This setting does not apply to Athena SQL workgroups.

    • KmsKey (string) -- [REQUIRED]

      The customer managed KMS key that is used to encrypt the user's data stores in Athena.

  • EnableMinimumEncryptionConfiguration (boolean) --

    Enforces a minimal level of encryption for the workgroup for query and calculation results that are written to Amazon S3. When enabled, workgroup users can set encryption only to the minimum level set by the administrator or higher when they submit queries. This setting does not apply to Spark-enabled workgroups.

    The EnforceWorkGroupConfiguration setting takes precedence over the EnableMinimumEncryptionConfiguration flag. This means that if EnforceWorkGroupConfiguration is true, the EnableMinimumEncryptionConfiguration flag is ignored, and the workgroup configuration for encryption is used.

  • QueryResultsS3AccessGrantsConfiguration (dict) --

    Specifies whether Amazon S3 access grants are enabled for query results.

    • EnableS3AccessGrants (boolean) -- [REQUIRED]

      Specifies whether Amazon S3 access grants are enabled for query results.

    • CreateUserLevelPrefix (boolean) --

      When enabled, appends the user ID as an Amazon S3 path prefix to the query result output location.

    • AuthenticationType (string) -- [REQUIRED]

      The authentication type used for Amazon S3 access grants. Currently, only DIRECTORY_IDENTITY is supported.

type State

string

param State

The workgroup state that will be updated for the given workgroup.

rtype

dict

returns

Response Syntax

{}

Response Structure

  • (dict) --