AWS Glue

2024/09/23 - AWS Glue - 4 updated api methods

Changes  Added AthenaProperties parameter to Glue Connections, allowing Athena to store service specific properties on Glue Connections.

CreateConnection (updated) Link ¶
Changes (request)
{'ConnectionInput': {'AthenaProperties': {'string': 'string'}}}

Creates a connection definition in the Data Catalog.

Connections used for creating federated resources require the IAM glue:PassConnection permission.

See also: AWS API Documentation

Request Syntax

client.create_connection(
    CatalogId='string',
    ConnectionInput={
        'Name': 'string',
        'Description': 'string',
        'ConnectionType': 'JDBC'|'SFTP'|'MONGODB'|'KAFKA'|'NETWORK'|'MARKETPLACE'|'CUSTOM'|'SALESFORCE'|'VIEW_VALIDATION_REDSHIFT'|'VIEW_VALIDATION_ATHENA',
        'MatchCriteria': [
            'string',
        ],
        'ConnectionProperties': {
            'string': 'string'
        },
        'AthenaProperties': {
            'string': 'string'
        },
        'PhysicalConnectionRequirements': {
            'SubnetId': 'string',
            'SecurityGroupIdList': [
                'string',
            ],
            'AvailabilityZone': 'string'
        },
        'AuthenticationConfiguration': {
            'AuthenticationType': 'BASIC'|'OAUTH2'|'CUSTOM',
            'OAuth2Properties': {
                'OAuth2GrantType': 'AUTHORIZATION_CODE'|'CLIENT_CREDENTIALS'|'JWT_BEARER',
                'OAuth2ClientApplication': {
                    'UserManagedClientApplicationClientId': 'string',
                    'AWSManagedClientApplicationReference': 'string'
                },
                'TokenUrl': 'string',
                'TokenUrlParametersMap': {
                    'string': 'string'
                },
                'AuthorizationCodeProperties': {
                    'AuthorizationCode': 'string',
                    'RedirectUri': 'string'
                }
            },
            'SecretArn': 'string'
        },
        'ValidateCredentials': True|False
    },
    Tags={
        'string': 'string'
    }
)
type CatalogId

string

param CatalogId

The ID of the Data Catalog in which to create the connection. If none is provided, the Amazon Web Services account ID is used by default.

type ConnectionInput

dict

param ConnectionInput

[REQUIRED]

A ConnectionInput object defining the connection to create.

  • Name (string) -- [REQUIRED]

    The name of the connection.

  • Description (string) --

    The description of the connection.

  • ConnectionType (string) -- [REQUIRED]

    The type of the connection. Currently, these types are supported:

    • JDBC - Designates a connection to a database through Java Database Connectivity (JDBC). JDBC Connections use the following ConnectionParameters.

      • Required: All of ( HOST , PORT , JDBC_ENGINE ) or JDBC_CONNECTION_URL .

      • Required: All of ( USERNAME , PASSWORD ) or SECRET_ID .

      • Optional: JDBC_ENFORCE_SSL , CUSTOM_JDBC_CERT , CUSTOM_JDBC_CERT_STRING , SKIP_CUSTOM_JDBC_CERT_VALIDATION . These parameters are used to configure SSL with JDBC.

    • KAFKA - Designates a connection to an Apache Kafka streaming platform. KAFKA Connections use the following ConnectionParameters.

      • Required: KAFKA_BOOTSTRAP_SERVERS .

      • Optional: KAFKA_SSL_ENABLED , KAFKA_CUSTOM_CERT , KAFKA_SKIP_CUSTOM_CERT_VALIDATION . These parameters are used to configure SSL with KAFKA .

      • Optional: KAFKA_CLIENT_KEYSTORE , KAFKA_CLIENT_KEYSTORE_PASSWORD , KAFKA_CLIENT_KEY_PASSWORD , ENCRYPTED_KAFKA_CLIENT_KEYSTORE_PASSWORD , ENCRYPTED_KAFKA_CLIENT_KEY_PASSWORD . These parameters are used to configure TLS client configuration with SSL in KAFKA .

      • Optional: KAFKA_SASL_MECHANISM . Can be specified as SCRAM-SHA-512 , GSSAPI , or AWS_MSK_IAM .

      • Optional: KAFKA_SASL_SCRAM_USERNAME , KAFKA_SASL_SCRAM_PASSWORD , ENCRYPTED_KAFKA_SASL_SCRAM_PASSWORD . These parameters are used to configure SASL/SCRAM-SHA-512 authentication with KAFKA .

      • Optional: KAFKA_SASL_GSSAPI_KEYTAB , KAFKA_SASL_GSSAPI_KRB5_CONF , KAFKA_SASL_GSSAPI_SERVICE , KAFKA_SASL_GSSAPI_PRINCIPAL . These parameters are used to configure SASL/GSSAPI authentication with KAFKA .

    • MONGODB - Designates a connection to a MongoDB document database. MONGODB Connections use the following ConnectionParameters.

      • Required: CONNECTION_URL .

      • Required: All of ( USERNAME , PASSWORD ) or SECRET_ID .

    • SALESFORCE - Designates a connection to Salesforce using OAuth authencation.

      • Requires the AuthenticationConfiguration member to be configured.

    • VIEW_VALIDATION_REDSHIFT - Designates a connection used for view validation by Amazon Redshift.

    • VIEW_VALIDATION_ATHENA - Designates a connection used for view validation by Amazon Athena.

    • NETWORK - Designates a network connection to a data source within an Amazon Virtual Private Cloud environment (Amazon VPC). NETWORK Connections do not require ConnectionParameters. Instead, provide a PhysicalConnectionRequirements.

    • MARKETPLACE - Uses configuration settings contained in a connector purchased from Amazon Web Services Marketplace to read from and write to data stores that are not natively supported by Glue. MARKETPLACE Connections use the following ConnectionParameters.

      • Required: CONNECTOR_TYPE , CONNECTOR_URL , CONNECTOR_CLASS_NAME , CONNECTION_URL .

      • Required for JDBC CONNECTOR_TYPE connections: All of ( USERNAME , PASSWORD ) or SECRET_ID .

    • CUSTOM - Uses configuration settings contained in a custom connector to read from and write to data stores that are not natively supported by Glue.

    SFTP is not supported.

    For more information about how optional ConnectionProperties are used to configure features in Glue, consult Glue connection properties.

    For more information about how optional ConnectionProperties are used to configure features in Glue Studio, consult Using connectors and connections.

  • MatchCriteria (list) --

    A list of criteria that can be used in selecting this connection.

    • (string) --

  • ConnectionProperties (dict) -- [REQUIRED]

    These key-value pairs define parameters for the connection.

    • (string) --

      • (string) --

  • AthenaProperties (dict) --

    This field is not currently used.

    • (string) --

      • (string) --

  • PhysicalConnectionRequirements (dict) --

    The physical connection requirements, such as virtual private cloud (VPC) and SecurityGroup , that are needed to successfully make this connection.

    • SubnetId (string) --

      The subnet ID used by the connection.

    • SecurityGroupIdList (list) --

      The security group ID list used by the connection.

      • (string) --

    • AvailabilityZone (string) --

      The connection's Availability Zone.

  • AuthenticationConfiguration (dict) --

    The authentication properties of the connection. Used for a Salesforce connection.

    • AuthenticationType (string) --

      A structure containing the authentication configuration in the CreateConnection request.

    • OAuth2Properties (dict) --

      The properties for OAuth2 authentication in the CreateConnection request.

      • OAuth2GrantType (string) --

        The OAuth2 grant type in the CreateConnection request. For example, AUTHORIZATION_CODE , JWT_BEARER , or CLIENT_CREDENTIALS .

      • OAuth2ClientApplication (dict) --

        The client application type in the CreateConnection request. For example, AWS_MANAGED or USER_MANAGED .

        • UserManagedClientApplicationClientId (string) --

          The client application clientID if the ClientAppType is USER_MANAGED .

        • AWSManagedClientApplicationReference (string) --

          The reference to the SaaS-side client app that is Amazon Web Services managed.

      • TokenUrl (string) --

        The URL of the provider's authentication server, to exchange an authorization code for an access token.

      • TokenUrlParametersMap (dict) --

        A map of parameters that are added to the token GET request.

        • (string) --

          • (string) --

      • AuthorizationCodeProperties (dict) --

        The set of properties required for the the OAuth2 AUTHORIZATION_CODE grant type.

        • AuthorizationCode (string) --

          An authorization code to be used in the third leg of the AUTHORIZATION_CODE grant workflow. This is a single-use code which becomes invalid once exchanged for an access token, thus it is acceptable to have this value as a request parameter.

        • RedirectUri (string) --

          The redirect URI where the user gets redirected to by authorization server when issuing an authorization code. The URI is subsequently used when the authorization code is exchanged for an access token.

    • SecretArn (string) --

      The secret manager ARN to store credentials in the CreateConnection request.

  • ValidateCredentials (boolean) --

    A flag to validate the credentials during create connection. Used for a Salesforce connection. Default is true.

type Tags

dict

param Tags

The tags you assign to the connection.

  • (string) --

    • (string) --

rtype

dict

returns

Response Syntax

{
    'CreateConnectionStatus': 'READY'|'IN_PROGRESS'|'FAILED'
}

Response Structure

  • (dict) --

    • CreateConnectionStatus (string) --

      The status of the connection creation request. The request can take some time for certain authentication types, for example when creating an OAuth connection with token exchange over VPC.

GetConnection (updated) Link ¶
Changes (response)
{'Connection': {'AthenaProperties': {'string': 'string'}}}

Retrieves a connection definition from the Data Catalog.

See also: AWS API Documentation

Request Syntax

client.get_connection(
    CatalogId='string',
    Name='string',
    HidePassword=True|False
)
type CatalogId

string

param CatalogId

The ID of the Data Catalog in which the connection resides. If none is provided, the Amazon Web Services account ID is used by default.

type Name

string

param Name

[REQUIRED]

The name of the connection definition to retrieve.

type HidePassword

boolean

param HidePassword

Allows you to retrieve the connection metadata without returning the password. For instance, the Glue console uses this flag to retrieve the connection, and does not display the password. Set this parameter when the caller might not have permission to use the KMS key to decrypt the password, but it does have permission to access the rest of the connection properties.

rtype

dict

returns

Response Syntax

{
    'Connection': {
        'Name': 'string',
        'Description': 'string',
        'ConnectionType': 'JDBC'|'SFTP'|'MONGODB'|'KAFKA'|'NETWORK'|'MARKETPLACE'|'CUSTOM'|'SALESFORCE'|'VIEW_VALIDATION_REDSHIFT'|'VIEW_VALIDATION_ATHENA',
        'MatchCriteria': [
            'string',
        ],
        'ConnectionProperties': {
            'string': 'string'
        },
        'AthenaProperties': {
            'string': 'string'
        },
        'PhysicalConnectionRequirements': {
            'SubnetId': 'string',
            'SecurityGroupIdList': [
                'string',
            ],
            'AvailabilityZone': 'string'
        },
        'CreationTime': datetime(2015, 1, 1),
        'LastUpdatedTime': datetime(2015, 1, 1),
        'LastUpdatedBy': 'string',
        'Status': 'READY'|'IN_PROGRESS'|'FAILED',
        'StatusReason': 'string',
        'LastConnectionValidationTime': datetime(2015, 1, 1),
        'AuthenticationConfiguration': {
            'AuthenticationType': 'BASIC'|'OAUTH2'|'CUSTOM',
            'SecretArn': 'string',
            'OAuth2Properties': {
                'OAuth2GrantType': 'AUTHORIZATION_CODE'|'CLIENT_CREDENTIALS'|'JWT_BEARER',
                'OAuth2ClientApplication': {
                    'UserManagedClientApplicationClientId': 'string',
                    'AWSManagedClientApplicationReference': 'string'
                },
                'TokenUrl': 'string',
                'TokenUrlParametersMap': {
                    'string': 'string'
                }
            }
        }
    }
}

Response Structure

  • (dict) --

    • Connection (dict) --

      The requested connection definition.

      • Name (string) --

        The name of the connection definition.

      • Description (string) --

        The description of the connection.

      • ConnectionType (string) --

        The type of the connection. Currently, SFTP is not supported.

      • MatchCriteria (list) --

        A list of criteria that can be used in selecting this connection.

        • (string) --

      • ConnectionProperties (dict) --

        These key-value pairs define parameters for the connection:

        • HOST - The host URI: either the fully qualified domain name (FQDN) or the IPv4 address of the database host.

        • PORT - The port number, between 1024 and 65535, of the port on which the database host is listening for database connections.

        • USER_NAME - The name under which to log in to the database. The value string for USER_NAME is " USERNAME ".

        • PASSWORD - A password, if one is used, for the user name.

        • ENCRYPTED_PASSWORD - When you enable connection password protection by setting ConnectionPasswordEncryption in the Data Catalog encryption settings, this field stores the encrypted password.

        • JDBC_DRIVER_JAR_URI - The Amazon Simple Storage Service (Amazon S3) path of the JAR file that contains the JDBC driver to use.

        • JDBC_DRIVER_CLASS_NAME - The class name of the JDBC driver to use.

        • JDBC_ENGINE - The name of the JDBC engine to use.

        • JDBC_ENGINE_VERSION - The version of the JDBC engine to use.

        • CONFIG_FILES - (Reserved for future use.)

        • INSTANCE_ID - The instance ID to use.

        • JDBC_CONNECTION_URL - The URL for connecting to a JDBC data source.

        • JDBC_ENFORCE_SSL - A Boolean string (true, false) specifying whether Secure Sockets Layer (SSL) with hostname matching is enforced for the JDBC connection on the client. The default is false.

        • CUSTOM_JDBC_CERT - An Amazon S3 location specifying the customer's root certificate. Glue uses this root certificate to validate the customer’s certificate when connecting to the customer database. Glue only handles X.509 certificates. The certificate provided must be DER-encoded and supplied in Base64 encoding PEM format.

        • SKIP_CUSTOM_JDBC_CERT_VALIDATION - By default, this is false . Glue validates the Signature algorithm and Subject Public Key Algorithm for the customer certificate. The only permitted algorithms for the Signature algorithm are SHA256withRSA, SHA384withRSA or SHA512withRSA. For the Subject Public Key Algorithm, the key length must be at least 2048. You can set the value of this property to true to skip Glue’s validation of the customer certificate.

        • CUSTOM_JDBC_CERT_STRING - A custom JDBC certificate string which is used for domain match or distinguished name match to prevent a man-in-the-middle attack. In Oracle database, this is used as the SSL_SERVER_CERT_DN ; in Microsoft SQL Server, this is used as the hostNameInCertificate .

        • CONNECTION_URL - The URL for connecting to a general (non-JDBC) data source.

        • SECRET_ID - The secret ID used for the secret manager of credentials.

        • CONNECTOR_URL - The connector URL for a MARKETPLACE or CUSTOM connection.

        • CONNECTOR_TYPE - The connector type for a MARKETPLACE or CUSTOM connection.

        • CONNECTOR_CLASS_NAME - The connector class name for a MARKETPLACE or CUSTOM connection.

        • KAFKA_BOOTSTRAP_SERVERS - A comma-separated list of host and port pairs that are the addresses of the Apache Kafka brokers in a Kafka cluster to which a Kafka client will connect to and bootstrap itself.

        • KAFKA_SSL_ENABLED - Whether to enable or disable SSL on an Apache Kafka connection. Default value is "true".

        • KAFKA_CUSTOM_CERT - The Amazon S3 URL for the private CA cert file (.pem format). The default is an empty string.

        • KAFKA_SKIP_CUSTOM_CERT_VALIDATION - Whether to skip the validation of the CA cert file or not. Glue validates for three algorithms: SHA256withRSA, SHA384withRSA and SHA512withRSA. Default value is "false".

        • KAFKA_CLIENT_KEYSTORE - The Amazon S3 location of the client keystore file for Kafka client side authentication (Optional).

        • KAFKA_CLIENT_KEYSTORE_PASSWORD - The password to access the provided keystore (Optional).

        • KAFKA_CLIENT_KEY_PASSWORD - A keystore can consist of multiple keys, so this is the password to access the client key to be used with the Kafka server side key (Optional).

        • ENCRYPTED_KAFKA_CLIENT_KEYSTORE_PASSWORD - The encrypted version of the Kafka client keystore password (if the user has the Glue encrypt passwords setting selected).

        • ENCRYPTED_KAFKA_CLIENT_KEY_PASSWORD - The encrypted version of the Kafka client key password (if the user has the Glue encrypt passwords setting selected).

        • KAFKA_SASL_MECHANISM - "SCRAM-SHA-512" , "GSSAPI" , "AWS_MSK_IAM" , or "PLAIN" . These are the supported SASL Mechanisms.

        • KAFKA_SASL_PLAIN_USERNAME - A plaintext username used to authenticate with the "PLAIN" mechanism.

        • KAFKA_SASL_PLAIN_PASSWORD - A plaintext password used to authenticate with the "PLAIN" mechanism.

        • ENCRYPTED_KAFKA_SASL_PLAIN_PASSWORD - The encrypted version of the Kafka SASL PLAIN password (if the user has the Glue encrypt passwords setting selected).

        • KAFKA_SASL_SCRAM_USERNAME - A plaintext username used to authenticate with the "SCRAM-SHA-512" mechanism.

        • KAFKA_SASL_SCRAM_PASSWORD - A plaintext password used to authenticate with the "SCRAM-SHA-512" mechanism.

        • ENCRYPTED_KAFKA_SASL_SCRAM_PASSWORD - The encrypted version of the Kafka SASL SCRAM password (if the user has the Glue encrypt passwords setting selected).

        • KAFKA_SASL_SCRAM_SECRETS_ARN - The Amazon Resource Name of a secret in Amazon Web Services Secrets Manager.

        • KAFKA_SASL_GSSAPI_KEYTAB - The S3 location of a Kerberos keytab file. A keytab stores long-term keys for one or more principals. For more information, see MIT Kerberos Documentation: Keytab.

        • KAFKA_SASL_GSSAPI_KRB5_CONF - The S3 location of a Kerberos krb5.conf file. A krb5.conf stores Kerberos configuration information, such as the location of the KDC server. For more information, see MIT Kerberos Documentation: krb5.conf.

        • KAFKA_SASL_GSSAPI_SERVICE - The Kerberos service name, as set with sasl.kerberos.service.name in your Kafka Configuration.

        • KAFKA_SASL_GSSAPI_PRINCIPAL - The name of the Kerberos princial used by Glue. For more information, see Kafka Documentation: Configuring Kafka Brokers.

        • ROLE_ARN - The role to be used for running queries.

        • REGION - The Amazon Web Services Region where queries will be run.

        • WORKGROUP_NAME - The name of an Amazon Redshift serverless workgroup or Amazon Athena workgroup in which queries will run.

        • CLUSTER_IDENTIFIER - The cluster identifier of an Amazon Redshift cluster in which queries will run.

        • DATABASE - The Amazon Redshift database that you are connecting to.

        • (string) --

          • (string) --

      • AthenaProperties (dict) --

        This field is not currently used.

        • (string) --

          • (string) --

      • PhysicalConnectionRequirements (dict) --

        The physical connection requirements, such as virtual private cloud (VPC) and SecurityGroup , that are needed to make this connection successfully.

        • SubnetId (string) --

          The subnet ID used by the connection.

        • SecurityGroupIdList (list) --

          The security group ID list used by the connection.

          • (string) --

        • AvailabilityZone (string) --

          The connection's Availability Zone.

      • CreationTime (datetime) --

        The timestamp of the time that this connection definition was created.

      • LastUpdatedTime (datetime) --

        The timestamp of the last time the connection definition was updated.

      • LastUpdatedBy (string) --

        The user, group, or role that last updated this connection definition.

      • Status (string) --

        The status of the connection. Can be one of: READY , IN_PROGRESS , or FAILED .

      • StatusReason (string) --

        The reason for the connection status.

      • LastConnectionValidationTime (datetime) --

        A timestamp of the time this connection was last validated.

      • AuthenticationConfiguration (dict) --

        The authentication properties of the connection.

        • AuthenticationType (string) --

          A structure containing the authentication configuration.

        • SecretArn (string) --

          The secret manager ARN to store credentials.

        • OAuth2Properties (dict) --

          The properties for OAuth2 authentication.

          • OAuth2GrantType (string) --

            The OAuth2 grant type. For example, AUTHORIZATION_CODE , JWT_BEARER , or CLIENT_CREDENTIALS .

          • OAuth2ClientApplication (dict) --

            The client application type. For example, AWS_MANAGED or USER_MANAGED.

            • UserManagedClientApplicationClientId (string) --

              The client application clientID if the ClientAppType is USER_MANAGED .

            • AWSManagedClientApplicationReference (string) --

              The reference to the SaaS-side client app that is Amazon Web Services managed.

          • TokenUrl (string) --

            The URL of the provider's authentication server, to exchange an authorization code for an access token.

          • TokenUrlParametersMap (dict) --

            A map of parameters that are added to the token GET request.

            • (string) --

              • (string) --

GetConnections (updated) Link ¶
Changes (response)
{'ConnectionList': {'AthenaProperties': {'string': 'string'}}}

Retrieves a list of connection definitions from the Data Catalog.

See also: AWS API Documentation

Request Syntax

client.get_connections(
    CatalogId='string',
    Filter={
        'MatchCriteria': [
            'string',
        ],
        'ConnectionType': 'JDBC'|'SFTP'|'MONGODB'|'KAFKA'|'NETWORK'|'MARKETPLACE'|'CUSTOM'|'SALESFORCE'|'VIEW_VALIDATION_REDSHIFT'|'VIEW_VALIDATION_ATHENA'
    },
    HidePassword=True|False,
    NextToken='string',
    MaxResults=123
)
type CatalogId

string

param CatalogId

The ID of the Data Catalog in which the connections reside. If none is provided, the Amazon Web Services account ID is used by default.

type Filter

dict

param Filter

A filter that controls which connections are returned.

  • MatchCriteria (list) --

    A criteria string that must match the criteria recorded in the connection definition for that connection definition to be returned.

    • (string) --

  • ConnectionType (string) --

    The type of connections to return. Currently, SFTP is not supported.

type HidePassword

boolean

param HidePassword

Allows you to retrieve the connection metadata without returning the password. For instance, the Glue console uses this flag to retrieve the connection, and does not display the password. Set this parameter when the caller might not have permission to use the KMS key to decrypt the password, but it does have permission to access the rest of the connection properties.

type NextToken

string

param NextToken

A continuation token, if this is a continuation call.

type MaxResults

integer

param MaxResults

The maximum number of connections to return in one response.

rtype

dict

returns

Response Syntax

{
    'ConnectionList': [
        {
            'Name': 'string',
            'Description': 'string',
            'ConnectionType': 'JDBC'|'SFTP'|'MONGODB'|'KAFKA'|'NETWORK'|'MARKETPLACE'|'CUSTOM'|'SALESFORCE'|'VIEW_VALIDATION_REDSHIFT'|'VIEW_VALIDATION_ATHENA',
            'MatchCriteria': [
                'string',
            ],
            'ConnectionProperties': {
                'string': 'string'
            },
            'AthenaProperties': {
                'string': 'string'
            },
            'PhysicalConnectionRequirements': {
                'SubnetId': 'string',
                'SecurityGroupIdList': [
                    'string',
                ],
                'AvailabilityZone': 'string'
            },
            'CreationTime': datetime(2015, 1, 1),
            'LastUpdatedTime': datetime(2015, 1, 1),
            'LastUpdatedBy': 'string',
            'Status': 'READY'|'IN_PROGRESS'|'FAILED',
            'StatusReason': 'string',
            'LastConnectionValidationTime': datetime(2015, 1, 1),
            'AuthenticationConfiguration': {
                'AuthenticationType': 'BASIC'|'OAUTH2'|'CUSTOM',
                'SecretArn': 'string',
                'OAuth2Properties': {
                    'OAuth2GrantType': 'AUTHORIZATION_CODE'|'CLIENT_CREDENTIALS'|'JWT_BEARER',
                    'OAuth2ClientApplication': {
                        'UserManagedClientApplicationClientId': 'string',
                        'AWSManagedClientApplicationReference': 'string'
                    },
                    'TokenUrl': 'string',
                    'TokenUrlParametersMap': {
                        'string': 'string'
                    }
                }
            }
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • ConnectionList (list) --

      A list of requested connection definitions.

      • (dict) --

        Defines a connection to a data source.

        • Name (string) --

          The name of the connection definition.

        • Description (string) --

          The description of the connection.

        • ConnectionType (string) --

          The type of the connection. Currently, SFTP is not supported.

        • MatchCriteria (list) --

          A list of criteria that can be used in selecting this connection.

          • (string) --

        • ConnectionProperties (dict) --

          These key-value pairs define parameters for the connection:

          • HOST - The host URI: either the fully qualified domain name (FQDN) or the IPv4 address of the database host.

          • PORT - The port number, between 1024 and 65535, of the port on which the database host is listening for database connections.

          • USER_NAME - The name under which to log in to the database. The value string for USER_NAME is " USERNAME ".

          • PASSWORD - A password, if one is used, for the user name.

          • ENCRYPTED_PASSWORD - When you enable connection password protection by setting ConnectionPasswordEncryption in the Data Catalog encryption settings, this field stores the encrypted password.

          • JDBC_DRIVER_JAR_URI - The Amazon Simple Storage Service (Amazon S3) path of the JAR file that contains the JDBC driver to use.

          • JDBC_DRIVER_CLASS_NAME - The class name of the JDBC driver to use.

          • JDBC_ENGINE - The name of the JDBC engine to use.

          • JDBC_ENGINE_VERSION - The version of the JDBC engine to use.

          • CONFIG_FILES - (Reserved for future use.)

          • INSTANCE_ID - The instance ID to use.

          • JDBC_CONNECTION_URL - The URL for connecting to a JDBC data source.

          • JDBC_ENFORCE_SSL - A Boolean string (true, false) specifying whether Secure Sockets Layer (SSL) with hostname matching is enforced for the JDBC connection on the client. The default is false.

          • CUSTOM_JDBC_CERT - An Amazon S3 location specifying the customer's root certificate. Glue uses this root certificate to validate the customer’s certificate when connecting to the customer database. Glue only handles X.509 certificates. The certificate provided must be DER-encoded and supplied in Base64 encoding PEM format.

          • SKIP_CUSTOM_JDBC_CERT_VALIDATION - By default, this is false . Glue validates the Signature algorithm and Subject Public Key Algorithm for the customer certificate. The only permitted algorithms for the Signature algorithm are SHA256withRSA, SHA384withRSA or SHA512withRSA. For the Subject Public Key Algorithm, the key length must be at least 2048. You can set the value of this property to true to skip Glue’s validation of the customer certificate.

          • CUSTOM_JDBC_CERT_STRING - A custom JDBC certificate string which is used for domain match or distinguished name match to prevent a man-in-the-middle attack. In Oracle database, this is used as the SSL_SERVER_CERT_DN ; in Microsoft SQL Server, this is used as the hostNameInCertificate .

          • CONNECTION_URL - The URL for connecting to a general (non-JDBC) data source.

          • SECRET_ID - The secret ID used for the secret manager of credentials.

          • CONNECTOR_URL - The connector URL for a MARKETPLACE or CUSTOM connection.

          • CONNECTOR_TYPE - The connector type for a MARKETPLACE or CUSTOM connection.

          • CONNECTOR_CLASS_NAME - The connector class name for a MARKETPLACE or CUSTOM connection.

          • KAFKA_BOOTSTRAP_SERVERS - A comma-separated list of host and port pairs that are the addresses of the Apache Kafka brokers in a Kafka cluster to which a Kafka client will connect to and bootstrap itself.

          • KAFKA_SSL_ENABLED - Whether to enable or disable SSL on an Apache Kafka connection. Default value is "true".

          • KAFKA_CUSTOM_CERT - The Amazon S3 URL for the private CA cert file (.pem format). The default is an empty string.

          • KAFKA_SKIP_CUSTOM_CERT_VALIDATION - Whether to skip the validation of the CA cert file or not. Glue validates for three algorithms: SHA256withRSA, SHA384withRSA and SHA512withRSA. Default value is "false".

          • KAFKA_CLIENT_KEYSTORE - The Amazon S3 location of the client keystore file for Kafka client side authentication (Optional).

          • KAFKA_CLIENT_KEYSTORE_PASSWORD - The password to access the provided keystore (Optional).

          • KAFKA_CLIENT_KEY_PASSWORD - A keystore can consist of multiple keys, so this is the password to access the client key to be used with the Kafka server side key (Optional).

          • ENCRYPTED_KAFKA_CLIENT_KEYSTORE_PASSWORD - The encrypted version of the Kafka client keystore password (if the user has the Glue encrypt passwords setting selected).

          • ENCRYPTED_KAFKA_CLIENT_KEY_PASSWORD - The encrypted version of the Kafka client key password (if the user has the Glue encrypt passwords setting selected).

          • KAFKA_SASL_MECHANISM - "SCRAM-SHA-512" , "GSSAPI" , "AWS_MSK_IAM" , or "PLAIN" . These are the supported SASL Mechanisms.

          • KAFKA_SASL_PLAIN_USERNAME - A plaintext username used to authenticate with the "PLAIN" mechanism.

          • KAFKA_SASL_PLAIN_PASSWORD - A plaintext password used to authenticate with the "PLAIN" mechanism.

          • ENCRYPTED_KAFKA_SASL_PLAIN_PASSWORD - The encrypted version of the Kafka SASL PLAIN password (if the user has the Glue encrypt passwords setting selected).

          • KAFKA_SASL_SCRAM_USERNAME - A plaintext username used to authenticate with the "SCRAM-SHA-512" mechanism.

          • KAFKA_SASL_SCRAM_PASSWORD - A plaintext password used to authenticate with the "SCRAM-SHA-512" mechanism.

          • ENCRYPTED_KAFKA_SASL_SCRAM_PASSWORD - The encrypted version of the Kafka SASL SCRAM password (if the user has the Glue encrypt passwords setting selected).

          • KAFKA_SASL_SCRAM_SECRETS_ARN - The Amazon Resource Name of a secret in Amazon Web Services Secrets Manager.

          • KAFKA_SASL_GSSAPI_KEYTAB - The S3 location of a Kerberos keytab file. A keytab stores long-term keys for one or more principals. For more information, see MIT Kerberos Documentation: Keytab.

          • KAFKA_SASL_GSSAPI_KRB5_CONF - The S3 location of a Kerberos krb5.conf file. A krb5.conf stores Kerberos configuration information, such as the location of the KDC server. For more information, see MIT Kerberos Documentation: krb5.conf.

          • KAFKA_SASL_GSSAPI_SERVICE - The Kerberos service name, as set with sasl.kerberos.service.name in your Kafka Configuration.

          • KAFKA_SASL_GSSAPI_PRINCIPAL - The name of the Kerberos princial used by Glue. For more information, see Kafka Documentation: Configuring Kafka Brokers.

          • ROLE_ARN - The role to be used for running queries.

          • REGION - The Amazon Web Services Region where queries will be run.

          • WORKGROUP_NAME - The name of an Amazon Redshift serverless workgroup or Amazon Athena workgroup in which queries will run.

          • CLUSTER_IDENTIFIER - The cluster identifier of an Amazon Redshift cluster in which queries will run.

          • DATABASE - The Amazon Redshift database that you are connecting to.

          • (string) --

            • (string) --

        • AthenaProperties (dict) --

          This field is not currently used.

          • (string) --

            • (string) --

        • PhysicalConnectionRequirements (dict) --

          The physical connection requirements, such as virtual private cloud (VPC) and SecurityGroup , that are needed to make this connection successfully.

          • SubnetId (string) --

            The subnet ID used by the connection.

          • SecurityGroupIdList (list) --

            The security group ID list used by the connection.

            • (string) --

          • AvailabilityZone (string) --

            The connection's Availability Zone.

        • CreationTime (datetime) --

          The timestamp of the time that this connection definition was created.

        • LastUpdatedTime (datetime) --

          The timestamp of the last time the connection definition was updated.

        • LastUpdatedBy (string) --

          The user, group, or role that last updated this connection definition.

        • Status (string) --

          The status of the connection. Can be one of: READY , IN_PROGRESS , or FAILED .

        • StatusReason (string) --

          The reason for the connection status.

        • LastConnectionValidationTime (datetime) --

          A timestamp of the time this connection was last validated.

        • AuthenticationConfiguration (dict) --

          The authentication properties of the connection.

          • AuthenticationType (string) --

            A structure containing the authentication configuration.

          • SecretArn (string) --

            The secret manager ARN to store credentials.

          • OAuth2Properties (dict) --

            The properties for OAuth2 authentication.

            • OAuth2GrantType (string) --

              The OAuth2 grant type. For example, AUTHORIZATION_CODE , JWT_BEARER , or CLIENT_CREDENTIALS .

            • OAuth2ClientApplication (dict) --

              The client application type. For example, AWS_MANAGED or USER_MANAGED.

              • UserManagedClientApplicationClientId (string) --

                The client application clientID if the ClientAppType is USER_MANAGED .

              • AWSManagedClientApplicationReference (string) --

                The reference to the SaaS-side client app that is Amazon Web Services managed.

            • TokenUrl (string) --

              The URL of the provider's authentication server, to exchange an authorization code for an access token.

            • TokenUrlParametersMap (dict) --

              A map of parameters that are added to the token GET request.

              • (string) --

                • (string) --

    • NextToken (string) --

      A continuation token, if the list of connections returned does not include the last of the filtered connections.

UpdateConnection (updated) Link ¶
Changes (request)
{'ConnectionInput': {'AthenaProperties': {'string': 'string'}}}

Updates a connection definition in the Data Catalog.

See also: AWS API Documentation

Request Syntax

client.update_connection(
    CatalogId='string',
    Name='string',
    ConnectionInput={
        'Name': 'string',
        'Description': 'string',
        'ConnectionType': 'JDBC'|'SFTP'|'MONGODB'|'KAFKA'|'NETWORK'|'MARKETPLACE'|'CUSTOM'|'SALESFORCE'|'VIEW_VALIDATION_REDSHIFT'|'VIEW_VALIDATION_ATHENA',
        'MatchCriteria': [
            'string',
        ],
        'ConnectionProperties': {
            'string': 'string'
        },
        'AthenaProperties': {
            'string': 'string'
        },
        'PhysicalConnectionRequirements': {
            'SubnetId': 'string',
            'SecurityGroupIdList': [
                'string',
            ],
            'AvailabilityZone': 'string'
        },
        'AuthenticationConfiguration': {
            'AuthenticationType': 'BASIC'|'OAUTH2'|'CUSTOM',
            'OAuth2Properties': {
                'OAuth2GrantType': 'AUTHORIZATION_CODE'|'CLIENT_CREDENTIALS'|'JWT_BEARER',
                'OAuth2ClientApplication': {
                    'UserManagedClientApplicationClientId': 'string',
                    'AWSManagedClientApplicationReference': 'string'
                },
                'TokenUrl': 'string',
                'TokenUrlParametersMap': {
                    'string': 'string'
                },
                'AuthorizationCodeProperties': {
                    'AuthorizationCode': 'string',
                    'RedirectUri': 'string'
                }
            },
            'SecretArn': 'string'
        },
        'ValidateCredentials': True|False
    }
)
type CatalogId

string

param CatalogId

The ID of the Data Catalog in which the connection resides. If none is provided, the Amazon Web Services account ID is used by default.

type Name

string

param Name

[REQUIRED]

The name of the connection definition to update.

type ConnectionInput

dict

param ConnectionInput

[REQUIRED]

A ConnectionInput object that redefines the connection in question.

  • Name (string) -- [REQUIRED]

    The name of the connection.

  • Description (string) --

    The description of the connection.

  • ConnectionType (string) -- [REQUIRED]

    The type of the connection. Currently, these types are supported:

    • JDBC - Designates a connection to a database through Java Database Connectivity (JDBC). JDBC Connections use the following ConnectionParameters.

      • Required: All of ( HOST , PORT , JDBC_ENGINE ) or JDBC_CONNECTION_URL .

      • Required: All of ( USERNAME , PASSWORD ) or SECRET_ID .

      • Optional: JDBC_ENFORCE_SSL , CUSTOM_JDBC_CERT , CUSTOM_JDBC_CERT_STRING , SKIP_CUSTOM_JDBC_CERT_VALIDATION . These parameters are used to configure SSL with JDBC.

    • KAFKA - Designates a connection to an Apache Kafka streaming platform. KAFKA Connections use the following ConnectionParameters.

      • Required: KAFKA_BOOTSTRAP_SERVERS .

      • Optional: KAFKA_SSL_ENABLED , KAFKA_CUSTOM_CERT , KAFKA_SKIP_CUSTOM_CERT_VALIDATION . These parameters are used to configure SSL with KAFKA .

      • Optional: KAFKA_CLIENT_KEYSTORE , KAFKA_CLIENT_KEYSTORE_PASSWORD , KAFKA_CLIENT_KEY_PASSWORD , ENCRYPTED_KAFKA_CLIENT_KEYSTORE_PASSWORD , ENCRYPTED_KAFKA_CLIENT_KEY_PASSWORD . These parameters are used to configure TLS client configuration with SSL in KAFKA .

      • Optional: KAFKA_SASL_MECHANISM . Can be specified as SCRAM-SHA-512 , GSSAPI , or AWS_MSK_IAM .

      • Optional: KAFKA_SASL_SCRAM_USERNAME , KAFKA_SASL_SCRAM_PASSWORD , ENCRYPTED_KAFKA_SASL_SCRAM_PASSWORD . These parameters are used to configure SASL/SCRAM-SHA-512 authentication with KAFKA .

      • Optional: KAFKA_SASL_GSSAPI_KEYTAB , KAFKA_SASL_GSSAPI_KRB5_CONF , KAFKA_SASL_GSSAPI_SERVICE , KAFKA_SASL_GSSAPI_PRINCIPAL . These parameters are used to configure SASL/GSSAPI authentication with KAFKA .

    • MONGODB - Designates a connection to a MongoDB document database. MONGODB Connections use the following ConnectionParameters.

      • Required: CONNECTION_URL .

      • Required: All of ( USERNAME , PASSWORD ) or SECRET_ID .

    • SALESFORCE - Designates a connection to Salesforce using OAuth authencation.

      • Requires the AuthenticationConfiguration member to be configured.

    • VIEW_VALIDATION_REDSHIFT - Designates a connection used for view validation by Amazon Redshift.

    • VIEW_VALIDATION_ATHENA - Designates a connection used for view validation by Amazon Athena.

    • NETWORK - Designates a network connection to a data source within an Amazon Virtual Private Cloud environment (Amazon VPC). NETWORK Connections do not require ConnectionParameters. Instead, provide a PhysicalConnectionRequirements.

    • MARKETPLACE - Uses configuration settings contained in a connector purchased from Amazon Web Services Marketplace to read from and write to data stores that are not natively supported by Glue. MARKETPLACE Connections use the following ConnectionParameters.

      • Required: CONNECTOR_TYPE , CONNECTOR_URL , CONNECTOR_CLASS_NAME , CONNECTION_URL .

      • Required for JDBC CONNECTOR_TYPE connections: All of ( USERNAME , PASSWORD ) or SECRET_ID .

    • CUSTOM - Uses configuration settings contained in a custom connector to read from and write to data stores that are not natively supported by Glue.

    SFTP is not supported.

    For more information about how optional ConnectionProperties are used to configure features in Glue, consult Glue connection properties.

    For more information about how optional ConnectionProperties are used to configure features in Glue Studio, consult Using connectors and connections.

  • MatchCriteria (list) --

    A list of criteria that can be used in selecting this connection.

    • (string) --

  • ConnectionProperties (dict) -- [REQUIRED]

    These key-value pairs define parameters for the connection.

    • (string) --

      • (string) --

  • AthenaProperties (dict) --

    This field is not currently used.

    • (string) --

      • (string) --

  • PhysicalConnectionRequirements (dict) --

    The physical connection requirements, such as virtual private cloud (VPC) and SecurityGroup , that are needed to successfully make this connection.

    • SubnetId (string) --

      The subnet ID used by the connection.

    • SecurityGroupIdList (list) --

      The security group ID list used by the connection.

      • (string) --

    • AvailabilityZone (string) --

      The connection's Availability Zone.

  • AuthenticationConfiguration (dict) --

    The authentication properties of the connection. Used for a Salesforce connection.

    • AuthenticationType (string) --

      A structure containing the authentication configuration in the CreateConnection request.

    • OAuth2Properties (dict) --

      The properties for OAuth2 authentication in the CreateConnection request.

      • OAuth2GrantType (string) --

        The OAuth2 grant type in the CreateConnection request. For example, AUTHORIZATION_CODE , JWT_BEARER , or CLIENT_CREDENTIALS .

      • OAuth2ClientApplication (dict) --

        The client application type in the CreateConnection request. For example, AWS_MANAGED or USER_MANAGED .

        • UserManagedClientApplicationClientId (string) --

          The client application clientID if the ClientAppType is USER_MANAGED .

        • AWSManagedClientApplicationReference (string) --

          The reference to the SaaS-side client app that is Amazon Web Services managed.

      • TokenUrl (string) --

        The URL of the provider's authentication server, to exchange an authorization code for an access token.

      • TokenUrlParametersMap (dict) --

        A map of parameters that are added to the token GET request.

        • (string) --

          • (string) --

      • AuthorizationCodeProperties (dict) --

        The set of properties required for the the OAuth2 AUTHORIZATION_CODE grant type.

        • AuthorizationCode (string) --

          An authorization code to be used in the third leg of the AUTHORIZATION_CODE grant workflow. This is a single-use code which becomes invalid once exchanged for an access token, thus it is acceptable to have this value as a request parameter.

        • RedirectUri (string) --

          The redirect URI where the user gets redirected to by authorization server when issuing an authorization code. The URI is subsequently used when the authorization code is exchanged for an access token.

    • SecretArn (string) --

      The secret manager ARN to store credentials in the CreateConnection request.

  • ValidateCredentials (boolean) --

    A flag to validate the credentials during create connection. Used for a Salesforce connection. Default is true.

rtype

dict

returns

Response Syntax

{}

Response Structure

  • (dict) --