EMR Serverless

2022/11/17 - EMR Serverless - 4 updated api methods

Changes  Adds support for AWS Graviton2 based applications. You can now select CPU architecture when creating new applications or updating existing ones.

CreateApplication (updated) Link ¶
Changes (request)
{'architecture': 'ARM64 | X86_64'}

Creates an application.

See also: AWS API Documentation

Request Syntax

client.create_application(
    name='string',
    releaseLabel='string',
    type='string',
    clientToken='string',
    initialCapacity={
        'string': {
            'workerCount': 123,
            'workerConfiguration': {
                'cpu': 'string',
                'memory': 'string',
                'disk': 'string'
            }
        }
    },
    maximumCapacity={
        'cpu': 'string',
        'memory': 'string',
        'disk': 'string'
    },
    tags={
        'string': 'string'
    },
    autoStartConfiguration={
        'enabled': True|False
    },
    autoStopConfiguration={
        'enabled': True|False,
        'idleTimeoutMinutes': 123
    },
    networkConfiguration={
        'subnetIds': [
            'string',
        ],
        'securityGroupIds': [
            'string',
        ]
    },
    architecture='ARM64'|'X86_64'
)
type name

string

param name

The name of the application.

type releaseLabel

string

param releaseLabel

[REQUIRED]

The EMR release version associated with the application.

type type

string

param type

[REQUIRED]

The type of application you want to start, such as Spark or Hive.

type clientToken

string

param clientToken

[REQUIRED]

The client idempotency token of the application to create. Its value must be unique for each request.

This field is autopopulated if not provided.

type initialCapacity

dict

param initialCapacity

The capacity to initialize when the application is created.

  • (string) --

    • (dict) --

      The initial capacity configuration per worker.

      • workerCount (integer) -- [REQUIRED]

        The number of workers in the initial capacity configuration.

      • workerConfiguration (dict) --

        The resource configuration of the initial capacity configuration.

        • cpu (string) -- [REQUIRED]

          The CPU requirements for every worker instance of the worker type.

        • memory (string) -- [REQUIRED]

          The memory requirements for every worker instance of the worker type.

        • disk (string) --

          The disk requirements for every worker instance of the worker type.

type maximumCapacity

dict

param maximumCapacity

The maximum capacity to allocate when the application is created. This is cumulative across all workers at any given point in time, not just when an application is created. No new resources will be created once any one of the defined limits is hit.

  • cpu (string) -- [REQUIRED]

    The maximum allowed CPU for an application.

  • memory (string) -- [REQUIRED]

    The maximum allowed resources for an application.

  • disk (string) --

    The maximum allowed disk for an application.

type tags

dict

param tags

The tags assigned to the application.

  • (string) --

    • (string) --

type autoStartConfiguration

dict

param autoStartConfiguration

The configuration for an application to automatically start on job submission.

  • enabled (boolean) --

    Enables the application to automatically start on job submission. Defaults to true.

type autoStopConfiguration

dict

param autoStopConfiguration

The configuration for an application to automatically stop after a certain amount of time being idle.

  • enabled (boolean) --

    Enables the application to automatically stop after a certain amount of time being idle. Defaults to true.

  • idleTimeoutMinutes (integer) --

    The amount of idle time in minutes after which your application will automatically stop. Defaults to 15 minutes.

type networkConfiguration

dict

param networkConfiguration

The network configuration for customer VPC connectivity.

  • subnetIds (list) --

    The array of subnet Ids for customer VPC connectivity.

    • (string) --

  • securityGroupIds (list) --

    The array of security group Ids for customer VPC connectivity.

    • (string) --

type architecture

string

param architecture

The CPU architecture of an application.

rtype

dict

returns

Response Syntax

{
    'applicationId': 'string',
    'name': 'string',
    'arn': 'string'
}

Response Structure

  • (dict) --

    • applicationId (string) --

      The output contains the application ID.

    • name (string) --

      The output contains the name of the application.

    • arn (string) --

      The output contains the ARN of the application.

GetApplication (updated) Link ¶
Changes (response)
{'application': {'architecture': 'ARM64 | X86_64'}}

Displays detailed information about a specified application.

See also: AWS API Documentation

Request Syntax

client.get_application(
    applicationId='string'
)
type applicationId

string

param applicationId

[REQUIRED]

The ID of the application that will be described.

rtype

dict

returns

Response Syntax

{
    'application': {
        'applicationId': 'string',
        'name': 'string',
        'arn': 'string',
        'releaseLabel': 'string',
        'type': 'string',
        'state': 'CREATING'|'CREATED'|'STARTING'|'STARTED'|'STOPPING'|'STOPPED'|'TERMINATED',
        'stateDetails': 'string',
        'initialCapacity': {
            'string': {
                'workerCount': 123,
                'workerConfiguration': {
                    'cpu': 'string',
                    'memory': 'string',
                    'disk': 'string'
                }
            }
        },
        'maximumCapacity': {
            'cpu': 'string',
            'memory': 'string',
            'disk': 'string'
        },
        'createdAt': datetime(2015, 1, 1),
        'updatedAt': datetime(2015, 1, 1),
        'tags': {
            'string': 'string'
        },
        'autoStartConfiguration': {
            'enabled': True|False
        },
        'autoStopConfiguration': {
            'enabled': True|False,
            'idleTimeoutMinutes': 123
        },
        'networkConfiguration': {
            'subnetIds': [
                'string',
            ],
            'securityGroupIds': [
                'string',
            ]
        },
        'architecture': 'ARM64'|'X86_64'
    }
}

Response Structure

  • (dict) --

    • application (dict) --

      The output displays information about the specified application.

      • applicationId (string) --

        The ID of the application.

      • name (string) --

        The name of the application.

      • arn (string) --

        The ARN of the application.

      • releaseLabel (string) --

        The EMR release version associated with the application.

      • type (string) --

        The type of application, such as Spark or Hive.

      • state (string) --

        The state of the application.

      • stateDetails (string) --

        The state details of the application.

      • initialCapacity (dict) --

        The initial capacity of the application.

        • (string) --

          • (dict) --

            The initial capacity configuration per worker.

            • workerCount (integer) --

              The number of workers in the initial capacity configuration.

            • workerConfiguration (dict) --

              The resource configuration of the initial capacity configuration.

              • cpu (string) --

                The CPU requirements for every worker instance of the worker type.

              • memory (string) --

                The memory requirements for every worker instance of the worker type.

              • disk (string) --

                The disk requirements for every worker instance of the worker type.

      • maximumCapacity (dict) --

        The maximum capacity of the application. This is cumulative across all workers at any given point in time during the lifespan of the application is created. No new resources will be created once any one of the defined limits is hit.

        • cpu (string) --

          The maximum allowed CPU for an application.

        • memory (string) --

          The maximum allowed resources for an application.

        • disk (string) --

          The maximum allowed disk for an application.

      • createdAt (datetime) --

        The date and time when the application run was created.

      • updatedAt (datetime) --

        The date and time when the application run was last updated.

      • tags (dict) --

        The tags assigned to the application.

        • (string) --

          • (string) --

      • autoStartConfiguration (dict) --

        The configuration for an application to automatically start on job submission.

        • enabled (boolean) --

          Enables the application to automatically start on job submission. Defaults to true.

      • autoStopConfiguration (dict) --

        The configuration for an application to automatically stop after a certain amount of time being idle.

        • enabled (boolean) --

          Enables the application to automatically stop after a certain amount of time being idle. Defaults to true.

        • idleTimeoutMinutes (integer) --

          The amount of idle time in minutes after which your application will automatically stop. Defaults to 15 minutes.

      • networkConfiguration (dict) --

        The network configuration for customer VPC connectivity for the application.

        • subnetIds (list) --

          The array of subnet Ids for customer VPC connectivity.

          • (string) --

        • securityGroupIds (list) --

          The array of security group Ids for customer VPC connectivity.

          • (string) --

      • architecture (string) --

        The CPU architecture of an application.

ListApplications (updated) Link ¶
Changes (response)
{'applications': {'architecture': 'ARM64 | X86_64'}}

Lists applications based on a set of parameters.

See also: AWS API Documentation

Request Syntax

client.list_applications(
    nextToken='string',
    maxResults=123,
    states=[
        'CREATING'|'CREATED'|'STARTING'|'STARTED'|'STOPPING'|'STOPPED'|'TERMINATED',
    ]
)
type nextToken

string

param nextToken

The token for the next set of application results.

type maxResults

integer

param maxResults

The maximum number of applications that can be listed.

type states

list

param states

An optional filter for application states. Note that if this filter contains multiple states, the resulting list will be grouped by the state.

  • (string) --

rtype

dict

returns

Response Syntax

{
    'applications': [
        {
            'id': 'string',
            'name': 'string',
            'arn': 'string',
            'releaseLabel': 'string',
            'type': 'string',
            'state': 'CREATING'|'CREATED'|'STARTING'|'STARTED'|'STOPPING'|'STOPPED'|'TERMINATED',
            'stateDetails': 'string',
            'createdAt': datetime(2015, 1, 1),
            'updatedAt': datetime(2015, 1, 1),
            'architecture': 'ARM64'|'X86_64'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) --

    • applications (list) --

      The output lists the specified applications.

      • (dict) --

        The summary of attributes associated with an application.

        • id (string) --

          The ID of the application.

        • name (string) --

          The name of the application.

        • arn (string) --

          The ARN of the application.

        • releaseLabel (string) --

          The EMR release version associated with the application.

        • type (string) --

          The type of application, such as Spark or Hive.

        • state (string) --

          The state of the application.

        • stateDetails (string) --

          The state details of the application.

        • createdAt (datetime) --

          The date and time when the application was created.

        • updatedAt (datetime) --

          The date and time when the application was last updated.

        • architecture (string) --

          The CPU architecture of an application.

    • nextToken (string) --

      The output displays the token for the next set of application results. This is required for pagination and is available as a response of the previous request.

UpdateApplication (updated) Link ¶
Changes (request, response)
Request
{'architecture': 'ARM64 | X86_64'}
Response
{'application': {'architecture': 'ARM64 | X86_64'}}

Updates a specified application. An application has to be in a stopped or created state in order to be updated.

See also: AWS API Documentation

Request Syntax

client.update_application(
    applicationId='string',
    clientToken='string',
    initialCapacity={
        'string': {
            'workerCount': 123,
            'workerConfiguration': {
                'cpu': 'string',
                'memory': 'string',
                'disk': 'string'
            }
        }
    },
    maximumCapacity={
        'cpu': 'string',
        'memory': 'string',
        'disk': 'string'
    },
    autoStartConfiguration={
        'enabled': True|False
    },
    autoStopConfiguration={
        'enabled': True|False,
        'idleTimeoutMinutes': 123
    },
    networkConfiguration={
        'subnetIds': [
            'string',
        ],
        'securityGroupIds': [
            'string',
        ]
    },
    architecture='ARM64'|'X86_64'
)
type applicationId

string

param applicationId

[REQUIRED]

The ID of the application to update.

type clientToken

string

param clientToken

[REQUIRED]

The client idempotency token of the application to update. Its value must be unique for each request.

This field is autopopulated if not provided.

type initialCapacity

dict

param initialCapacity

The capacity to initialize when the application is updated.

  • (string) --

    • (dict) --

      The initial capacity configuration per worker.

      • workerCount (integer) -- [REQUIRED]

        The number of workers in the initial capacity configuration.

      • workerConfiguration (dict) --

        The resource configuration of the initial capacity configuration.

        • cpu (string) -- [REQUIRED]

          The CPU requirements for every worker instance of the worker type.

        • memory (string) -- [REQUIRED]

          The memory requirements for every worker instance of the worker type.

        • disk (string) --

          The disk requirements for every worker instance of the worker type.

type maximumCapacity

dict

param maximumCapacity

The maximum capacity to allocate when the application is updated. This is cumulative across all workers at any given point in time during the lifespan of the application. No new resources will be created once any one of the defined limits is hit.

  • cpu (string) -- [REQUIRED]

    The maximum allowed CPU for an application.

  • memory (string) -- [REQUIRED]

    The maximum allowed resources for an application.

  • disk (string) --

    The maximum allowed disk for an application.

type autoStartConfiguration

dict

param autoStartConfiguration

The configuration for an application to automatically start on job submission.

  • enabled (boolean) --

    Enables the application to automatically start on job submission. Defaults to true.

type autoStopConfiguration

dict

param autoStopConfiguration

The configuration for an application to automatically stop after a certain amount of time being idle.

  • enabled (boolean) --

    Enables the application to automatically stop after a certain amount of time being idle. Defaults to true.

  • idleTimeoutMinutes (integer) --

    The amount of idle time in minutes after which your application will automatically stop. Defaults to 15 minutes.

type networkConfiguration

dict

param networkConfiguration

The network configuration for customer VPC connectivity.

  • subnetIds (list) --

    The array of subnet Ids for customer VPC connectivity.

    • (string) --

  • securityGroupIds (list) --

    The array of security group Ids for customer VPC connectivity.

    • (string) --

type architecture

string

param architecture

The CPU architecture of an application.

rtype

dict

returns

Response Syntax

{
    'application': {
        'applicationId': 'string',
        'name': 'string',
        'arn': 'string',
        'releaseLabel': 'string',
        'type': 'string',
        'state': 'CREATING'|'CREATED'|'STARTING'|'STARTED'|'STOPPING'|'STOPPED'|'TERMINATED',
        'stateDetails': 'string',
        'initialCapacity': {
            'string': {
                'workerCount': 123,
                'workerConfiguration': {
                    'cpu': 'string',
                    'memory': 'string',
                    'disk': 'string'
                }
            }
        },
        'maximumCapacity': {
            'cpu': 'string',
            'memory': 'string',
            'disk': 'string'
        },
        'createdAt': datetime(2015, 1, 1),
        'updatedAt': datetime(2015, 1, 1),
        'tags': {
            'string': 'string'
        },
        'autoStartConfiguration': {
            'enabled': True|False
        },
        'autoStopConfiguration': {
            'enabled': True|False,
            'idleTimeoutMinutes': 123
        },
        'networkConfiguration': {
            'subnetIds': [
                'string',
            ],
            'securityGroupIds': [
                'string',
            ]
        },
        'architecture': 'ARM64'|'X86_64'
    }
}

Response Structure

  • (dict) --

    • application (dict) --

      Information about the updated application.

      • applicationId (string) --

        The ID of the application.

      • name (string) --

        The name of the application.

      • arn (string) --

        The ARN of the application.

      • releaseLabel (string) --

        The EMR release version associated with the application.

      • type (string) --

        The type of application, such as Spark or Hive.

      • state (string) --

        The state of the application.

      • stateDetails (string) --

        The state details of the application.

      • initialCapacity (dict) --

        The initial capacity of the application.

        • (string) --

          • (dict) --

            The initial capacity configuration per worker.

            • workerCount (integer) --

              The number of workers in the initial capacity configuration.

            • workerConfiguration (dict) --

              The resource configuration of the initial capacity configuration.

              • cpu (string) --

                The CPU requirements for every worker instance of the worker type.

              • memory (string) --

                The memory requirements for every worker instance of the worker type.

              • disk (string) --

                The disk requirements for every worker instance of the worker type.

      • maximumCapacity (dict) --

        The maximum capacity of the application. This is cumulative across all workers at any given point in time during the lifespan of the application is created. No new resources will be created once any one of the defined limits is hit.

        • cpu (string) --

          The maximum allowed CPU for an application.

        • memory (string) --

          The maximum allowed resources for an application.

        • disk (string) --

          The maximum allowed disk for an application.

      • createdAt (datetime) --

        The date and time when the application run was created.

      • updatedAt (datetime) --

        The date and time when the application run was last updated.

      • tags (dict) --

        The tags assigned to the application.

        • (string) --

          • (string) --

      • autoStartConfiguration (dict) --

        The configuration for an application to automatically start on job submission.

        • enabled (boolean) --

          Enables the application to automatically start on job submission. Defaults to true.

      • autoStopConfiguration (dict) --

        The configuration for an application to automatically stop after a certain amount of time being idle.

        • enabled (boolean) --

          Enables the application to automatically stop after a certain amount of time being idle. Defaults to true.

        • idleTimeoutMinutes (integer) --

          The amount of idle time in minutes after which your application will automatically stop. Defaults to 15 minutes.

      • networkConfiguration (dict) --

        The network configuration for customer VPC connectivity for the application.

        • subnetIds (list) --

          The array of subnet Ids for customer VPC connectivity.

          • (string) --

        • securityGroupIds (list) --

          The array of security group Ids for customer VPC connectivity.

          • (string) --

      • architecture (string) --

        The CPU architecture of an application.