AWS DataSync

2019/09/24 - AWS DataSync - 7 updated api methods

Changes  Added S3StorageClass, OverwriteMode sync option, and ONLY_FILES_TRANSFERRED setting for the VerifyMode sync option.

CreateLocationS3 (updated) Link ¶
Changes (request)
{'S3StorageClass': 'STANDARD | STANDARD_IA | ONEZONE_IA | INTELLIGENT_TIERING '
                   '| GLACIER | DEEP_ARCHIVE'}

Creates an endpoint for an Amazon S3 bucket.

For AWS DataSync to access a destination S3 bucket, it needs an AWS Identity and Access Management (IAM) role that has the required permissions. You can set up the required permissions by creating an IAM policy that grants the required permissions and attaching the policy to the role. An example of such a policy is shown in the examples section.

For more information, see https://docs.aws.amazon.com/datasync/latest/userguide/working-with-locations.html#create-s3-location in the AWS DataSync User Guide.

See also: AWS API Documentation

Request Syntax

client.create_location_s3(
    Subdirectory='string',
    S3BucketArn='string',
    S3StorageClass='STANDARD'|'STANDARD_IA'|'ONEZONE_IA'|'INTELLIGENT_TIERING'|'GLACIER'|'DEEP_ARCHIVE',
    S3Config={
        'BucketAccessRoleArn': 'string'
    },
    Tags=[
        {
            'Key': 'string',
            'Value': 'string'
        },
    ]
)
type Subdirectory

string

param Subdirectory

A subdirectory in the Amazon S3 bucket. This subdirectory in Amazon S3 is used to read data from the S3 source location or write data to the S3 destination.

type S3BucketArn

string

param S3BucketArn

[REQUIRED]

The Amazon Resource Name (ARN) of the Amazon S3 bucket.

type S3StorageClass

string

param S3StorageClass

The Amazon S3 storage class that you want to store your files in when this location is used as a task destination. For more information about S3 storage classes, see Amazon S3 Storage Classes in the Amazon Simple Storage Service Developer Guide . Some storage classes have behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes.

type S3Config

dict

param S3Config

[REQUIRED]

The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role that is used to access an Amazon S3 bucket.

For detailed information about using such a role, see Creating a Location for Amazon S3 in the AWS DataSync User Guide .

  • BucketAccessRoleArn (string) -- [REQUIRED]

    The Amazon S3 bucket to access. This bucket is used as a parameter in the CreateLocationS3 operation.

type Tags

list

param Tags

The key-value pair that represents the tag that you want to add to the location. The value can be an empty string. We recommend using tags to name your resources.

  • (dict) --

    Represents a single entry in a list of AWS resource tags. TagListEntry returns an array that contains a list of tasks when the ListTagsForResource operation is called.

    • Key (string) -- [REQUIRED]

      The key for an AWS resource tag.

    • Value (string) --

      The value for an AWS resource tag.

rtype

dict

returns

Response Syntax

{
    'LocationArn': 'string'
}

Response Structure

  • (dict) --

    CreateLocationS3Response

    • LocationArn (string) --

      The Amazon Resource Name (ARN) of the source Amazon S3 bucket location that is created.

CreateTask (updated) Link ¶
Changes (request)
{'Options': {'OverwriteMode': 'ALWAYS | NEVER',
             'VerifyMode': {'ONLY_FILES_TRANSFERRED'}}}

Creates a task. A task is a set of two locations (source and destination) and a set of Options that you use to control the behavior of a task. If you don't specify Options when you create a task, AWS DataSync populates them with service defaults.

When you create a task, it first enters the CREATING state. During CREATING AWS DataSync attempts to mount the on-premises Network File System (NFS) location. The task transitions to the AVAILABLE state without waiting for the AWS location to become mounted. If required, AWS DataSync mounts the AWS location before each task execution.

If an agent that is associated with a source (NFS) location goes offline, the task transitions to the UNAVAILABLE status. If the status of the task remains in the CREATING status for more than a few minutes, it means that your agent might be having trouble mounting the source NFS file system. Check the task's ErrorCode and ErrorDetail. Mount issues are often caused by either a misconfigured firewall or a mistyped NFS server host name.

See also: AWS API Documentation

Request Syntax

client.create_task(
    SourceLocationArn='string',
    DestinationLocationArn='string',
    CloudWatchLogGroupArn='string',
    Name='string',
    Options={
        'VerifyMode': 'POINT_IN_TIME_CONSISTENT'|'ONLY_FILES_TRANSFERRED'|'NONE',
        'OverwriteMode': 'ALWAYS'|'NEVER',
        'Atime': 'NONE'|'BEST_EFFORT',
        'Mtime': 'NONE'|'PRESERVE',
        'Uid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'Gid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'PreserveDeletedFiles': 'PRESERVE'|'REMOVE',
        'PreserveDevices': 'NONE'|'PRESERVE',
        'PosixPermissions': 'NONE'|'BEST_EFFORT'|'PRESERVE',
        'BytesPerSecond': 123
    },
    Excludes=[
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ],
    Tags=[
        {
            'Key': 'string',
            'Value': 'string'
        },
    ]
)
type SourceLocationArn

string

param SourceLocationArn

[REQUIRED]

The Amazon Resource Name (ARN) of the source location for the task.

type DestinationLocationArn

string

param DestinationLocationArn

[REQUIRED]

The Amazon Resource Name (ARN) of an AWS storage resource's location.

type CloudWatchLogGroupArn

string

param CloudWatchLogGroupArn

The Amazon Resource Name (ARN) of the Amazon CloudWatch log group that is used to monitor and log events in the task.

For more information on these groups, see Working with Log Groups and Log Streams in the Amazon CloudWatch User Guide.

For more information about how to use CloudWatch Logs with DataSync, see Monitoring Your Task in the AWS DataSync User Guide.

type Name

string

param Name

The name of a task. This value is a text reference that is used to identify the task in the console.

type Options

dict

param Options

The set of configuration options that control the behavior of a single execution of the task that occurs when you call StartTaskExecution . You can configure these options to preserve metadata such as user ID (UID) and group ID (GID), file permissions, data integrity verification, and so on.

For each individual task execution, you can override these options by specifying the OverrideOptions before starting a the task execution. For more information, see the operation.

  • VerifyMode (string) --

    A value that determines whether a data integrity verification should be performed at the end of a task execution after all data and metadata have been transferred.

    Default value: POINT_IN_TIME_CONSISTENT.

    POINT_IN_TIME_CONSISTENT: Perform verification (recommended).

    ONLY_FILES_TRANSFERRED: Perform verification on only files that were transferred.

    NONE: Skip verification.

  • OverwriteMode (string) --

    A value that determines whether files at the destination should be overwritten or preserved when copying files. If set to NEVER a destination file will not be replaced by a source file, even if the destination file differs from the source file. If you modify files in the destination and you sync the files, you can use this value to protect against overwriting those changes.

    Some storage classes have specific behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

  • Atime (string) --

    A file metadata value that shows the last time a file was accessed (that is, when the file was read or written to). If you set Atime to BEST_EFFORT, DataSync attempts to preserve the original Atime attribute on all source files (that is, the version before the PREPARING phase). However, Atime 's behavior is not fully standard across platforms, so AWS DataSync can only do this on a best-effort basis.

    Default value: BEST_EFFORT.

    BEST_EFFORT: Attempt to preserve the per-file Atime value (recommended).

    NONE: Ignore Atime .

    Note

    If Atime is set to BEST_EFFORT, Mtime must be set to PRESERVE.

    If Atime is set to NONE, Mtime must also be NONE.

  • Mtime (string) --

    A value that indicates the last time that a file was modified (that is, a file was written to) before the PREPARING phase.

    Default value: PRESERVE.

    PRESERVE: Preserve original Mtime (recommended)

    NONE: Ignore Mtime .

    Note

    If Mtime is set to PRESERVE, Atime must be set to BEST_EFFORT.

    If Mtime is set to NONE, Atime must also be set to NONE.

  • Uid (string) --

    The user ID (UID) of the file's owner.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of UID and group ID (GID) (recommended).

    NONE: Ignore UID and GID.

  • Gid (string) --

    The group ID (GID) of the file's owners.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of user ID (UID) and GID (recommended).

    NONE: Ignore UID and GID.

  • PreserveDeletedFiles (string) --

    A value that specifies whether files in the destination that don't exist in the source file system should be preserved. This option can affect your storage cost. If your task deletes objects, you might incur minimum storage duration charges for certain storage classes. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

    Default value: PRESERVE.

    PRESERVE: Ignore such destination files (recommended).

    REMOVE: Delete destination files that aren’t present in the source.

  • PreserveDevices (string) --

    A value that determines whether AWS DataSync should preserve the metadata of block and character devices in the source file system, and recreate the files with that device name and metadata on the destination.

    Note

    AWS DataSync can't sync the actual contents of such devices, because they are nonterminal and don't return an end-of-file (EOF) marker.

    Default value: NONE.

    NONE: Ignore special devices (recommended).

    PRESERVE: Preserve character and block device metadata. This option isn't currently supported for Amazon EFS.

  • PosixPermissions (string) --

    A value that determines which users or groups can access a file for a specific purpose such as reading, writing, or execution of the file.

    Default value: PRESERVE.

    PRESERVE: Preserve POSIX-style permissions (recommended).

    NONE: Ignore permissions.

    Note

    AWS DataSync can preserve extant permissions of a source location.

  • BytesPerSecond (integer) --

    A value that limits the bandwidth used by AWS DataSync. For example, if you want AWS DataSync to use a maximum of 1 MB, set this value to 1048576 ( =1024*1024 ).

type Excludes

list

param Excludes

A list of filter rules that determines which files to exclude from a task. The list should contain a single filter string that consists of the patterns to exclude. The patterns are delimited by "|" (that is, a pipe), for example, "/folder1|/folder2"

  • (dict) --

    Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

    • FilterType (string) --

      The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

    • Value (string) --

      A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

type Tags

list

param Tags

The key-value pair that represents the tag that you want to add to the resource. The value can be an empty string.

  • (dict) --

    Represents a single entry in a list of AWS resource tags. TagListEntry returns an array that contains a list of tasks when the ListTagsForResource operation is called.

    • Key (string) -- [REQUIRED]

      The key for an AWS resource tag.

    • Value (string) --

      The value for an AWS resource tag.

rtype

dict

returns

Response Syntax

{
    'TaskArn': 'string'
}

Response Structure

  • (dict) --

    CreateTaskResponse

    • TaskArn (string) --

      The Amazon Resource Name (ARN) of the task.

DescribeLocationS3 (updated) Link ¶
Changes (response)
{'S3StorageClass': 'STANDARD | STANDARD_IA | ONEZONE_IA | INTELLIGENT_TIERING '
                   '| GLACIER | DEEP_ARCHIVE'}

Returns metadata, such as bucket name, about an Amazon S3 bucket location.

See also: AWS API Documentation

Request Syntax

client.describe_location_s3(
    LocationArn='string'
)
type LocationArn

string

param LocationArn

[REQUIRED]

The Amazon Resource Name (ARN) of the Amazon S3 bucket location to describe.

rtype

dict

returns

Response Syntax

{
    'LocationArn': 'string',
    'LocationUri': 'string',
    'S3StorageClass': 'STANDARD'|'STANDARD_IA'|'ONEZONE_IA'|'INTELLIGENT_TIERING'|'GLACIER'|'DEEP_ARCHIVE',
    'S3Config': {
        'BucketAccessRoleArn': 'string'
    },
    'CreationTime': datetime(2015, 1, 1)
}

Response Structure

  • (dict) --

    DescribeLocationS3Response

    • LocationArn (string) --

      The Amazon Resource Name (ARN) of the Amazon S3 bucket location.

    • LocationUri (string) --

      The URL of the Amazon S3 location that was described.

    • S3StorageClass (string) --

      The Amazon S3 storage class that you chose to store your files in when this location is used as a task destination. For more information about S3 storage classes, see Amazon S3 Storage Classes in the Amazon Simple Storage Service Developer Guide . Some storage classes have behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes.

    • S3Config (dict) --

      The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role that is used to access an Amazon S3 bucket.

      For detailed information about using such a role, see Creating a Location for Amazon S3 in the AWS DataSync User Guide .

      • BucketAccessRoleArn (string) --

        The Amazon S3 bucket to access. This bucket is used as a parameter in the CreateLocationS3 operation.

    • CreationTime (datetime) --

      The time that the Amazon S3 bucket location was created.

DescribeTask (updated) Link ¶
Changes (response)
{'Options': {'OverwriteMode': 'ALWAYS | NEVER',
             'VerifyMode': {'ONLY_FILES_TRANSFERRED'}}}

Returns metadata about a task.

See also: AWS API Documentation

Request Syntax

client.describe_task(
    TaskArn='string'
)
type TaskArn

string

param TaskArn

[REQUIRED]

The Amazon Resource Name (ARN) of the task to describe.

rtype

dict

returns

Response Syntax

{
    'TaskArn': 'string',
    'Status': 'AVAILABLE'|'CREATING'|'RUNNING'|'UNAVAILABLE',
    'Name': 'string',
    'CurrentTaskExecutionArn': 'string',
    'SourceLocationArn': 'string',
    'DestinationLocationArn': 'string',
    'CloudWatchLogGroupArn': 'string',
    'SourceNetworkInterfaceArns': [
        'string',
    ],
    'DestinationNetworkInterfaceArns': [
        'string',
    ],
    'Options': {
        'VerifyMode': 'POINT_IN_TIME_CONSISTENT'|'ONLY_FILES_TRANSFERRED'|'NONE',
        'OverwriteMode': 'ALWAYS'|'NEVER',
        'Atime': 'NONE'|'BEST_EFFORT',
        'Mtime': 'NONE'|'PRESERVE',
        'Uid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'Gid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'PreserveDeletedFiles': 'PRESERVE'|'REMOVE',
        'PreserveDevices': 'NONE'|'PRESERVE',
        'PosixPermissions': 'NONE'|'BEST_EFFORT'|'PRESERVE',
        'BytesPerSecond': 123
    },
    'Excludes': [
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ],
    'ErrorCode': 'string',
    'ErrorDetail': 'string',
    'CreationTime': datetime(2015, 1, 1)
}

Response Structure

  • (dict) --

    DescribeTaskResponse

    • TaskArn (string) --

      The Amazon Resource Name (ARN) of the task that was described.

    • Status (string) --

      The status of the task that was described.

      For detailed information about task execution statuses, see Understanding Task Statuses in the AWS DataSync User Guide.

    • Name (string) --

      The name of the task that was described.

    • CurrentTaskExecutionArn (string) --

      The Amazon Resource Name (ARN) of the task execution that is syncing files.

    • SourceLocationArn (string) --

      The Amazon Resource Name (ARN) of the source file system's location.

    • DestinationLocationArn (string) --

      The Amazon Resource Name (ARN) of the AWS storage resource's location.

    • CloudWatchLogGroupArn (string) --

      The Amazon Resource Name (ARN) of the Amazon CloudWatch log group that was used to monitor and log events in the task.

      For more information on these groups, see Working with Log Groups and Log Streams in the Amazon CloudWatch User Guide .

    • SourceNetworkInterfaceArns (list) --

      The Amazon Resource Name (ARN) of the source ENIs (Elastic Network Interface) that was created for your subnet.

      • (string) --

    • DestinationNetworkInterfaceArns (list) --

      The Amazon Resource Name (ARN) of the destination ENIs (Elastic Network Interface) that was created for your subnet.

      • (string) --

    • Options (dict) --

      The set of configuration options that control the behavior of a single execution of the task that occurs when you call StartTaskExecution . You can configure these options to preserve metadata such as user ID (UID) and group (GID), file permissions, data integrity verification, and so on.

      For each individual task execution, you can override these options by specifying the overriding OverrideOptions value to operation.

      • VerifyMode (string) --

        A value that determines whether a data integrity verification should be performed at the end of a task execution after all data and metadata have been transferred.

        Default value: POINT_IN_TIME_CONSISTENT.

        POINT_IN_TIME_CONSISTENT: Perform verification (recommended).

        ONLY_FILES_TRANSFERRED: Perform verification on only files that were transferred.

        NONE: Skip verification.

      • OverwriteMode (string) --

        A value that determines whether files at the destination should be overwritten or preserved when copying files. If set to NEVER a destination file will not be replaced by a source file, even if the destination file differs from the source file. If you modify files in the destination and you sync the files, you can use this value to protect against overwriting those changes.

        Some storage classes have specific behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

      • Atime (string) --

        A file metadata value that shows the last time a file was accessed (that is, when the file was read or written to). If you set Atime to BEST_EFFORT, DataSync attempts to preserve the original Atime attribute on all source files (that is, the version before the PREPARING phase). However, Atime 's behavior is not fully standard across platforms, so AWS DataSync can only do this on a best-effort basis.

        Default value: BEST_EFFORT.

        BEST_EFFORT: Attempt to preserve the per-file Atime value (recommended).

        NONE: Ignore Atime .

        Note

        If Atime is set to BEST_EFFORT, Mtime must be set to PRESERVE.

        If Atime is set to NONE, Mtime must also be NONE.

      • Mtime (string) --

        A value that indicates the last time that a file was modified (that is, a file was written to) before the PREPARING phase.

        Default value: PRESERVE.

        PRESERVE: Preserve original Mtime (recommended)

        NONE: Ignore Mtime .

        Note

        If Mtime is set to PRESERVE, Atime must be set to BEST_EFFORT.

        If Mtime is set to NONE, Atime must also be set to NONE.

      • Uid (string) --

        The user ID (UID) of the file's owner.

        Default value: INT_VALUE. This preserves the integer value of the ID.

        INT_VALUE: Preserve the integer value of UID and group ID (GID) (recommended).

        NONE: Ignore UID and GID.

      • Gid (string) --

        The group ID (GID) of the file's owners.

        Default value: INT_VALUE. This preserves the integer value of the ID.

        INT_VALUE: Preserve the integer value of user ID (UID) and GID (recommended).

        NONE: Ignore UID and GID.

      • PreserveDeletedFiles (string) --

        A value that specifies whether files in the destination that don't exist in the source file system should be preserved. This option can affect your storage cost. If your task deletes objects, you might incur minimum storage duration charges for certain storage classes. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

        Default value: PRESERVE.

        PRESERVE: Ignore such destination files (recommended).

        REMOVE: Delete destination files that aren’t present in the source.

      • PreserveDevices (string) --

        A value that determines whether AWS DataSync should preserve the metadata of block and character devices in the source file system, and recreate the files with that device name and metadata on the destination.

        Note

        AWS DataSync can't sync the actual contents of such devices, because they are nonterminal and don't return an end-of-file (EOF) marker.

        Default value: NONE.

        NONE: Ignore special devices (recommended).

        PRESERVE: Preserve character and block device metadata. This option isn't currently supported for Amazon EFS.

      • PosixPermissions (string) --

        A value that determines which users or groups can access a file for a specific purpose such as reading, writing, or execution of the file.

        Default value: PRESERVE.

        PRESERVE: Preserve POSIX-style permissions (recommended).

        NONE: Ignore permissions.

        Note

        AWS DataSync can preserve extant permissions of a source location.

      • BytesPerSecond (integer) --

        A value that limits the bandwidth used by AWS DataSync. For example, if you want AWS DataSync to use a maximum of 1 MB, set this value to 1048576 ( =1024*1024 ).

    • Excludes (list) --

      A list of filter rules that determines which files to exclude from a task. The list should contain a single filter string that consists of the patterns to exclude. The patterns are delimited by "|" (that is, a pipe), for example: "/folder1|/folder2"

      • (dict) --

        Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

        • FilterType (string) --

          The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

        • Value (string) --

          A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

    • ErrorCode (string) --

      Errors that AWS DataSync encountered during execution of the task. You can use this error code to help troubleshoot issues.

    • ErrorDetail (string) --

      Detailed description of an error that was encountered during the task execution. You can use this information to help troubleshoot issues.

    • CreationTime (datetime) --

      The time that the task was created.

DescribeTaskExecution (updated) Link ¶
Changes (response)
{'Options': {'OverwriteMode': 'ALWAYS | NEVER',
             'VerifyMode': {'ONLY_FILES_TRANSFERRED'}}}

Returns detailed metadata about a task that is being executed.

See also: AWS API Documentation

Request Syntax

client.describe_task_execution(
    TaskExecutionArn='string'
)
type TaskExecutionArn

string

param TaskExecutionArn

[REQUIRED]

The Amazon Resource Name (ARN) of the task that is being executed.

rtype

dict

returns

Response Syntax

{
    'TaskExecutionArn': 'string',
    'Status': 'LAUNCHING'|'PREPARING'|'TRANSFERRING'|'VERIFYING'|'SUCCESS'|'ERROR',
    'Options': {
        'VerifyMode': 'POINT_IN_TIME_CONSISTENT'|'ONLY_FILES_TRANSFERRED'|'NONE',
        'OverwriteMode': 'ALWAYS'|'NEVER',
        'Atime': 'NONE'|'BEST_EFFORT',
        'Mtime': 'NONE'|'PRESERVE',
        'Uid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'Gid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'PreserveDeletedFiles': 'PRESERVE'|'REMOVE',
        'PreserveDevices': 'NONE'|'PRESERVE',
        'PosixPermissions': 'NONE'|'BEST_EFFORT'|'PRESERVE',
        'BytesPerSecond': 123
    },
    'Excludes': [
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ],
    'Includes': [
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ],
    'StartTime': datetime(2015, 1, 1),
    'EstimatedFilesToTransfer': 123,
    'EstimatedBytesToTransfer': 123,
    'FilesTransferred': 123,
    'BytesWritten': 123,
    'BytesTransferred': 123,
    'Result': {
        'PrepareDuration': 123,
        'PrepareStatus': 'PENDING'|'SUCCESS'|'ERROR',
        'TransferDuration': 123,
        'TransferStatus': 'PENDING'|'SUCCESS'|'ERROR',
        'VerifyDuration': 123,
        'VerifyStatus': 'PENDING'|'SUCCESS'|'ERROR',
        'ErrorCode': 'string',
        'ErrorDetail': 'string'
    }
}

Response Structure

  • (dict) --

    DescribeTaskExecutionResponse

    • TaskExecutionArn (string) --

      The Amazon Resource Name (ARN) of the task execution that was described. TaskExecutionArn is hierarchical and includes TaskArn for the task that was executed.

      For example, a TaskExecution value with the ARN arn:aws:datasync:us-east-1:111222333444:task/task-0208075f79cedf4a2/execution/exec-08ef1e88ec491019b executed the task with the ARN arn:aws:datasync:us-east-1:111222333444:task/task-0208075f79cedf4a2 .

    • Status (string) --

      The status of the task execution.

      For detailed information about task execution statuses, see Understanding Task Statuses in the AWS DataSync User Guide.

    • Options (dict) --

      Represents the options that are available to control the behavior of a StartTaskExecution operation. Behavior includes preserving metadata such as user ID (UID), group ID (GID), and file permissions, and also overwriting files in the destination, data integrity verification, and so on.

      A task has a set of default options associated with it. If you don't specify an option in StartTaskExecution, the default value is used. You can override the defaults options on each task execution by specifying an overriding Options value to StartTaskExecution.

      • VerifyMode (string) --

        A value that determines whether a data integrity verification should be performed at the end of a task execution after all data and metadata have been transferred.

        Default value: POINT_IN_TIME_CONSISTENT.

        POINT_IN_TIME_CONSISTENT: Perform verification (recommended).

        ONLY_FILES_TRANSFERRED: Perform verification on only files that were transferred.

        NONE: Skip verification.

      • OverwriteMode (string) --

        A value that determines whether files at the destination should be overwritten or preserved when copying files. If set to NEVER a destination file will not be replaced by a source file, even if the destination file differs from the source file. If you modify files in the destination and you sync the files, you can use this value to protect against overwriting those changes.

        Some storage classes have specific behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

      • Atime (string) --

        A file metadata value that shows the last time a file was accessed (that is, when the file was read or written to). If you set Atime to BEST_EFFORT, DataSync attempts to preserve the original Atime attribute on all source files (that is, the version before the PREPARING phase). However, Atime 's behavior is not fully standard across platforms, so AWS DataSync can only do this on a best-effort basis.

        Default value: BEST_EFFORT.

        BEST_EFFORT: Attempt to preserve the per-file Atime value (recommended).

        NONE: Ignore Atime .

        Note

        If Atime is set to BEST_EFFORT, Mtime must be set to PRESERVE.

        If Atime is set to NONE, Mtime must also be NONE.

      • Mtime (string) --

        A value that indicates the last time that a file was modified (that is, a file was written to) before the PREPARING phase.

        Default value: PRESERVE.

        PRESERVE: Preserve original Mtime (recommended)

        NONE: Ignore Mtime .

        Note

        If Mtime is set to PRESERVE, Atime must be set to BEST_EFFORT.

        If Mtime is set to NONE, Atime must also be set to NONE.

      • Uid (string) --

        The user ID (UID) of the file's owner.

        Default value: INT_VALUE. This preserves the integer value of the ID.

        INT_VALUE: Preserve the integer value of UID and group ID (GID) (recommended).

        NONE: Ignore UID and GID.

      • Gid (string) --

        The group ID (GID) of the file's owners.

        Default value: INT_VALUE. This preserves the integer value of the ID.

        INT_VALUE: Preserve the integer value of user ID (UID) and GID (recommended).

        NONE: Ignore UID and GID.

      • PreserveDeletedFiles (string) --

        A value that specifies whether files in the destination that don't exist in the source file system should be preserved. This option can affect your storage cost. If your task deletes objects, you might incur minimum storage duration charges for certain storage classes. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

        Default value: PRESERVE.

        PRESERVE: Ignore such destination files (recommended).

        REMOVE: Delete destination files that aren’t present in the source.

      • PreserveDevices (string) --

        A value that determines whether AWS DataSync should preserve the metadata of block and character devices in the source file system, and recreate the files with that device name and metadata on the destination.

        Note

        AWS DataSync can't sync the actual contents of such devices, because they are nonterminal and don't return an end-of-file (EOF) marker.

        Default value: NONE.

        NONE: Ignore special devices (recommended).

        PRESERVE: Preserve character and block device metadata. This option isn't currently supported for Amazon EFS.

      • PosixPermissions (string) --

        A value that determines which users or groups can access a file for a specific purpose such as reading, writing, or execution of the file.

        Default value: PRESERVE.

        PRESERVE: Preserve POSIX-style permissions (recommended).

        NONE: Ignore permissions.

        Note

        AWS DataSync can preserve extant permissions of a source location.

      • BytesPerSecond (integer) --

        A value that limits the bandwidth used by AWS DataSync. For example, if you want AWS DataSync to use a maximum of 1 MB, set this value to 1048576 ( =1024*1024 ).

    • Excludes (list) --

      A list of filter rules that determines which files to exclude from a task. The list should contain a single filter string that consists of the patterns to exclude. The patterns are delimited by "|" (that is, a pipe), for example: "/folder1|/folder2"

      • (dict) --

        Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

        • FilterType (string) --

          The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

        • Value (string) --

          A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

    • Includes (list) --

      A list of filter rules that determines which files to include when running a task. The list should contain a single filter string that consists of the patterns to include. The patterns are delimited by "|" (that is, a pipe), for example: "/folder1|/folder2"

      • (dict) --

        Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

        • FilterType (string) --

          The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

        • Value (string) --

          A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

    • StartTime (datetime) --

      The time that the task execution was started.

    • EstimatedFilesToTransfer (integer) --

      The expected number of files that is to be transferred over the network. This value is calculated during the PREPARING phase, before the TRANSFERRING phase. This value is the expected number of files to be transferred. It's calculated based on comparing the content of the source and destination locations and finding the delta that needs to be transferred.

    • EstimatedBytesToTransfer (integer) --

      The estimated physical number of bytes that is to be transferred over the network.

    • FilesTransferred (integer) --

      The actual number of files that was transferred over the network. This value is calculated and updated on an ongoing basis during the TRANSFERRING phase. It's updated periodically when each file is read from the source and sent over the network.

      If failures occur during a transfer, this value can be less than EstimatedFilesToTransfer . This value can also be greater than EstimatedFilesTransferred in some cases. This element is implementation-specific for some location types, so don't use it as an indicator for a correct file number or to monitor your task execution.

    • BytesWritten (integer) --

      The number of logical bytes written to the destination AWS storage resource.

    • BytesTransferred (integer) --

      The physical number of bytes transferred over the network.

    • Result (dict) --

      The result of the task execution.

      • PrepareDuration (integer) --

        The total time in milliseconds that AWS DataSync spent in the PREPARING phase.

      • PrepareStatus (string) --

        The status of the PREPARING phase.

      • TransferDuration (integer) --

        The total time in milliseconds that AWS DataSync spent in the TRANSFERRING phase.

      • TransferStatus (string) --

        The status of the TRANSFERRING Phase.

      • VerifyDuration (integer) --

        The total time in milliseconds that AWS DataSync spent in the VERIFYING phase.

      • VerifyStatus (string) --

        The status of the VERIFYING Phase.

      • ErrorCode (string) --

        Errors that AWS DataSync encountered during execution of the task. You can use this error code to help troubleshoot issues.

      • ErrorDetail (string) --

        Detailed description of an error that was encountered during the task execution. You can use this information to help troubleshoot issues.

StartTaskExecution (updated) Link ¶
Changes (request)
{'OverrideOptions': {'OverwriteMode': 'ALWAYS | NEVER',
                     'VerifyMode': {'ONLY_FILES_TRANSFERRED'}}}

Starts a specific invocation of a task. A TaskExecution value represents an individual run of a task. Each task can have at most one TaskExecution at a time.

TaskExecution has the following transition phases: INITIALIZING | PREPARING | TRANSFERRING | VERIFYING | SUCCESS/FAILURE.

For detailed information, see the Task Execution section in the Components and Terminology topic in the AWS DataSync User Guide .

See also: AWS API Documentation

Request Syntax

client.start_task_execution(
    TaskArn='string',
    OverrideOptions={
        'VerifyMode': 'POINT_IN_TIME_CONSISTENT'|'ONLY_FILES_TRANSFERRED'|'NONE',
        'OverwriteMode': 'ALWAYS'|'NEVER',
        'Atime': 'NONE'|'BEST_EFFORT',
        'Mtime': 'NONE'|'PRESERVE',
        'Uid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'Gid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'PreserveDeletedFiles': 'PRESERVE'|'REMOVE',
        'PreserveDevices': 'NONE'|'PRESERVE',
        'PosixPermissions': 'NONE'|'BEST_EFFORT'|'PRESERVE',
        'BytesPerSecond': 123
    },
    Includes=[
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ]
)
type TaskArn

string

param TaskArn

[REQUIRED]

The Amazon Resource Name (ARN) of the task to start.

type OverrideOptions

dict

param OverrideOptions

Represents the options that are available to control the behavior of a StartTaskExecution operation. Behavior includes preserving metadata such as user ID (UID), group ID (GID), and file permissions, and also overwriting files in the destination, data integrity verification, and so on.

A task has a set of default options associated with it. If you don't specify an option in StartTaskExecution, the default value is used. You can override the defaults options on each task execution by specifying an overriding Options value to StartTaskExecution.

  • VerifyMode (string) --

    A value that determines whether a data integrity verification should be performed at the end of a task execution after all data and metadata have been transferred.

    Default value: POINT_IN_TIME_CONSISTENT.

    POINT_IN_TIME_CONSISTENT: Perform verification (recommended).

    ONLY_FILES_TRANSFERRED: Perform verification on only files that were transferred.

    NONE: Skip verification.

  • OverwriteMode (string) --

    A value that determines whether files at the destination should be overwritten or preserved when copying files. If set to NEVER a destination file will not be replaced by a source file, even if the destination file differs from the source file. If you modify files in the destination and you sync the files, you can use this value to protect against overwriting those changes.

    Some storage classes have specific behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

  • Atime (string) --

    A file metadata value that shows the last time a file was accessed (that is, when the file was read or written to). If you set Atime to BEST_EFFORT, DataSync attempts to preserve the original Atime attribute on all source files (that is, the version before the PREPARING phase). However, Atime 's behavior is not fully standard across platforms, so AWS DataSync can only do this on a best-effort basis.

    Default value: BEST_EFFORT.

    BEST_EFFORT: Attempt to preserve the per-file Atime value (recommended).

    NONE: Ignore Atime .

    Note

    If Atime is set to BEST_EFFORT, Mtime must be set to PRESERVE.

    If Atime is set to NONE, Mtime must also be NONE.

  • Mtime (string) --

    A value that indicates the last time that a file was modified (that is, a file was written to) before the PREPARING phase.

    Default value: PRESERVE.

    PRESERVE: Preserve original Mtime (recommended)

    NONE: Ignore Mtime .

    Note

    If Mtime is set to PRESERVE, Atime must be set to BEST_EFFORT.

    If Mtime is set to NONE, Atime must also be set to NONE.

  • Uid (string) --

    The user ID (UID) of the file's owner.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of UID and group ID (GID) (recommended).

    NONE: Ignore UID and GID.

  • Gid (string) --

    The group ID (GID) of the file's owners.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of user ID (UID) and GID (recommended).

    NONE: Ignore UID and GID.

  • PreserveDeletedFiles (string) --

    A value that specifies whether files in the destination that don't exist in the source file system should be preserved. This option can affect your storage cost. If your task deletes objects, you might incur minimum storage duration charges for certain storage classes. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

    Default value: PRESERVE.

    PRESERVE: Ignore such destination files (recommended).

    REMOVE: Delete destination files that aren’t present in the source.

  • PreserveDevices (string) --

    A value that determines whether AWS DataSync should preserve the metadata of block and character devices in the source file system, and recreate the files with that device name and metadata on the destination.

    Note

    AWS DataSync can't sync the actual contents of such devices, because they are nonterminal and don't return an end-of-file (EOF) marker.

    Default value: NONE.

    NONE: Ignore special devices (recommended).

    PRESERVE: Preserve character and block device metadata. This option isn't currently supported for Amazon EFS.

  • PosixPermissions (string) --

    A value that determines which users or groups can access a file for a specific purpose such as reading, writing, or execution of the file.

    Default value: PRESERVE.

    PRESERVE: Preserve POSIX-style permissions (recommended).

    NONE: Ignore permissions.

    Note

    AWS DataSync can preserve extant permissions of a source location.

  • BytesPerSecond (integer) --

    A value that limits the bandwidth used by AWS DataSync. For example, if you want AWS DataSync to use a maximum of 1 MB, set this value to 1048576 ( =1024*1024 ).

type Includes

list

param Includes

A list of filter rules that determines which files to include when running a task. The pattern should contain a single filter string that consists of the patterns to include. The patterns are delimited by "|" (that is, a pipe). For example: "/folder1|/folder2"

  • (dict) --

    Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

    • FilterType (string) --

      The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

    • Value (string) --

      A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

rtype

dict

returns

Response Syntax

{
    'TaskExecutionArn': 'string'
}

Response Structure

  • (dict) --

    StartTaskExecutionResponse

    • TaskExecutionArn (string) --

      The Amazon Resource Name (ARN) of the specific task execution that was started.

UpdateTask (updated) Link ¶
Changes (request)
{'Options': {'OverwriteMode': 'ALWAYS | NEVER',
             'VerifyMode': {'ONLY_FILES_TRANSFERRED'}}}

Updates the metadata associated with a task.

See also: AWS API Documentation

Request Syntax

client.update_task(
    TaskArn='string',
    Options={
        'VerifyMode': 'POINT_IN_TIME_CONSISTENT'|'ONLY_FILES_TRANSFERRED'|'NONE',
        'OverwriteMode': 'ALWAYS'|'NEVER',
        'Atime': 'NONE'|'BEST_EFFORT',
        'Mtime': 'NONE'|'PRESERVE',
        'Uid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'Gid': 'NONE'|'INT_VALUE'|'NAME'|'BOTH',
        'PreserveDeletedFiles': 'PRESERVE'|'REMOVE',
        'PreserveDevices': 'NONE'|'PRESERVE',
        'PosixPermissions': 'NONE'|'BEST_EFFORT'|'PRESERVE',
        'BytesPerSecond': 123
    },
    Excludes=[
        {
            'FilterType': 'SIMPLE_PATTERN',
            'Value': 'string'
        },
    ],
    Name='string',
    CloudWatchLogGroupArn='string'
)
type TaskArn

string

param TaskArn

[REQUIRED]

The Amazon Resource Name (ARN) of the resource name of the task to update.

type Options

dict

param Options

Represents the options that are available to control the behavior of a StartTaskExecution operation. Behavior includes preserving metadata such as user ID (UID), group ID (GID), and file permissions, and also overwriting files in the destination, data integrity verification, and so on.

A task has a set of default options associated with it. If you don't specify an option in StartTaskExecution, the default value is used. You can override the defaults options on each task execution by specifying an overriding Options value to StartTaskExecution.

  • VerifyMode (string) --

    A value that determines whether a data integrity verification should be performed at the end of a task execution after all data and metadata have been transferred.

    Default value: POINT_IN_TIME_CONSISTENT.

    POINT_IN_TIME_CONSISTENT: Perform verification (recommended).

    ONLY_FILES_TRANSFERRED: Perform verification on only files that were transferred.

    NONE: Skip verification.

  • OverwriteMode (string) --

    A value that determines whether files at the destination should be overwritten or preserved when copying files. If set to NEVER a destination file will not be replaced by a source file, even if the destination file differs from the source file. If you modify files in the destination and you sync the files, you can use this value to protect against overwriting those changes.

    Some storage classes have specific behaviors that can affect your S3 storage cost. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

  • Atime (string) --

    A file metadata value that shows the last time a file was accessed (that is, when the file was read or written to). If you set Atime to BEST_EFFORT, DataSync attempts to preserve the original Atime attribute on all source files (that is, the version before the PREPARING phase). However, Atime 's behavior is not fully standard across platforms, so AWS DataSync can only do this on a best-effort basis.

    Default value: BEST_EFFORT.

    BEST_EFFORT: Attempt to preserve the per-file Atime value (recommended).

    NONE: Ignore Atime .

    Note

    If Atime is set to BEST_EFFORT, Mtime must be set to PRESERVE.

    If Atime is set to NONE, Mtime must also be NONE.

  • Mtime (string) --

    A value that indicates the last time that a file was modified (that is, a file was written to) before the PREPARING phase.

    Default value: PRESERVE.

    PRESERVE: Preserve original Mtime (recommended)

    NONE: Ignore Mtime .

    Note

    If Mtime is set to PRESERVE, Atime must be set to BEST_EFFORT.

    If Mtime is set to NONE, Atime must also be set to NONE.

  • Uid (string) --

    The user ID (UID) of the file's owner.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of UID and group ID (GID) (recommended).

    NONE: Ignore UID and GID.

  • Gid (string) --

    The group ID (GID) of the file's owners.

    Default value: INT_VALUE. This preserves the integer value of the ID.

    INT_VALUE: Preserve the integer value of user ID (UID) and GID (recommended).

    NONE: Ignore UID and GID.

  • PreserveDeletedFiles (string) --

    A value that specifies whether files in the destination that don't exist in the source file system should be preserved. This option can affect your storage cost. If your task deletes objects, you might incur minimum storage duration charges for certain storage classes. For detailed information, see using-storage-classes in the AWS DataSync User Guide .

    Default value: PRESERVE.

    PRESERVE: Ignore such destination files (recommended).

    REMOVE: Delete destination files that aren’t present in the source.

  • PreserveDevices (string) --

    A value that determines whether AWS DataSync should preserve the metadata of block and character devices in the source file system, and recreate the files with that device name and metadata on the destination.

    Note

    AWS DataSync can't sync the actual contents of such devices, because they are nonterminal and don't return an end-of-file (EOF) marker.

    Default value: NONE.

    NONE: Ignore special devices (recommended).

    PRESERVE: Preserve character and block device metadata. This option isn't currently supported for Amazon EFS.

  • PosixPermissions (string) --

    A value that determines which users or groups can access a file for a specific purpose such as reading, writing, or execution of the file.

    Default value: PRESERVE.

    PRESERVE: Preserve POSIX-style permissions (recommended).

    NONE: Ignore permissions.

    Note

    AWS DataSync can preserve extant permissions of a source location.

  • BytesPerSecond (integer) --

    A value that limits the bandwidth used by AWS DataSync. For example, if you want AWS DataSync to use a maximum of 1 MB, set this value to 1048576 ( =1024*1024 ).

type Excludes

list

param Excludes

A list of filter rules that determines which files to exclude from a task. The list should contain a single filter string that consists of the patterns to exclude. The patterns are delimited by "|" (that is, a pipe), for example: "/folder1|/folder2"

  • (dict) --

    Specifies which files, folders and objects to include or exclude when transferring files from source to destination.

    • FilterType (string) --

      The type of filter rule to apply. AWS DataSync only supports the SIMPLE_PATTERN rule type.

    • Value (string) --

      A single filter string that consists of the patterns to include or exclude. The patterns are delimited by "|" (that is, a pipe), for example: /folder1|/folder2

type Name

string

param Name

The name of the task to update.

type CloudWatchLogGroupArn

string

param CloudWatchLogGroupArn

The Amazon Resource Name (ARN) of the resource name of the CloudWatch LogGroup.

rtype

dict

returns

Response Syntax

{}

Response Structure

  • (dict) --