Amazon Omics

2023/11/09 - Amazon Omics - 6 updated api methods

Changes  Support the UBAM file type for Omics Storage and make referenceArn optional.

CreateMultipartReadSetUpload (updated)
Changes (both)
{'sourceFileType': {'UBAM'}}

Begins a multipart read set upload.

See also: AWS API Documentation

Request Syntax

client.create_multipart_read_set_upload(
    sequenceStoreId='string',
    clientToken='string',
    sourceFileType='FASTQ'|'BAM'|'CRAM'|'UBAM',
    subjectId='string',
    sampleId='string',
    generatedFrom='string',
    referenceArn='string',
    name='string',
    description='string',
    tags={
        'string': 'string'
    }
)
type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The sequence store ID for the store that is the destination of the multipart uploads.

type clientToken:

string

param clientToken:

An idempotency token that can be used to avoid triggering multiple multipart uploads.

type sourceFileType:

string

param sourceFileType:

[REQUIRED]

The type of file being uploaded.

type subjectId:

string

param subjectId:

[REQUIRED]

The source's subject ID.

type sampleId:

string

param sampleId:

[REQUIRED]

The source's sample ID.

type generatedFrom:

string

param generatedFrom:

Where the source originated.

type referenceArn:

string

param referenceArn:

The ARN of the reference.

type name:

string

param name:

[REQUIRED]

The name of the read set.

type description:

string

param description:

The description of the read set.

type tags:

dict

param tags:

Any tags to add to the read set.

  • (string) --

    • (string) --

rtype:

dict

returns:

Response Syntax

{
    'sequenceStoreId': 'string',
    'uploadId': 'string',
    'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
    'subjectId': 'string',
    'sampleId': 'string',
    'generatedFrom': 'string',
    'referenceArn': 'string',
    'name': 'string',
    'description': 'string',
    'tags': {
        'string': 'string'
    },
    'creationTime': datetime(2015, 1, 1)
}

Response Structure

  • (dict) --

    • sequenceStoreId (string) --

      The sequence store ID for the store that the read set will be created in.

    • uploadId (string) --

      The ID for the initiated multipart upload.

    • sourceFileType (string) --

      The file type of the read set source.

    • subjectId (string) --

      The source's subject ID.

    • sampleId (string) --

      The source's sample ID.

    • generatedFrom (string) --

      The source of the read set.

    • referenceArn (string) --

      The read set source's reference ARN.

    • name (string) --

      The name of the read set.

    • description (string) --

      The description of the read set.

    • tags (dict) --

      The tags to add to the read set.

      • (string) --

        • (string) --

    • creationTime (datetime) --

      The creation time of the multipart upload.
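
As a minimal sketch of the change described above, the helper below assembles keyword arguments for create_multipart_read_set_upload and leaves out referenceArn when none is supplied. The helper name build_upload_request and all sample values are hypothetical, and the idea that a UBAM upload is the typical case for omitting the reference is an assumption, not something this page states.

```python
from typing import Any, Dict, Optional

# File types listed in the request syntax above.
SOURCE_FILE_TYPES = {"FASTQ", "BAM", "CRAM", "UBAM"}


def build_upload_request(
    sequence_store_id: str,
    source_file_type: str,
    subject_id: str,
    sample_id: str,
    name: str,
    reference_arn: Optional[str] = None,
) -> Dict[str, Any]:
    """Build kwargs for create_multipart_read_set_upload (hypothetical helper)."""
    if source_file_type not in SOURCE_FILE_TYPES:
        raise ValueError(f"unsupported sourceFileType: {source_file_type}")
    request: Dict[str, Any] = {
        "sequenceStoreId": sequence_store_id,
        "sourceFileType": source_file_type,
        "subjectId": subject_id,
        "sampleId": sample_id,
        "name": name,
    }
    # referenceArn carries no [REQUIRED] marker, so pass it only when set.
    if reference_arn is not None:
        request["referenceArn"] = reference_arn
    return request


# Placeholder identifiers, for illustration only.
kwargs = build_upload_request("1234567890", "UBAM", "subject-1", "sample-1", "run-42")
# With a real client: client.create_multipart_read_set_upload(**kwargs)
```

Keeping the optional key out of the dictionary, rather than passing None, avoids sending a value that client-side parameter validation would likely reject.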

GetReadSetImportJob (updated)
Changes (response)
{'sources': {'sourceFileType': {'UBAM'}}}

Gets information about a read set import job.

See also: AWS API Documentation

Request Syntax

client.get_read_set_import_job(
    id='string',
    sequenceStoreId='string'
)
type id:

string

param id:

[REQUIRED]

The job's ID.

type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The job's sequence store ID.

rtype:

dict

returns:

Response Syntax

{
    'id': 'string',
    'sequenceStoreId': 'string',
    'roleArn': 'string',
    'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELLING'|'CANCELLED'|'FAILED'|'COMPLETED'|'COMPLETED_WITH_FAILURES',
    'statusMessage': 'string',
    'creationTime': datetime(2015, 1, 1),
    'completionTime': datetime(2015, 1, 1),
    'sources': [
        {
            'sourceFiles': {
                'source1': 'string',
                'source2': 'string'
            },
            'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
            'status': 'NOT_STARTED'|'IN_PROGRESS'|'FINISHED'|'FAILED',
            'statusMessage': 'string',
            'subjectId': 'string',
            'sampleId': 'string',
            'generatedFrom': 'string',
            'referenceArn': 'string',
            'name': 'string',
            'description': 'string',
            'tags': {
                'string': 'string'
            }
        },
    ]
}

Response Structure

  • (dict) --

    • id (string) --

      The job's ID.

    • sequenceStoreId (string) --

      The job's sequence store ID.

    • roleArn (string) --

      The job's service role ARN.

    • status (string) --

      The job's status.

    • statusMessage (string) --

      The job's status message.

    • creationTime (datetime) --

      When the job was created.

    • completionTime (datetime) --

      When the job completed.

    • sources (list) --

      The job's source files.

      • (dict) --

        A source for an import read set job.

        • sourceFiles (dict) --

          The source files' location in Amazon S3.

          • source1 (string) --

            The location of the first file in Amazon S3.

          • source2 (string) --

            The location of the second file in Amazon S3.

        • sourceFileType (string) --

          The source's file type.

        • status (string) --

          The source's status.

        • statusMessage (string) --

          The source's status message.

        • subjectId (string) --

          The source's subject ID.

        • sampleId (string) --

          The source's sample ID.

        • generatedFrom (string) --

          Where the source originated.

        • referenceArn (string) --

          The source's genome reference ARN.

        • name (string) --

          The source's name.

        • description (string) --

          The source's description.

        • tags (dict) --

          The source's tags.

          • (string) --

            • (string) --
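
The response structure above can be summarized per source. The sketch below counts sources by file type and status; the function name and the sample dictionary are hypothetical, and the sample is hand-written in the documented shape rather than real service output.

```python
from collections import Counter
from typing import Any, Dict


def count_sources_by_type_and_status(job: Dict[str, Any]) -> Counter:
    """Count sources in a GetReadSetImportJob response by (sourceFileType, status)."""
    return Counter(
        (source["sourceFileType"], source["status"])
        for source in job.get("sources", [])
    )


# Hand-written fragment in the documented response shape (not real output).
sample_job = {
    "id": "job-1",
    "status": "IN_PROGRESS",
    "sources": [
        {"sourceFileType": "UBAM", "status": "FINISHED"},
        {"sourceFileType": "UBAM", "status": "IN_PROGRESS"},
        {"sourceFileType": "FASTQ", "status": "FINISHED"},
    ],
}
summary = count_sources_by_type_and_status(sample_job)
# With a real client, pass the result of client.get_read_set_import_job(...) instead.
```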

GetReadSetMetadata (updated)
Changes (response)
{'fileType': {'UBAM'}}

Gets details about a read set.

See also: AWS API Documentation

Request Syntax

client.get_read_set_metadata(
    id='string',
    sequenceStoreId='string'
)
type id:

string

param id:

[REQUIRED]

The read set's ID.

type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The read set's sequence store ID.

rtype:

dict

returns:

Response Syntax

{
    'id': 'string',
    'arn': 'string',
    'sequenceStoreId': 'string',
    'subjectId': 'string',
    'sampleId': 'string',
    'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
    'name': 'string',
    'description': 'string',
    'fileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
    'creationTime': datetime(2015, 1, 1),
    'sequenceInformation': {
        'totalReadCount': 123,
        'totalBaseCount': 123,
        'generatedFrom': 'string',
        'alignment': 'string'
    },
    'referenceArn': 'string',
    'files': {
        'source1': {
            'totalParts': 123,
            'partSize': 123,
            'contentLength': 123
        },
        'source2': {
            'totalParts': 123,
            'partSize': 123,
            'contentLength': 123
        },
        'index': {
            'totalParts': 123,
            'partSize': 123,
            'contentLength': 123
        }
    },
    'statusMessage': 'string',
    'creationType': 'IMPORT'|'UPLOAD',
    'etag': {
        'algorithm': 'FASTQ_MD5up'|'BAM_MD5up'|'CRAM_MD5up',
        'source1': 'string',
        'source2': 'string'
    }
}

Response Structure

  • (dict) --

    • id (string) --

      The read set's ID.

    • arn (string) --

      The read set's ARN.

    • sequenceStoreId (string) --

      The read set's sequence store ID.

    • subjectId (string) --

      The read set's subject ID.

    • sampleId (string) --

      The read set's sample ID.

    • status (string) --

      The read set's status.

    • name (string) --

      The read set's name.

    • description (string) --

      The read set's description.

    • fileType (string) --

      The read set's file type.

    • creationTime (datetime) --

      When the read set was created.

    • sequenceInformation (dict) --

      The read set's sequence information.

      • totalReadCount (integer) --

        The sequence's total read count.

      • totalBaseCount (integer) --

        The sequence's total base count.

      • generatedFrom (string) --

        Where the sequence originated.

      • alignment (string) --

        The sequence's alignment setting.

    • referenceArn (string) --

      The read set's genome reference ARN.

    • files (dict) --

      The read set's files.

      • source1 (dict) --

        The location of the first file in Amazon S3.

        • totalParts (integer) --

          The file's total parts.

        • partSize (integer) --

          The file's part size.

        • contentLength (integer) --

          The file's content length.

      • source2 (dict) --

        The location of the second file in Amazon S3.

        • totalParts (integer) --

          The file's total parts.

        • partSize (integer) --

          The file's part size.

        • contentLength (integer) --

          The file's content length.

      • index (dict) --

        The files' index.

        • totalParts (integer) --

          The file's total parts.

        • partSize (integer) --

          The file's part size.

        • contentLength (integer) --

          The file's content length.

    • statusMessage (string) --

      The status message for a read set. It provides more detail as to why the read set has a status.

    • creationType (string) --

      The creation type of the read set.

    • etag (dict) --

      The entity tag (ETag) is a hash of the object meant to represent its semantic content.

      • algorithm (string) --

        The algorithm used to calculate the read set’s ETag(s).

      • source1 (string) --

        The ETag hash calculated on Source1 of the read set.

      • source2 (string) --

        The ETag hash calculated on Source2 of the read set.
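
A caller reading this response may want to tolerate a missing referenceArn now that the reference is optional. The sketch below is one way to do that; the function name is hypothetical, the sample dictionary is hand-written, and the assumption that the key is simply absent for a read set without a reference is ours.

```python
from typing import Any, Dict


def describe_read_set(metadata: Dict[str, Any]) -> str:
    """Return a one-line summary of a GetReadSetMetadata response."""
    # Assumption: referenceArn is absent (or empty) when no reference was given.
    reference = metadata.get("referenceArn") or "no reference"
    files = metadata.get("files", {})
    # Sum contentLength over whichever of source1/source2/index are present.
    total_bytes = sum(part.get("contentLength", 0) for part in files.values())
    return f"{metadata['id']}: {metadata['fileType']}, {reference}, {total_bytes} bytes"


# Hand-written fragment in the documented response shape (not real output).
sample_metadata = {
    "id": "rs-1",
    "fileType": "UBAM",
    "files": {"source1": {"totalParts": 1, "partSize": 100, "contentLength": 100}},
}
line = describe_read_set(sample_metadata)
```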

ListMultipartReadSetUploads (updated)
Changes (response)
{'uploads': {'sourceFileType': {'UBAM'}}}

Lists all multipart read set uploads and their statuses.

See also: AWS API Documentation

Request Syntax

client.list_multipart_read_set_uploads(
    sequenceStoreId='string',
    maxResults=123,
    nextToken='string'
)
type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The Sequence Store ID used for the multipart uploads.

type maxResults:

integer

param maxResults:

The maximum number of multipart uploads returned in a page.

type nextToken:

string

param nextToken:

Next token returned in the response of a previous ListMultipartReadSetUploads call. Used to get the next page of results.

rtype:

dict

returns:

Response Syntax

{
    'nextToken': 'string',
    'uploads': [
        {
            'sequenceStoreId': 'string',
            'uploadId': 'string',
            'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
            'subjectId': 'string',
            'sampleId': 'string',
            'generatedFrom': 'string',
            'referenceArn': 'string',
            'name': 'string',
            'description': 'string',
            'tags': {
                'string': 'string'
            },
            'creationTime': datetime(2015, 1, 1)
        },
    ]
}

Response Structure

  • (dict) --

    • nextToken (string) --

      Next token returned in the response of a previous ListMultipartReadSetUploads call. Used to get the next page of results.

    • uploads (list) --

      An array of multipart uploads.

      • (dict) --

        Part of the response to ListMultipartReadSetUploads, excluding completed and aborted multipart uploads.

        • sequenceStoreId (string) --

          The sequence store ID used for the multipart upload.

        • uploadId (string) --

          The ID for the initiated multipart upload.

        • sourceFileType (string) --

          The type of file the read set originated from.

        • subjectId (string) --

          The read set source's subject ID.

        • sampleId (string) --

          The read set source's sample ID.

        • generatedFrom (string) --

          The source of an uploaded part.

        • referenceArn (string) --

          The source's reference ARN.

        • name (string) --

          The name of a read set.

        • description (string) --

          The description of a read set.

        • tags (dict) --

          Any tags you wish to add to a read set.

          • (string) --

            • (string) --

        • creationTime (datetime) --

          The time stamp for when a direct upload was created.
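
The nextToken handshake described above can be sketched as a generator that keeps requesting pages until no token comes back. The function name and the _FakeClient stand-in are hypothetical; the stand-in exists only so the sketch runs without AWS credentials and is not part of any SDK.

```python
from typing import Any, Dict, Iterator, List


def iter_multipart_uploads(client: Any, sequence_store_id: str) -> Iterator[Dict[str, Any]]:
    """Yield every upload, following nextToken until none is returned."""
    kwargs: Dict[str, Any] = {"sequenceStoreId": sequence_store_id}
    while True:
        page = client.list_multipart_read_set_uploads(**kwargs)
        yield from page.get("uploads", [])
        token = page.get("nextToken")
        if not token:
            break
        kwargs["nextToken"] = token


class _FakeClient:
    """Stand-in for an Omics client; serves canned pages keyed by token."""

    def __init__(self, pages: List[Dict[str, Any]]) -> None:
        self._pages = pages
        self.calls: List[Dict[str, Any]] = []

    def list_multipart_read_set_uploads(self, **kwargs: Any) -> Dict[str, Any]:
        self.calls.append(dict(kwargs))
        return self._pages[int(kwargs.get("nextToken", "0"))]


fake = _FakeClient([
    {"uploads": [{"uploadId": "u1", "sourceFileType": "UBAM"}], "nextToken": "1"},
    {"uploads": [{"uploadId": "u2", "sourceFileType": "BAM"}]},
])
upload_ids = [u["uploadId"] for u in iter_multipart_uploads(fake, "1234567890")]
```

With a real client, the same function would be called with the object returned by boto3.client("omics").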

ListReadSets (updated)
Changes (response)
{'readSets': {'fileType': {'UBAM'}}}

Retrieves a list of read sets.

See also: AWS API Documentation

Request Syntax

client.list_read_sets(
    sequenceStoreId='string',
    maxResults=123,
    nextToken='string',
    filter={
        'name': 'string',
        'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
        'referenceArn': 'string',
        'createdAfter': datetime(2015, 1, 1),
        'createdBefore': datetime(2015, 1, 1),
        'sampleId': 'string',
        'subjectId': 'string',
        'generatedFrom': 'string',
        'creationType': 'IMPORT'|'UPLOAD'
    }
)
type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The read sets' sequence store ID.

type maxResults:

integer

param maxResults:

The maximum number of read sets to return in one page of results.

type nextToken:

string

param nextToken:

Specify the pagination token from a previous request to retrieve the next page of results.

type filter:

dict

param filter:

A filter to apply to the list.

  • name (string) --

    A name to filter on.

  • status (string) --

    A status to filter on.

  • referenceArn (string) --

    A genome reference ARN to filter on.

  • createdAfter (datetime) --

    The filter's start date.

  • createdBefore (datetime) --

    The filter's end date.

  • sampleId (string) --

    The read set source's sample ID.

  • subjectId (string) --

    The read set source's subject ID.

  • generatedFrom (string) --

    Where the source originated.

  • creationType (string) --

    The creation type of the read set.

rtype:

dict

returns:

Response Syntax

{
    'nextToken': 'string',
    'readSets': [
        {
            'id': 'string',
            'arn': 'string',
            'sequenceStoreId': 'string',
            'subjectId': 'string',
            'sampleId': 'string',
            'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
            'name': 'string',
            'description': 'string',
            'referenceArn': 'string',
            'fileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
            'sequenceInformation': {
                'totalReadCount': 123,
                'totalBaseCount': 123,
                'generatedFrom': 'string',
                'alignment': 'string'
            },
            'creationTime': datetime(2015, 1, 1),
            'statusMessage': 'string',
            'creationType': 'IMPORT'|'UPLOAD',
            'etag': {
                'algorithm': 'FASTQ_MD5up'|'BAM_MD5up'|'CRAM_MD5up',
                'source1': 'string',
                'source2': 'string'
            }
        },
    ]
}

Response Structure

  • (dict) --

    • nextToken (string) --

      A pagination token that's included if more results are available.

    • readSets (list) --

      A list of read sets.

      • (dict) --

        A read set.

        • id (string) --

          The read set's ID.

        • arn (string) --

          The read set's ARN.

        • sequenceStoreId (string) --

          The read set's sequence store ID.

        • subjectId (string) --

          The read set's subject ID.

        • sampleId (string) --

          The read set's sample ID.

        • status (string) --

          The read set's status.

        • name (string) --

          The read set's name.

        • description (string) --

          The read set's description.

        • referenceArn (string) --

          The read set's genome reference ARN.

        • fileType (string) --

          The read set's file type.

        • sequenceInformation (dict) --

          Details about a sequence.

          • totalReadCount (integer) --

            The sequence's total read count.

          • totalBaseCount (integer) --

            The sequence's total base count.

          • generatedFrom (string) --

            Where the sequence originated.

          • alignment (string) --

            The sequence's alignment setting.

        • creationTime (datetime) --

          When the read set was created.

        • statusMessage (string) --

          The status message for a read set. It provides more detail as to why the read set has a status.

        • creationType (string) --

          The creation type of the read set.

        • etag (dict) --

          The entity tag (ETag) is a hash of the object representing its semantic content.

          • algorithm (string) --

            The algorithm used to calculate the read set’s ETag(s).

          • source1 (string) --

            The ETag hash calculated on Source1 of the read set.

          • source2 (string) --

            The ETag hash calculated on Source2 of the read set.

StartReadSetImportJob (updated)
Changes (request)
{'sources': {'sourceFileType': {'UBAM'}}}

Starts a read set import job.

See also: AWS API Documentation

Request Syntax

client.start_read_set_import_job(
    sequenceStoreId='string',
    roleArn='string',
    clientToken='string',
    sources=[
        {
            'sourceFiles': {
                'source1': 'string',
                'source2': 'string'
            },
            'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
            'subjectId': 'string',
            'sampleId': 'string',
            'generatedFrom': 'string',
            'referenceArn': 'string',
            'name': 'string',
            'description': 'string',
            'tags': {
                'string': 'string'
            }
        },
    ]
)
type sequenceStoreId:

string

param sequenceStoreId:

[REQUIRED]

The read set's sequence store ID.

type roleArn:

string

param roleArn:

[REQUIRED]

A service role for the job.

type clientToken:

string

param clientToken:

To ensure that jobs don't run multiple times, specify a unique token for each job.

type sources:

list

param sources:

[REQUIRED]

The job's source files.

  • (dict) --

    A source for a read set import job.

    • sourceFiles (dict) -- [REQUIRED]

      The source files' location in Amazon S3.

      • source1 (string) -- [REQUIRED]

        The location of the first file in Amazon S3.

      • source2 (string) --

        The location of the second file in Amazon S3.

    • sourceFileType (string) -- [REQUIRED]

      The source's file type.

    • subjectId (string) -- [REQUIRED]

      The source's subject ID.

    • sampleId (string) -- [REQUIRED]

      The source's sample ID.

    • generatedFrom (string) --

      Where the source originated.

    • referenceArn (string) --

      The source's reference ARN.

    • name (string) --

      The source's name.

    • description (string) --

      The source's description.

    • tags (dict) --

      The source's tags.

      • (string) --

        • (string) --

rtype:

dict

returns:

Response Syntax

{
    'id': 'string',
    'sequenceStoreId': 'string',
    'roleArn': 'string',
    'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELLING'|'CANCELLED'|'FAILED'|'COMPLETED'|'COMPLETED_WITH_FAILURES',
    'creationTime': datetime(2015, 1, 1)
}

Response Structure

  • (dict) --

    • id (string) --

      The job's ID.

    • sequenceStoreId (string) --

      The read set's sequence store ID.

    • roleArn (string) --

      The job's service role ARN.

    • status (string) --

      The job's status.

    • creationTime (datetime) --

      When the job was created.
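
A source entry for the request above can be assembled as sketched below, including only the keys marked [REQUIRED] plus any optional ones actually supplied. The helper name, bucket, role ARN, and IDs are all hypothetical placeholders, and treating UBAM as a case that needs neither source2 nor referenceArn is an assumption.

```python
from typing import Any, Dict, Optional


def build_import_source(
    source1: str,
    source_file_type: str,
    subject_id: str,
    sample_id: str,
    source2: Optional[str] = None,
    reference_arn: Optional[str] = None,
) -> Dict[str, Any]:
    """Build one entry of the sources list (hypothetical helper)."""
    source_files = {"source1": source1}
    if source2 is not None:
        source_files["source2"] = source2
    source: Dict[str, Any] = {
        "sourceFiles": source_files,
        "sourceFileType": source_file_type,
        "subjectId": subject_id,
        "sampleId": sample_id,
    }
    # referenceArn carries no [REQUIRED] marker, so add it only when set.
    if reference_arn is not None:
        source["referenceArn"] = reference_arn
    return source


# Placeholder values, for illustration only.
ubam_source = build_import_source(
    "s3://example-bucket/sample.unaligned.bam", "UBAM", "subject-1", "sample-1"
)
request = {
    "sequenceStoreId": "1234567890",
    "roleArn": "arn:aws:iam::123456789012:role/ExampleOmicsImportRole",
    "sources": [ubam_source],
}
# With a real client: client.start_read_set_import_job(**request)
```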