AWS Comprehend Medical

2020/05/06 - AWS Comprehend Medical - 8 new api methods

Changes  New Batch Ontology APIs for ICD-10 and RxNorm will provide batch capability of linking the information extracted by Comprehend Medical to medical ontologies. The new ontology linking APIs make it easy to detect medications and medical conditions in unstructured clinical text and link them to RxNorm and ICD-10-CM codes respectively. This new feature can help you reduce the cost, time and effort of processing large amounts of unstructured medical text with high accuracy.

DescribeRxNormInferenceJob (new) Link ¶

Gets the properties associated with an InferRxNorm job. Use this operation to get the status of an inference job.

See also: AWS API Documentation

Request Syntax

client.describe_rx_norm_inference_job(
    JobId='string'
)
type JobId

string

param JobId

[REQUIRED]

The identifier that Amazon Comprehend Medical generated for the job. The StartRxNormInferenceJob operation returns this identifier in its response.

rtype

dict

returns

Response Syntax

{
    'ComprehendMedicalAsyncJobProperties': {
        'JobId': 'string',
        'JobName': 'string',
        'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
        'Message': 'string',
        'SubmitTime': datetime(2015, 1, 1),
        'EndTime': datetime(2015, 1, 1),
        'ExpirationTime': datetime(2015, 1, 1),
        'InputDataConfig': {
            'S3Bucket': 'string',
            'S3Key': 'string'
        },
        'OutputDataConfig': {
            'S3Bucket': 'string',
            'S3Key': 'string'
        },
        'LanguageCode': 'en',
        'DataAccessRoleArn': 'string',
        'ManifestFilePath': 'string',
        'KMSKey': 'string',
        'ModelVersion': 'string'
    }
}

Response Structure

  • (dict) --

    • ComprehendMedicalAsyncJobProperties (dict) --

      An object that contains the properties associated with a detection job.

      • JobId (string) --

        The identifier assigned to the detection job.

      • JobName (string) --

        The name that you assigned to the detection job.

      • JobStatus (string) --

        The current status of the detection job. If the status is FAILED , the Message field shows the reason for the failure.

      • Message (string) --

        A description of the status of a job.

      • SubmitTime (datetime) --

        The time that the detection job was submitted for processing.

      • EndTime (datetime) --

        The time that the detection job completed.

      • ExpirationTime (datetime) --

        The date and time that job metadata is deleted from the server. Output files in your S3 bucket will not be deleted. After the metadata is deleted, the job will no longer appear in the results of the ListEntitiesDetectionV2Job or the ListPHIDetectionJobs operation.

      • InputDataConfig (dict) --

        The input data configuration that you supplied when you created the detection job.

        • S3Bucket (string) --

          The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

          Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

        • S3Key (string) --

          The path to the input data files in the S3 bucket.

      • OutputDataConfig (dict) --

        The output data configuration that you supplied when you created the detection job.

        • S3Bucket (string) --

          When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

        • S3Key (string) --

          The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

      • LanguageCode (string) --

        The language code of the input documents.

      • DataAccessRoleArn (string) --

        The Amazon Resource Name (ARN) that gives Amazon Comprehend Medical read access to your input data.

      • ManifestFilePath (string) --

        The path to the file that describes the results of a batch job.

      • KMSKey (string) --

        The AWS Key Management Service key, if any, used to encrypt the output files.

      • ModelVersion (string) --

        The version of the model used to analyze the documents. The version number looks like X.X.X. You can use this information to track the model used for a particular batch of documents.

ListRxNormInferenceJobs (new) Link ¶

Gets a list of InferRxNorm jobs that you have submitted.

See also: AWS API Documentation

Request Syntax

client.list_rx_norm_inference_jobs(
    Filter={
        'JobName': 'string',
        'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
        'SubmitTimeBefore': datetime(2015, 1, 1),
        'SubmitTimeAfter': datetime(2015, 1, 1)
    },
    NextToken='string',
    MaxResults=123
)
type Filter

dict

param Filter

Filters the jobs that are returned. You can filter jobs based on their names, status, or the date and time that they were submitted. You can only set one filter at a time.

  • JobName (string) --

    Filters on the name of the job.

  • JobStatus (string) --

    Filters the list of jobs based on job status. Returns only jobs with the specified status.

  • SubmitTimeBefore (datetime) --

    Filters the list of jobs based on the time that the job was submitted for processing. Returns only jobs submitted before the specified time. Jobs are returned in ascending order, oldest to newest.

  • SubmitTimeAfter (datetime) --

    Filters the list of jobs based on the time that the job was submitted for processing. Returns only jobs submitted after the specified time. Jobs are returned in descending order, newest to oldest.

type NextToken

string

param NextToken

Identifies the next page of results to return.

type MaxResults

integer

param MaxResults

Identifies the next page of results to return.

rtype

dict

returns

Response Syntax

{
    'ComprehendMedicalAsyncJobPropertiesList': [
        {
            'JobId': 'string',
            'JobName': 'string',
            'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
            'Message': 'string',
            'SubmitTime': datetime(2015, 1, 1),
            'EndTime': datetime(2015, 1, 1),
            'ExpirationTime': datetime(2015, 1, 1),
            'InputDataConfig': {
                'S3Bucket': 'string',
                'S3Key': 'string'
            },
            'OutputDataConfig': {
                'S3Bucket': 'string',
                'S3Key': 'string'
            },
            'LanguageCode': 'en',
            'DataAccessRoleArn': 'string',
            'ManifestFilePath': 'string',
            'KMSKey': 'string',
            'ModelVersion': 'string'
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • ComprehendMedicalAsyncJobPropertiesList (list) --

      The maximum number of results to return in each page. The default is 100.

      • (dict) --

        Provides information about a detection job.

        • JobId (string) --

          The identifier assigned to the detection job.

        • JobName (string) --

          The name that you assigned to the detection job.

        • JobStatus (string) --

          The current status of the detection job. If the status is FAILED , the Message field shows the reason for the failure.

        • Message (string) --

          A description of the status of a job.

        • SubmitTime (datetime) --

          The time that the detection job was submitted for processing.

        • EndTime (datetime) --

          The time that the detection job completed.

        • ExpirationTime (datetime) --

          The date and time that job metadata is deleted from the server. Output files in your S3 bucket will not be deleted. After the metadata is deleted, the job will no longer appear in the results of the ListEntitiesDetectionV2Job or the ListPHIDetectionJobs operation.

        • InputDataConfig (dict) --

          The input data configuration that you supplied when you created the detection job.

          • S3Bucket (string) --

            The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

            Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

          • S3Key (string) --

            The path to the input data files in the S3 bucket.

        • OutputDataConfig (dict) --

          The output data configuration that you supplied when you created the detection job.

          • S3Bucket (string) --

            When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

          • S3Key (string) --

            The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

        • LanguageCode (string) --

          The language code of the input documents.

        • DataAccessRoleArn (string) --

          The Amazon Resource Name (ARN) that gives Amazon Comprehend Medical read access to your input data.

        • ManifestFilePath (string) --

          The path to the file that describes the results of a batch job.

        • KMSKey (string) --

          The AWS Key Management Service key, if any, used to encrypt the output files.

        • ModelVersion (string) --

          The version of the model used to analyze the documents. The version number looks like X.X.X. You can use this information to track the model used for a particular batch of documents.

    • NextToken (string) --

      Identifies the next page of results to return.

StopICD10CMInferenceJob (new) Link ¶

Stops an InferICD10CM inference job in progress.

See also: AWS API Documentation

Request Syntax

client.stop_icd10_cm_inference_job(
    JobId='string'
)
type JobId

string

param JobId

[REQUIRED]

The identifier of the job.

rtype

dict

returns

Response Syntax

{
    'JobId': 'string'
}

Response Structure

  • (dict) --

    • JobId (string) --

      The identifier generated for the job. To get the status of job, use this identifier with the DescribeICD10CMInferenceJob operation.

StartICD10CMInferenceJob (new) Link ¶

Starts an asynchronous job to detect medical conditions and link them to the ICD-10-CM ontology. Use the DescribeICD10CMInferenceJob operation to track the status of a job.

See also: AWS API Documentation

Request Syntax

client.start_icd10_cm_inference_job(
    InputDataConfig={
        'S3Bucket': 'string',
        'S3Key': 'string'
    },
    OutputDataConfig={
        'S3Bucket': 'string',
        'S3Key': 'string'
    },
    DataAccessRoleArn='string',
    JobName='string',
    ClientRequestToken='string',
    KMSKey='string',
    LanguageCode='en'
)
type InputDataConfig

dict

param InputDataConfig

[REQUIRED]

Specifies the format and location of the input data for the job.

  • S3Bucket (string) -- [REQUIRED]

    The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

    Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

  • S3Key (string) --

    The path to the input data files in the S3 bucket.

type OutputDataConfig

dict

param OutputDataConfig

[REQUIRED]

Specifies where to send the output files.

  • S3Bucket (string) -- [REQUIRED]

    When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

  • S3Key (string) --

    The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

type DataAccessRoleArn

string

param DataAccessRoleArn

[REQUIRED]

The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role that grants Amazon Comprehend Medical read access to your input data. For more information, see Role-Based Permissions Required for Asynchronous Operations.

type JobName

string

param JobName

The identifier of the job.

type ClientRequestToken

string

param ClientRequestToken

A unique identifier for the request. If you don't set the client request token, Amazon Comprehend Medical generates one.

This field is autopopulated if not provided.

type KMSKey

string

param KMSKey

An AWS Key Management Service key to encrypt your output files. If you do not specify a key, the files are written in plain text.

type LanguageCode

string

param LanguageCode

[REQUIRED]

The language of the input documents. All documents must be in the same language.

rtype

dict

returns

Response Syntax

{
    'JobId': 'string'
}

Response Structure

  • (dict) --

    • JobId (string) --

      The identifier generated for the job. To get the status of a job, use this identifier with the StartICD10CMInferenceJob operation.

ListICD10CMInferenceJobs (new) Link ¶

Gets a list of InferICD10CM jobs that you have submitted.

See also: AWS API Documentation

Request Syntax

client.list_icd10_cm_inference_jobs(
    Filter={
        'JobName': 'string',
        'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
        'SubmitTimeBefore': datetime(2015, 1, 1),
        'SubmitTimeAfter': datetime(2015, 1, 1)
    },
    NextToken='string',
    MaxResults=123
)
type Filter

dict

param Filter

Filters the jobs that are returned. You can filter jobs based on their names, status, or the date and time that they were submitted. You can only set one filter at a time.

  • JobName (string) --

    Filters on the name of the job.

  • JobStatus (string) --

    Filters the list of jobs based on job status. Returns only jobs with the specified status.

  • SubmitTimeBefore (datetime) --

    Filters the list of jobs based on the time that the job was submitted for processing. Returns only jobs submitted before the specified time. Jobs are returned in ascending order, oldest to newest.

  • SubmitTimeAfter (datetime) --

    Filters the list of jobs based on the time that the job was submitted for processing. Returns only jobs submitted after the specified time. Jobs are returned in descending order, newest to oldest.

type NextToken

string

param NextToken

Identifies the next page of results to return.

type MaxResults

integer

param MaxResults

The maximum number of results to return in each page. The default is 100.

rtype

dict

returns

Response Syntax

{
    'ComprehendMedicalAsyncJobPropertiesList': [
        {
            'JobId': 'string',
            'JobName': 'string',
            'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
            'Message': 'string',
            'SubmitTime': datetime(2015, 1, 1),
            'EndTime': datetime(2015, 1, 1),
            'ExpirationTime': datetime(2015, 1, 1),
            'InputDataConfig': {
                'S3Bucket': 'string',
                'S3Key': 'string'
            },
            'OutputDataConfig': {
                'S3Bucket': 'string',
                'S3Key': 'string'
            },
            'LanguageCode': 'en',
            'DataAccessRoleArn': 'string',
            'ManifestFilePath': 'string',
            'KMSKey': 'string',
            'ModelVersion': 'string'
        },
    ],
    'NextToken': 'string'
}

Response Structure

  • (dict) --

    • ComprehendMedicalAsyncJobPropertiesList (list) --

      A list containing the properties of each job that is returned.

      • (dict) --

        Provides information about a detection job.

        • JobId (string) --

          The identifier assigned to the detection job.

        • JobName (string) --

          The name that you assigned to the detection job.

        • JobStatus (string) --

          The current status of the detection job. If the status is FAILED , the Message field shows the reason for the failure.

        • Message (string) --

          A description of the status of a job.

        • SubmitTime (datetime) --

          The time that the detection job was submitted for processing.

        • EndTime (datetime) --

          The time that the detection job completed.

        • ExpirationTime (datetime) --

          The date and time that job metadata is deleted from the server. Output files in your S3 bucket will not be deleted. After the metadata is deleted, the job will no longer appear in the results of the ListEntitiesDetectionV2Job or the ListPHIDetectionJobs operation.

        • InputDataConfig (dict) --

          The input data configuration that you supplied when you created the detection job.

          • S3Bucket (string) --

            The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

            Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

          • S3Key (string) --

            The path to the input data files in the S3 bucket.

        • OutputDataConfig (dict) --

          The output data configuration that you supplied when you created the detection job.

          • S3Bucket (string) --

            When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

          • S3Key (string) --

            The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

        • LanguageCode (string) --

          The language code of the input documents.

        • DataAccessRoleArn (string) --

          The Amazon Resource Name (ARN) that gives Amazon Comprehend Medical read access to your input data.

        • ManifestFilePath (string) --

          The path to the file that describes the results of a batch job.

        • KMSKey (string) --

          The AWS Key Management Service key, if any, used to encrypt the output files.

        • ModelVersion (string) --

          The version of the model used to analyze the documents. The version number looks like X.X.X. You can use this information to track the model used for a particular batch of documents.

    • NextToken (string) --

      Identifies the next page of results to return.

StopRxNormInferenceJob (new) Link ¶

Stops an InferRxNorm inference job in progress.

See also: AWS API Documentation

Request Syntax

client.stop_rx_norm_inference_job(
    JobId='string'
)
type JobId

string

param JobId

[REQUIRED]

The identifier of the job.

rtype

dict

returns

Response Syntax

{
    'JobId': 'string'
}

Response Structure

  • (dict) --

    • JobId (string) --

      The identifier generated for the job. To get the status of job, use this identifier with the DescribeRxNormInferenceJob operation.

DescribeICD10CMInferenceJob (new) Link ¶

Gets the properties associated with an InferICD10CM job. Use this operation to get the status of an inference job.

See also: AWS API Documentation

Request Syntax

client.describe_icd10_cm_inference_job(
    JobId='string'
)
type JobId

string

param JobId

[REQUIRED]

The identifier that Amazon Comprehend Medical generated for the job. The StartICD10CMInferenceJob operation returns this identifier in its response.

rtype

dict

returns

Response Syntax

{
    'ComprehendMedicalAsyncJobProperties': {
        'JobId': 'string',
        'JobName': 'string',
        'JobStatus': 'SUBMITTED'|'IN_PROGRESS'|'COMPLETED'|'PARTIAL_SUCCESS'|'FAILED'|'STOP_REQUESTED'|'STOPPED',
        'Message': 'string',
        'SubmitTime': datetime(2015, 1, 1),
        'EndTime': datetime(2015, 1, 1),
        'ExpirationTime': datetime(2015, 1, 1),
        'InputDataConfig': {
            'S3Bucket': 'string',
            'S3Key': 'string'
        },
        'OutputDataConfig': {
            'S3Bucket': 'string',
            'S3Key': 'string'
        },
        'LanguageCode': 'en',
        'DataAccessRoleArn': 'string',
        'ManifestFilePath': 'string',
        'KMSKey': 'string',
        'ModelVersion': 'string'
    }
}

Response Structure

  • (dict) --

    • ComprehendMedicalAsyncJobProperties (dict) --

      An object that contains the properties associated with a detection job.

      • JobId (string) --

        The identifier assigned to the detection job.

      • JobName (string) --

        The name that you assigned to the detection job.

      • JobStatus (string) --

        The current status of the detection job. If the status is FAILED , the Message field shows the reason for the failure.

      • Message (string) --

        A description of the status of a job.

      • SubmitTime (datetime) --

        The time that the detection job was submitted for processing.

      • EndTime (datetime) --

        The time that the detection job completed.

      • ExpirationTime (datetime) --

        The date and time that job metadata is deleted from the server. Output files in your S3 bucket will not be deleted. After the metadata is deleted, the job will no longer appear in the results of the ListEntitiesDetectionV2Job or the ListPHIDetectionJobs operation.

      • InputDataConfig (dict) --

        The input data configuration that you supplied when you created the detection job.

        • S3Bucket (string) --

          The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

          Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

        • S3Key (string) --

          The path to the input data files in the S3 bucket.

      • OutputDataConfig (dict) --

        The output data configuration that you supplied when you created the detection job.

        • S3Bucket (string) --

          When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

        • S3Key (string) --

          The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

      • LanguageCode (string) --

        The language code of the input documents.

      • DataAccessRoleArn (string) --

        The Amazon Resource Name (ARN) that gives Amazon Comprehend Medical read access to your input data.

      • ManifestFilePath (string) --

        The path to the file that describes the results of a batch job.

      • KMSKey (string) --

        The AWS Key Management Service key, if any, used to encrypt the output files.

      • ModelVersion (string) --

        The version of the model used to analyze the documents. The version number looks like X.X.X. You can use this information to track the model used for a particular batch of documents.

StartRxNormInferenceJob (new) Link ¶

Starts an asynchronous job to detect medication entities and link them to the RxNorm ontology. Use the DescribeRxNormInferenceJob operation to track the status of a job.

See also: AWS API Documentation

Request Syntax

client.start_rx_norm_inference_job(
    InputDataConfig={
        'S3Bucket': 'string',
        'S3Key': 'string'
    },
    OutputDataConfig={
        'S3Bucket': 'string',
        'S3Key': 'string'
    },
    DataAccessRoleArn='string',
    JobName='string',
    ClientRequestToken='string',
    KMSKey='string',
    LanguageCode='en'
)
type InputDataConfig

dict

param InputDataConfig

[REQUIRED]

Specifies the format and location of the input data for the job.

  • S3Bucket (string) -- [REQUIRED]

    The URI of the S3 bucket that contains the input data. The bucket must be in the same region as the API endpoint that you are calling.

    Each file in the document collection must be less than 40 KB. You can store a maximum of 30 GB in the bucket.

  • S3Key (string) --

    The path to the input data files in the S3 bucket.

type OutputDataConfig

dict

param OutputDataConfig

[REQUIRED]

Specifies where to send the output files.

  • S3Bucket (string) -- [REQUIRED]

    When you use the OutputDataConfig object with asynchronous operations, you specify the Amazon S3 location where you want to write the output data. The URI must be in the same region as the API endpoint that you are calling. The location is used as the prefix for the actual location of the output.

  • S3Key (string) --

    The path to the output data files in the S3 bucket. Amazon Comprehend Medical creates an output directory using the job ID so that the output from one job does not overwrite the output of another.

type DataAccessRoleArn

string

param DataAccessRoleArn

[REQUIRED]

The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role that grants Amazon Comprehend Medical read access to your input data. For more information, see Role-Based Permissions Required for Asynchronous Operations.

type JobName

string

param JobName

The identifier of the job.

type ClientRequestToken

string

param ClientRequestToken

A unique identifier for the request. If you don't set the client request token, Amazon Comprehend Medical generates one.

This field is autopopulated if not provided.

type KMSKey

string

param KMSKey

An AWS Key Management Service key to encrypt your output files. If you do not specify a key, the files are written in plain text.

type LanguageCode

string

param LanguageCode

[REQUIRED]

The language of the input documents. All documents must be in the same language.

rtype

dict

returns

Response Syntax

{
    'JobId': 'string'
}

Response Structure

  • (dict) --

    • JobId (string) --

      The identifier of the job.