AWS API Changes

2023/10/02 - Amazon Bedrock Runtime - 1 updated api methods

Changes Provisioned throughput feature with Amazon and third-party base models, and update validators for model identifier and taggable resource ARNs.

InvokeModelWithResponseStream (updated)

Link ¶
Changes (response)

{'body': {'modelTimeoutException': {'message': 'string'}}}

Invoke the specified Bedrock model to run inference using the input provided. Return the response in a stream.

For more information, see Run inference in the Bedrock User Guide.

For an example request and response, see Examples (after the Errors section).

See also: AWS API Documentation

Request Syntax

client.invoke_model_with_response_stream(
    accept='string',
    body=b'bytes'|file,
    contentType='string',
    modelId='string'
)

type accept:

string

param accept:

The desired MIME type of the inference body in the response. The default value is application/json.

type body:

bytes or seekable file-like object

param body:

[REQUIRED]

Inference input in the format specified by the content-type. To see the format and content of this field for different models, refer to Inference parameters.

type contentType:

string

param contentType:

The MIME type of the input data in the request. The default value is application/json.

type modelId:

string

param modelId:

[REQUIRED]

Id of the model to invoke using the streaming request.

rtype:

dict

returns:

The response of this operation contains an :class:`.EventStream` member. When iterated the :class:`.EventStream` will yield events based on the structure below, where only one of the top level keys will be present for any given event.

Response Syntax

{
    'body': EventStream({
        'chunk': {
            'bytes': b'bytes'
        },
        'internalServerException': {
            'message': 'string'
        },
        'modelStreamErrorException': {
            'message': 'string',
            'originalMessage': 'string',
            'originalStatusCode': 123
        },
        'modelTimeoutException': {
            'message': 'string'
        },
        'throttlingException': {
            'message': 'string'
        },
        'validationException': {
            'message': 'string'
        }
    }),
    'contentType': 'string'
}

Response Structure

(dict) --
- body (:class:`.EventStream`) --
  
  Inference response from the model in the format specified by Content-Type. To see the format and content of this field for different models, refer to Inference parameters.
  - chunk (dict) --
    
    Content included in the response.
    - bytes (bytes) --
      
      Base64-encoded bytes of payload data.
  - internalServerException (dict) --
    
    An internal server error occurred. Retry your request.
    - message (string) --
  - modelStreamErrorException (dict) --
    
    An error occurred while streaming the response.
    - message (string) --
    - originalMessage (string) --
      
      The original message.
    - originalStatusCode (integer) --
      
      The original status code.
  - modelTimeoutException (dict) --
    
    The request took too long to process. Processing time exceeded the model timeout length.
    - message (string) --
  - throttlingException (dict) --
    
    The number of requests exceeds the limit. Resubmit your request later.
    - message (string) --
  - validationException (dict) --
    
    Input validation failed. Check your request parameters and retry the request.
    - message (string) --
- contentType (string) --
  
  The MIME type of the inference result.