FinSpace Public API

2021/12/20 - FinSpace Public API - 5 updated api methods

Changes  Make dataset description optional and allow s3 export for dataviews

CreateDataView (updated) Link ¶
Changes (request)
{'destinationTypeParams': {'s3DestinationExportFileFormat': 'PARQUET | '
                                                            'DELIMITED_TEXT',
                           's3DestinationExportFileFormatOptions': {'string': 'string'}}}

Creates a Dataview for a Dataset.

See also: AWS API Documentation

Request Syntax

client.create_data_view(
    clientToken='string',
    datasetId='string',
    autoUpdate=True|False,
    sortColumns=[
        'string',
    ],
    partitionColumns=[
        'string',
    ],
    asOfTimestamp=123,
    destinationTypeParams={
        'destinationType': 'string',
        's3DestinationExportFileFormat': 'PARQUET'|'DELIMITED_TEXT',
        's3DestinationExportFileFormatOptions': {
            'string': 'string'
        }
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique Dataset identifier that is used to create a Dataview.

type autoUpdate

boolean

param autoUpdate

Flag to indicate Dataview should be updated automatically.

type sortColumns

list

param sortColumns

Columns to be used for sorting the data.

  • (string) -- 1 - 255 character String

type partitionColumns

list

param partitionColumns

Ordered set of column names used to partition data.

  • (string) -- 1 - 255 character String

type asOfTimestamp

integer

param asOfTimestamp

Beginning time to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

type destinationTypeParams

dict

param destinationTypeParams

[REQUIRED]

Options that define the destination type for the Dataview.

  • destinationType (string) -- [REQUIRED]

    Destination type for a Dataview.

    • GLUE_TABLE - Glue table destination type.

  • s3DestinationExportFileFormat (string) -- Data View Export File Format

  • s3DestinationExportFileFormatOptions (dict) --

    • (string) --

      • (string) --

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string',
    'dataViewId': 'string'
}

Response Structure

  • (dict) -- Response for creating a data view.

    • datasetId (string) --

      The unique identifier of the Dataset used for the Dataview.

    • dataViewId (string) --

      The unique identifier for the created Dataview.

GetChangeset (updated) Link ¶
Changes (response)
{'activeFromTimestamp': 'long'}

Get information about a Changeset.

See also: AWS API Documentation

Request Syntax

client.get_changeset(
    datasetId='string',
    changesetId='string'
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset where the Changeset is created.

type changesetId

string

param changesetId

[REQUIRED]

The unique identifier of the Changeset for which to get data.

rtype

dict

returns

Response Syntax

{
    'changesetId': 'string',
    'changesetArn': 'string',
    'datasetId': 'string',
    'changeType': 'REPLACE'|'APPEND'|'MODIFY',
    'sourceParams': {
        'string': 'string'
    },
    'formatParams': {
        'string': 'string'
    },
    'createTime': 123,
    'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'activeUntilTimestamp': 123,
    'activeFromTimestamp': 123,
    'updatesChangesetId': 'string',
    'updatedByChangesetId': 'string'
}

Response Structure

  • (dict) -- The response from a describe changeset operation

    • changesetId (string) --

      The unique identifier for a Changeset.

    • changesetArn (string) --

      The ARN identifier of the Changeset.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset where the Changeset is created.

    • changeType (string) --

      Type that indicates how a Changeset is applied to a Dataset.

      • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

      • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

      • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

    • sourceParams (dict) --

      Options that define the location of the data being ingested.

      • (string) --

        • (string) --

    • formatParams (dict) --

      Structure of the source file(s).

      • (string) --

        • (string) --

    • createTime (integer) --

      The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • status (string) --

      The status of Changeset creation operation.

    • errorInfo (dict) --

      The structure with error messages.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION -The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • activeUntilTimestamp (integer) --

      Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • activeFromTimestamp (integer) -- Milliseconds since UTC epoch

    • updatesChangesetId (string) --

      The unique identifier of the Changeset that is being updated.

    • updatedByChangesetId (string) --

      The unique identifier of the updated Changeset.

GetDataView (updated) Link ¶
Changes (response)
{'destinationTypeParams': {'s3DestinationExportFileFormat': 'PARQUET | '
                                                            'DELIMITED_TEXT',
                           's3DestinationExportFileFormatOptions': {'string': 'string'}}}

Gets information about a Dataview.

See also: AWS API Documentation

Request Syntax

client.get_data_view(
    dataViewId='string',
    datasetId='string'
)
type dataViewId

string

param dataViewId

[REQUIRED]

The unique identifier for the Dataview.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the Dataset used in the Dataview.

rtype

dict

returns

Response Syntax

{
    'autoUpdate': True|False,
    'partitionColumns': [
        'string',
    ],
    'datasetId': 'string',
    'asOfTimestamp': 123,
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'lastModifiedTime': 123,
    'createTime': 123,
    'sortColumns': [
        'string',
    ],
    'dataViewId': 'string',
    'dataViewArn': 'string',
    'destinationTypeParams': {
        'destinationType': 'string',
        's3DestinationExportFileFormat': 'PARQUET'|'DELIMITED_TEXT',
        's3DestinationExportFileFormatOptions': {
            'string': 'string'
        }
    },
    'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED'
}

Response Structure

  • (dict) -- Response from retrieving a dataview, which includes details on the target database and table name

    • autoUpdate (boolean) --

      Flag to indicate Dataview should be updated automatically.

    • partitionColumns (list) --

      Ordered set of column names used to partition data.

      • (string) -- 1 - 255 character String

    • datasetId (string) --

      The unique identifier for the Dataset used in the Dataview.

    • asOfTimestamp (integer) --

      Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • errorInfo (dict) --

      Information about an error that occurred for the Dataview.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION -The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • lastModifiedTime (integer) --

      The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • createTime (integer) --

      The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • sortColumns (list) --

      Columns to be used for sorting the data.

      • (string) -- 1 - 255 character String

    • dataViewId (string) --

      The unique identifier for the Dataview.

    • dataViewArn (string) --

      The ARN identifier of the Dataview.

    • destinationTypeParams (dict) --

      Options that define the destination type for the Dataview.

      • destinationType (string) --

        Destination type for a Dataview.

        • GLUE_TABLE - Glue table destination type.

      • s3DestinationExportFileFormat (string) -- Data View Export File Format

      • s3DestinationExportFileFormatOptions (dict) --

        • (string) --

          • (string) --

    • status (string) --

      The status of a Dataview creation.

      • RUNNING - Dataview creation is running.

      • STARTING - Dataview creation is starting.

      • FAILED - Dataview creation has failed.

      • CANCELLED - Dataview creation has been cancelled.

      • TIMEOUT - Dataview creation has timed out.

      • SUCCESS - Dataview creation has succeeded.

      • PENDING - Dataview creation is pending.

      • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.

ListChangesets (updated) Link ¶
Changes (response)
{'changesets': {'activeFromTimestamp': 'long'}}

Lists the FinSpace Changesets for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_changesets(
    datasetId='string',
    maxResults=123,
    nextToken='string'
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset to which the Changeset belongs.

type maxResults

integer

param maxResults

The maximum number of results per page.

type nextToken

string

param nextToken

A token indicating where a results page should begin.

rtype

dict

returns

Response Syntax

{
    'changesets': [
        {
            'changesetId': 'string',
            'changesetArn': 'string',
            'datasetId': 'string',
            'changeType': 'REPLACE'|'APPEND'|'MODIFY',
            'sourceParams': {
                'string': 'string'
            },
            'formatParams': {
                'string': 'string'
            },
            'createTime': 123,
            'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'activeUntilTimestamp': 123,
            'activeFromTimestamp': 123,
            'updatesChangesetId': 'string',
            'updatedByChangesetId': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) -- Response to ListChangesetsResponse. This returns a list of dataset changesets that match the query criteria.

    • changesets (list) --

      List of Changesets found.

      • (dict) --

        A Changeset is unit of data in a Dataset.

        • changesetId (string) --

          The unique identifier for a Changeset.

        • changesetArn (string) --

          The ARN identifier of the Changeset.

        • datasetId (string) --

          The unique identifier for the FinSpace Dataset in which the Changeset is created.

        • changeType (string) --

          Type that indicates how a Changeset is applied to a Dataset.

          • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

          • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

          • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

        • sourceParams (dict) --

          Options that define the location of the data being ingested.

          • (string) --

            • (string) --

        • formatParams (dict) --

          Options that define the structure of the source file(s).

          • (string) --

            • (string) --

        • createTime (integer) --

          The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • status (string) --

          Status of the Changeset ingestion.

          • PENDING - Changeset is pending creation.

          • FAILED - Changeset creation has failed.

          • SUCCESS - Changeset creation has succeeded.

          • RUNNING - Changeset creation is running.

          • STOP_REQUESTED - User requested Changeset creation to stop.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION -The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • activeUntilTimestamp (integer) --

          Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • activeFromTimestamp (integer) -- Milliseconds since UTC epoch

        • updatesChangesetId (string) --

          The unique identifier of the Changeset that is updated.

        • updatedByChangesetId (string) --

          The unique identifier of the updated Changeset.

    • nextToken (string) --

      A token indicating where a results page should begin.

ListDataViews (updated) Link ¶
Changes (response)
{'dataViews': {'destinationTypeProperties': {'s3DestinationExportFileFormat': 'PARQUET '
                                                                              '| '
                                                                              'DELIMITED_TEXT',
                                             's3DestinationExportFileFormatOptions': {'string': 'string'}}}}

Lists all available Dataviews for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_data_views(
    datasetId='string',
    nextToken='string',
    maxResults=123
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier of the Dataset for which to retrieve Dataviews.

type nextToken

string

param nextToken

A token indicating where a results page should begin.

type maxResults

integer

param maxResults

The maximum number of results per page.

rtype

dict

returns

Response Syntax

{
    'nextToken': 'string',
    'dataViews': [
        {
            'dataViewId': 'string',
            'dataViewArn': 'string',
            'datasetId': 'string',
            'asOfTimestamp': 123,
            'partitionColumns': [
                'string',
            ],
            'sortColumns': [
                'string',
            ],
            'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'destinationTypeProperties': {
                'destinationType': 'string',
                's3DestinationExportFileFormat': 'PARQUET'|'DELIMITED_TEXT',
                's3DestinationExportFileFormatOptions': {
                    'string': 'string'
                }
            },
            'autoUpdate': True|False,
            'createTime': 123,
            'lastModifiedTime': 123
        },
    ]
}

Response Structure

  • (dict) --

    • nextToken (string) --

      A token indicating where a results page should begin.

    • dataViews (list) --

      A list of Dataviews.

      • (dict) --

        Structure for the summary of a Dataview.

        • dataViewId (string) --

          The unique identifier for the Dataview.

        • dataViewArn (string) --

          The ARN identifier of the Dataview.

        • datasetId (string) --

          Th unique identifier for the Dataview Dataset.

        • asOfTimestamp (integer) --

          Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • partitionColumns (list) --

          Ordered set of column names used to partition data.

          • (string) -- 1 - 255 character String

        • sortColumns (list) --

          Columns to be used for sorting the data.

          • (string) -- 1 - 255 character String

        • status (string) --

          The status of a Dataview creation.

          • RUNNING - Dataview creation is running.

          • STARTING - Dataview creation is starting.

          • FAILED - Dataview creation has failed.

          • CANCELLED - Dataview creation has been cancelled.

          • TIMEOUT - Dataview creation has timed out.

          • SUCCESS - Dataview creation has succeeded.

          • PENDING - Dataview creation is pending.

          • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION -The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • destinationTypeProperties (dict) --

          Information about the Dataview destination.

          • destinationType (string) --

            Destination type for a Dataview.

            • GLUE_TABLE - Glue table destination type.

          • s3DestinationExportFileFormat (string) -- Data View Export File Format

          • s3DestinationExportFileFormatOptions (dict) --

            • (string) --

              • (string) --

        • autoUpdate (boolean) --

          The flag to indicate Dataview should be updated automatically.

        • createTime (integer) --

          The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTime (integer) --

          The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.